-
公开(公告)号:US12204517B2
公开(公告)日:2025-01-21
申请号:US17933903
申请日:2022-09-21
Applicant: SAP SE
Inventor: Junze Bao , Norman May , Robert Schulze , Christian Lemke , Wei Zhou
IPC: G06F16/22 , G06F7/36 , G06F16/2455 , H03M7/30
Abstract: A database management system is described that can encode data to generate a plurality of data vectors. The database management system can perform the encoding by using a dictionary. The database management system can adaptively reorder the plurality of data vectors to prepare for compression of the plurality of data vectors. During a forward pass of the adaptive reordering, most frequent values of a data vector of the plurality of data vectors can be moved-up in the data vector. During a backward pass of the adaptive reordering, content within a rest range of a plurality of rest ranges can be rearranged within the plurality of data vectors according to frequencies of the content. The reordering according to frequency can further sort the rest range by value. Related apparatuses, systems, methods, techniques, computer programmable products, computer readable media, and articles are also described.
-
公开(公告)号:US11263196B2
公开(公告)日:2022-03-01
申请号:US16358598
申请日:2019-03-19
Applicant: SAP SE
Inventor: Gordon Gaumnitz , Robert Schulze , Lars Dannecker , Ivan T. Bowman , Daniel James Farrar
IPC: G06F16/23 , G06F16/951 , G06F16/22
Abstract: A system and method for faster access for compressed time series data. A set of blocks are generated based on a table stored in a database of the data platform. The table stores data associated with multiple sources of data provided as consecutive values, each block containing index vectors having a range of the consecutive values. A block index is generated for each block having a field start vector representing a starting position of the block relative to the range of consecutive values, and a starting value vector representing a value of the block at the starting position. The field start vector of the block index is accessed to obtain the starting position of a field corresponding to a first block and to the range of the consecutive values of the first block. The starting value vector is then determined from the block index to determine an end and a length of the field of the first block.
-
公开(公告)号:US11151126B2
公开(公告)日:2021-10-19
申请号:US16866766
申请日:2020-05-05
Applicant: SAP SE
Inventor: Colin Florendo , Mihnea Andrei , Amit Pathak , Robert Schulze , Adrian Dragusanu , Reza Sherkat , Sebastian Seifert , Christian Lemke
IPC: G06F16/22 , G06F16/23 , G06F12/02 , G06F3/06 , G06F12/0882 , G06F16/2455 , G06F12/0875 , G06F9/48
Abstract: Disclosed herein are system, method, and computer-program product embodiments for generating a paged and in-memory representation of a database object. An embodiment operates by maintaining in-memory and paged form primitives unique to the database object or a substructure thereof in a database such that the in-memory and paged form primitives are capable of providing the in-memory and paged representations of the database objects, respectively. Thereafter, a load configuration for the database object is determined. Based on the load configuration, the in-memory and/or paged representations of the database object are generated using the in-memory form primitive or the paged form primitive unique to the database object, respectively. Subsequently, the in-memory and/or paged representations of the database object are stored in the database.
-
公开(公告)号:US10762071B2
公开(公告)日:2020-09-01
申请号:US15363274
申请日:2016-11-29
Applicant: SAP SE
Inventor: Robert Schulze , Thomas Peh
Abstract: Innovations in performing sort operations for dictionary-compressed values of columns in a column-store database using value identifiers (“IDs”) are described. For example, a database system includes a data store and an execution engine. The data store stores values at positions of a column A dictionary maps distinct values to corresponding value IDs. An inverted index stores, for each of the corresponding value IDs, a list of those of the positions that contain the associated distinct value. The execution engine processes a request to sort values at an input set of the positions and identify an output set of the positions for sorted values. In particular, the execution engine iterates through positions stored in the lists of the inverted index. For a given position, the execution engine checks if the given position is one of the input set and, if so, adds the given position to the output set.
-
公开(公告)号:US10671625B2
公开(公告)日:2020-06-02
申请号:US15416729
申请日:2017-01-26
Applicant: SAP SE
Inventor: Robert Schulze , Oliver Horn , Klaus Otto Mueller
IPC: G06F16/24 , G06F16/248 , G06F16/25 , G06F16/2455 , G06F16/2453
Abstract: In some example embodiments, a system is provided for executing a primitive call that implements a query operation. The system may include a data processor and a memory. The memory may store instructions that result in operations when executed by the data processor. The operations may include: executing, at an data management engine, the primitive call by at least performing a first operation with respect to a value identifier set, the value identifier set including one or more value identifiers, and the primitive call being configured to access a database storing a plurality of value identifiers; and generating, based at least on a result of the first operation, a result for the primitive call. Related methods and articles of manufacture, including computer program products, are also described.
-
公开(公告)号:US20180210926A1
公开(公告)日:2018-07-26
申请号:US15416729
申请日:2017-01-26
Applicant: SAP SE
Inventor: Robert Schulze , Oliver Horn , Klaus Otto Mueller
IPC: G06F17/30
Abstract: In some example embodiments, a system is provided for executing a primitive call that implements a query operation. The system may include a data processor and a memory. The memory may store instructions that result in operations when executed by the data processor. The operations may include: executing, at an data management engine, the primitive call by at least performing a first operation with respect to a value identifier set, the value identifier set including one or more value identifiers, and the primitive call being configured to access a database storing a plurality of value identifiers; and generating, based at least on a result of the first operation, a result for the primitive call. Related methods and articles of manufacture, including computer program products, are also described.
-
7.
公开(公告)号:US11914589B2
公开(公告)日:2024-02-27
申请号:US16805624
申请日:2020-02-28
Applicant: SAP SE
Inventor: Oliver Horn , Thomas Legler , Robert Schulze
IPC: G06F16/24 , G06F16/2453 , G06F16/2455 , G06F16/22
CPC classification number: G06F16/24537 , G06F16/221 , G06F16/24556 , G06F16/24557
Abstract: In some example embodiments, there may be provided a method including detecting a query operator of a predetermined type requiring a sorting or a scanning responsive to the query operator at a column-oriented database; determining a sorted dictionary at the column-oriented database is eligible for direct access by at least checking that the column-oriented database provides sorted dictionary access and that dictionary collation and a collation of the query operator are the same; when the sorted dictionary is eligible, reading directly from the sorted dictionary a batch of sorted values; and processing, based on the batch sorted values read from the sorted dictionary, the query operator.
-
公开(公告)号:US11789920B1
公开(公告)日:2023-10-17
申请号:US17705728
申请日:2022-03-28
Applicant: SAP SE
Inventor: Michael Brendle , Norman May , Robert Schulze , Alexander Boehm , Michael Grossniklaus , Guido Moerkotte
CPC classification number: G06F16/2282 , G06F11/3414 , G06F16/213 , G06F16/256
Abstract: According to some embodiments, methods and systems may be associated with a cloud computing environment. A workload prediction framework may receive observed workload information associated with a database in the cloud computing environment (e.g., a DataBase as a Service (“DBaaS”)). Based on the observed workload information, a Statement Arrival Rate (“SAR”) prediction may be generated. In addition, a host variable assignment prediction may be generated based on the observed workload information. The workload prediction framework may then use the SAR prediction and the host variable assignment prediction to automatically create a workload prediction for the database. A physical database design advisor (e.g., a table partitioning advisor) may receive the workload prediction and, responsive to the workload prediction, automatically generate a recommended physical layout for the database (e.g., using a cost model, the current physical layout, and an objective function).
-
公开(公告)号:US20220269653A1
公开(公告)日:2022-08-25
申请号:US17324914
申请日:2021-05-19
Applicant: SAP SE
Inventor: Michael Brendle , Norman May , Robert Schulze , Alexander Boehm , Guido Moerkotte , Michael Grossniklaus
Abstract: The present disclosure involves systems, software, and computer implemented methods for improved design and implementation of data access metrics for automated physical database design. An example method includes identifying a database workload for which index advisor access counters are to be tracked. Each SQL statement in the database workload is executed. For each SQL statement, attribute sets are determined for which a selection predicate filters a result for an SQL statement. An output cardinality of each selection predicate is determined. A logarithmic counter for an attribute set corresponding to the selection predicate is determined based on the output cardinality of the selection predicate. The determined logarithmic counter is incremented. Respective values for logarithmic counters of the determined attributes are provided to an index advisor. The index advisor determines attribute sets for which to propose an index based on the logarithmic counters of the respective attribute sets.
-
公开(公告)号:US11386104B2
公开(公告)日:2022-07-12
申请号:US16661993
申请日:2019-10-23
Applicant: SAP SE
Inventor: Gordon Gaumnitz , Lars Dannecker , Robert Schulze , Ivan T. Bowman , Daniel James Farrar
IPC: G06F16/2458 , G06F16/22
Abstract: Disclosed is a system and method for improving database memory consumption and performance using compression of time stamp columns. A number of time stamps of a time series is received. The time stamps have a start time, and are separated by an equal increment of time that defines an interval. The start time and interval are stored in a dictionary of a column store of a database. An index is generated in the column store of the database, the index having a number of index vectors. Using the index vectors, each time stamp of the number of time stamps can be calculated from the start time stored in the dictionary and the position in the time series based on the interval stored in the dictionary.
-
-
-
-
-
-
-
-
-