ADAPTIVE COMPRESSION OPTIMIZATION FOR EFFECTIVE PRUNING

    公开(公告)号:US20230025952A1

    公开(公告)日:2023-01-26

    申请号:US17933903

    申请日:2022-09-21

    Applicant: SAP SE

    Abstract: A database management system is described that can encode data to generate a plurality of data vectors. The database management system can perform the encoding by using a dictionary. The database management system can adaptively reorder the plurality of data vectors to prepare for compression of the plurality of data vectors. During a forward pass of the adaptive reordering, most frequent values of a data vector of the plurality of data vectors can be moved-up in the data vector. During a backward pass of the adaptive reordering, content within a rest range of a plurality of rest ranges can be rearranged within the plurality of data vectors according to frequencies of the content. The reordering according to frequency can further sort the rest range by value. Related apparatuses, systems, methods, techniques, computer programmable products, computer readable media, and articles are also described.

    Implementation of data access metrics for automated physical database design

    公开(公告)号:US11550762B2

    公开(公告)日:2023-01-10

    申请号:US17316901

    申请日:2021-05-11

    Applicant: SAP SE

    Abstract: The present disclosure involves systems, software, and computer implemented methods for improved design and implementation of data access metrics for automated physical database design. An example method includes identifying a database workload for which index advisor access counters are to be tracked. Each SQL statement in the database workload is executed. For each SQL statement, attribute sets are determined for which a selection predicate filters a result for an SQL statement. An output cardinality of each selection predicate is determined. A logarithmic counter for an attribute set corresponding to the selection predicate is determined based on the output cardinality of the selection predicate. The determined logarithmic counter is incremented. Respective values for logarithmic counters of the determined attributes are provided to an index advisor. The index advisor determines attribute sets for which to propose an index based on the logarithmic counters of the respective attribute sets.

    Hybrid data storage and load system with ROWID lookup

    公开(公告)号:US11514028B2

    公开(公告)日:2022-11-29

    申请号:US16863834

    申请日:2020-04-30

    Applicant: SAP SE

    Abstract: System, method, and various embodiments for providing a hybrid data storage and load system are described herein. An embodiment operates by receiving a lookup request comprising a row identifier (ROWID) corresponding to a row of a table of a database. Which of a first set of pages corresponds to a sorted position of the ROWID is identified. The identified page of the first set of pages is loaded from the disk storage to the memory storage. A sorted row position of the ROWID is identified from a memory block of the loaded page from the first set of pages. One of a second set of pages that includes the sorted row position is identified. The identified one of the second set of pages is loaded from the disk storage into memory. The loaded identified one of the second set of pages is searched for the unsorted position of the ROWID.

    Adaptive compression optimization for effective pruning

    公开(公告)号:US11500842B2

    公开(公告)日:2022-11-15

    申请号:US16220688

    申请日:2018-12-14

    Applicant: SAP SE

    Abstract: A database management system is described that can encode data to generate a plurality of data vectors. The database management system can perform the encoding by using a dictionary. The database management system can adaptively reorder the plurality of data vectors to prepare for compression of the plurality of data vectors. During a forward pass of the adaptive reordering, most frequent values of a data vector of the plurality of data vectors can be moved-up in the data vector. During a backward pass of the adaptive reordering, content within a rest range of a plurality of rest ranges can be rearranged within the plurality of data vectors according to frequencies of the content. The reordering according to frequency can further sort the rest range by value. Related apparatuses, systems, methods, techniques, computer programmable products, computer readable media, and articles are also described.

    DESIGN AND IMPLEMENTATION OF DATA ACCESS METRICS FOR AUTOMATED PHYSICAL DATABASE DESIGN

    公开(公告)号:US20220269655A1

    公开(公告)日:2022-08-25

    申请号:US17316901

    申请日:2021-05-11

    Applicant: SAP SE

    Abstract: The present disclosure involves systems, software, and computer implemented methods for improved design and implementation of data access metrics for automated physical database design. An example method includes identifying a database workload for which index advisor access counters are to be tracked. Each SQL statement in the database workload is executed. For each SQL statement, attribute sets are determined for which a selection predicate filters a result for an SQL statement. An output cardinality of each selection predicate is determined. A logarithmic counter for an attribute set corresponding to the selection predicate is determined based on the output cardinality of the selection predicate. The determined logarithmic counter is incremented. Respective values for logarithmic counters of the determined attributes are provided to an index advisor. The index advisor determines attribute sets for which to propose an index based on the logarithmic counters of the respective attribute sets.

    Space efficient vector for columnar data storage

    公开(公告)号:US11386082B2

    公开(公告)日:2022-07-12

    申请号:US16893697

    申请日:2020-06-05

    Applicant: SAP SE

    Abstract: Disclosed herein are system, method, and computer program product embodiments for providing paged and compressed storage for column data, while maintaining existing access mechanisms for the data. In order to reduce an in-memory footprint for column data, columns may be stored in pageable format using page chains, and only those pages of the column data needed to resolve a query will be placed in memory, and evicted from memory when no longer needed. In order to further reduce the footprint for these columns, compression can be applied, and the compressed column data stored in the same pageable format using page chains. The compressed data includes a plurality of vectors, each of which is converted into pages and stored on the page chain with the others so that they can be efficiently retrieved during database retrieval operations.

    Hash composite indexes
    30.
    发明授权

    公开(公告)号:US11341120B2

    公开(公告)日:2022-05-24

    申请号:US15931179

    申请日:2020-05-13

    Applicant: SAP SE

    Abstract: Disclosed herein are system, method, and computer program product embodiments for creating a dictionary-free index. An embodiment operates by a database maintaining database table comprises a column and a row. The database creates a concatenated string index comprising a concatenated string value based on the database column's row. The database also creates a hash value list having a hash value based on the concatenation string index's concatenated string value. Thereafter, the database creates a directory having a pointer to (i) a row position of the hash value list or (ii) a row position of a collision list having the row position of the hash value in the hash value list. The database also creates a collision list containing the row positions of the hash values occurring multiple times in the hash value list. The index for the database table includes the hash value list, the directory, and the collision list.

Patent Agency Ranking