Workload-aware column inprints
    21.
    发明授权

    公开(公告)号:US12169493B2

    公开(公告)日:2024-12-17

    申请号:US18244021

    申请日:2023-09-08

    Applicant: SAP SE

    Abstract: Techniques and solutions are described that can be used to determine data subsets of a data set that may be responsive to a query. In one aspect, imprints are created for the data subsets, where an imprint indicates whether, for a plurality of bins, a data subset contains values within respective bins. The bins are defined at least in part using data for a set of queries. In another aspect, at least one bin of a plurality of bins is designated to be associated with values of the data set satisfying a threshold frequency. In a further aspect, imprints are created for a plurality of bins, where an imprint indicates whether the plurality of data subsets comprise a value for a given bin. A query imprint can be compared with the data subset imprints or bin imprints to determine data subsets that may be responsive to an associated query.

    WORKLOAD-AWARE COLUMN INPRINTS
    22.
    发明公开

    公开(公告)号:US20230418824A1

    公开(公告)日:2023-12-28

    申请号:US18244021

    申请日:2023-09-08

    Applicant: SAP SE

    CPC classification number: G06F16/24554 G06F16/2237 G06F16/248 G06F9/30029

    Abstract: Techniques and solutions are described that can be used to determine data subsets of a data set that may be responsive to a query. In one aspect, imprints are created for the data subsets, where an imprint indicates whether, for a plurality of bins, a data subset contains values within respective bins. The bins are defined at least in part using data for a set of queries. In another aspect, at least one bin of a plurality of bins is designated to be associated with values of the data set satisfying a threshold frequency. In a further aspect, imprints are created for a plurality of bins, where an imprint indicates whether the plurality of data subsets comprise a value for a given bin. A query imprint can be compared with the data subset imprints or bin imprints to determine data subsets that may be responsive to an associated query.

    FINGERPRINTS FOR COMPRESSED COLUMNAR DATA SEARCH

    公开(公告)号:US20230268930A1

    公开(公告)日:2023-08-24

    申请号:US18310251

    申请日:2023-05-01

    Applicant: SAP SE

    Abstract: The present disclosure involves systems, software, and computer implemented methods for compressed columnar data search using fingerprints. One example method includes compressing columnar data that includes dividing the columnar data into multiple data blocks and generating a fingerprint for each data block, storing the compressed columnar data and the generated fingerprints in an in-memory database, receiving a query for the columnar data, for each in-memory data block stored in the in-memory database, determining whether the in-memory data block satisfies the query and in response to a determination that the in-memory data block does not satisfy the query, pruning the in-memory data block from the multiple data blocks to generate an unpruned set of data blocks, decompressing the unpruned set of data blocks, and performing a query search on the decompressed unpruned set of data blocks for the received query.

    Constraint Data Statistics
    24.
    发明申请

    公开(公告)号:US20220222244A1

    公开(公告)日:2022-07-14

    申请号:US17711608

    申请日:2022-04-01

    Applicant: SAP SE

    Abstract: Disclosed herein are system, method, and computer program product embodiments for using a data statistic as a dynamic data integrity constraint. An embodiment operates by defining a data statistic for a column or a set of columns of a partition of a plurality of partitions of a database table. The embodiment creates a constraint data statistics object based on the data statistic. The embodiment receives a query for the database table. The embodiment determines the constraint data statistics object is consistent with a data state of the partition. The embodiment derives an implied constraint based on the constraint data statistics object. The embodiment processes the query for the partition based on the implied constraint.

    DATA COMPRESSION TECHNIQUES
    25.
    发明申请

    公开(公告)号:US20210367613A1

    公开(公告)日:2021-11-25

    申请号:US16878419

    申请日:2020-05-19

    Applicant: SAP SE

    Abstract: Techniques and solutions are described for compressing data and facilitating access to compressed data. Compression can be applied to proper data subsets of a data set, such as to columns of a table. Using various methods, the proper data subsets can be evaluated to be included in a group of proper data subsets to be compressed using a first compression technique, where unselected proper data subsets are not compressed using the first compression technique. Data in the data set can be reordered based on a reordering sequence for the proper data subsets. Reordering data in the data set can improve compression when at least a portion of the proper data subsets are compressed. A data structure is provided that facilitates accessing specified data stored in a compressed format.

    Memory optimization system for inverted indexes

    公开(公告)号:US10885074B2

    公开(公告)日:2021-01-05

    申请号:US16059665

    申请日:2018-08-09

    Applicant: SAP SE

    Abstract: Disclosed herein are system, method, and computer program product embodiments for providing a memory optimization system for inverted indexes. An embodiment operates by determining a value identifier corresponding to a value to be searched for in a database based on a query. From a lookup table, a prefix of an offset corresponding to a location in a compressed posting list that identifies row locations where the value is found in the database is retrieved. From a directory page, a suffix of the offset corresponding to the location is retrieved. The offset is generated based on the prefix and the suffix. From the posting list, row identifiers corresponding to the generated offset are retrieved. The retrieved row identifiers are returned responsive to the query.

    Paged inverted index
    29.
    发明授权

    公开(公告)号:US10140326B2

    公开(公告)日:2018-11-27

    申请号:US14954736

    申请日:2015-11-30

    Applicant: SAP SE

    Abstract: Disclosed herein are system and method embodiments for generating a paged inverted index. An embodiment is generated by storing a first data structure and the second data structure in a plurality of pages, where the plurality of pages are stored in the one or more memories. The first data structure is stored in the plurality of pages and includes a plurality of value identifiers, where a value identifier corresponds to an offset. The second data structure stored in the plurality of pages includes a plurality of row positions, wherein a row position is at a location that corresponds to the offset in the first data structure and identifies a position of row in a table that stores data associated with the value ID.

Patent Agency Ranking