Constraint data statistics
    31.
    发明授权

    公开(公告)号:US11734258B2

    公开(公告)日:2023-08-22

    申请号:US17711608

    申请日:2022-04-01

    Applicant: SAP SE

    Abstract: Disclosed herein are system, method, and computer program product embodiments for using a data statistic as a dynamic data integrity constraint. An embodiment operates by defining a data statistic for a column or a set of columns of a partition of a plurality of partitions of a database table. The embodiment creates a constraint data statistics object based on the data statistic. The embodiment receives a query for the database table. The embodiment determines the constraint data statistics object is consistent with a data state of the partition. The embodiment derives an implied constraint based on the constraint data statistics object. The embodiment processes the query for the partition based on the implied constraint.

    Fingerprints for compressed columnar data search

    公开(公告)号:US11677414B2

    公开(公告)日:2023-06-13

    申请号:US17645897

    申请日:2021-12-23

    Applicant: SAP SE

    Abstract: The present disclosure involves systems, software, and computer implemented methods for compressed columnar data search using fingerprints. One example method includes compressing columnar data that includes dividing the columnar data into multiple data blocks and generating a fingerprint for each data block, storing the compressed columnar data and the generated fingerprints in an in-memory database, receiving a query for the columnar data, for each in-memory data block stored in the in-memory database, determining whether the in-memory data block satisfies the query and in response to a determination that the in-memory data block does not satisfy the query, pruning the in-memory data block from the multiple data blocks to generate an unpruned set of data blocks, decompressing the unpruned set of data blocks, and performing a query search on the decompressed unpruned set of data blocks for the received query.

    WORKLOAD-AWARE COLUMN INPRINTS
    34.
    发明申请

    公开(公告)号:US20220277012A1

    公开(公告)日:2022-09-01

    申请号:US17750188

    申请日:2022-05-20

    Applicant: SAP SE

    Abstract: Techniques and solutions are described that can be used to determine data subsets of a data set that may be responsive to a query. In one aspect, imprints are created for the data subsets, where an imprint indicates whether, for a plurality of bins, a data subset contains values within respective bins. The bins are defined at least in part using data for a set of queries. In another aspect, at least one bin of a plurality of bins is designated to be associated with values of the data set satisfying a threshold frequency. In a further aspect, imprints are created for a plurality of bins, where an imprint indicates whether the plurality of data subsets comprise a value for a given bin. A query imprint can be compared with the data subset imprints or bin imprints to determine data subsets that may be responsive to an associated query.

    Space efficient vector for columnar data storage

    公开(公告)号:US11386082B2

    公开(公告)日:2022-07-12

    申请号:US16893697

    申请日:2020-06-05

    Applicant: SAP SE

    Abstract: Disclosed herein are system, method, and computer program product embodiments for providing paged and compressed storage for column data, while maintaining existing access mechanisms for the data. In order to reduce an in-memory footprint for column data, columns may be stored in pageable format using page chains, and only those pages of the column data needed to resolve a query will be placed in memory, and evicted from memory when no longer needed. In order to further reduce the footprint for these columns, compression can be applied, and the compressed column data stored in the same pageable format using page chains. The compressed data includes a plurality of vectors, each of which is converted into pages and stored on the page chain with the others so that they can be efficiently retrieved during database retrieval operations.

    Data imprints techniques for use with data retrieval methods

    公开(公告)号:US11366811B2

    公开(公告)日:2022-06-21

    申请号:US16880730

    申请日:2020-05-21

    Applicant: SAP SE

    Abstract: Techniques and solutions are described that can be used to determine data subsets of a data set that may be responsive to a query. In one aspect, imprints are created for the data subsets, where an imprint indicates whether, for a plurality of bins, a data subset contains values within respective bins. The bins are defined at least in part using data for a set of queries. In another aspect, at least one bin of a plurality of bins is designated to be associated with values of the data set satisfying a threshold frequency. In a further aspect, imprints are created for a plurality of bins, where an imprint indicates whether the plurality of data subsets comprise a value for a given bin. A query imprint can be compared with the data subset imprints or bin imprints to determine data subsets that may be responsive to an associated query.

    DATA IMPRINTS TECHNIQUES FOR USE WITH DATA RETRIEVAL METHODS

    公开(公告)号:US20210365458A1

    公开(公告)日:2021-11-25

    申请号:US16880730

    申请日:2020-05-21

    Applicant: SAP SE

    Abstract: Techniques and solutions are described that can be used to determine data subsets of a data set that may be responsive to a query. In one aspect, imprints are created for the data subsets, where an imprint indicates whether, for a plurality of bins, a data subset contains values within respective bins. The bins are defined at least in part using data for a set of queries. In another aspect, at least one bin of a plurality of bins is designated to be associated with values of the data set satisfying a threshold frequency. In a further aspect, imprints are created for a plurality of bins, where an imprint indicates whether the plurality of data subsets comprise a value for a given bin. A query imprint can be compared with the data subset imprints or bin imprints to determine data subsets that may be responsive to an associated query.

    Effective partition pruning using globally encoded partitions

    公开(公告)号:US11163773B2

    公开(公告)日:2021-11-02

    申请号:US15964624

    申请日:2018-04-27

    Applicant: SAP SE

    Abstract: Disclosed herein are system, method, and computer program product embodiments for partition pruning using globally encoded range partition information. An embodiment operates by partitioning a data table into a plurality of data partitions, determining a plurality of sub-partitions within the data partitions, and determining a global range table based at least in part on value ranges corresponding to the sub-partitions. Further, the global range table may be used to determine bit vectors for the data partitions and query predicates. In some examples, the bit vectors may be used to determine whether to prune data partitions during execution of a query over a large data volume.

Patent Agency Ranking