NON-HOMOGENOUS SYNOPSIS FOR EFFICIENT PARTITION PRUNING

    公开(公告)号:US20190236192A1

    公开(公告)日:2019-08-01

    申请号:US15964631

    申请日:2018-04-27

    Applicant: SAP SE

    CPC classification number: G06F16/24554 G06F16/2282 G06F16/24556

    Abstract: Disclosed herein are system, method, and computer program product embodiments for partition pruning via non-homogenous synopsis information. An embodiment operates by maintaining synopsis information for a data partition in accordance with a first synopsis strategy, monitoring performance of the synopsis information within a partition pruning system, determining that the performance of the synopsis information is insufficient, and updating the synopsis information in accordance with a second synopsis strategy better suited for the attributes of the data partition. In some embodiments, a first data partition of a partitioned data table may employ a first synopsis strategy and a second data partition of the partitioned data table may employ a second synopsis strategy.

    Data compression techniques
    13.
    发明授权

    公开(公告)号:US12047098B2

    公开(公告)日:2024-07-23

    申请号:US18072979

    申请日:2022-12-01

    Applicant: SAP SE

    CPC classification number: H03M7/46 G06F16/2282 H03M7/3088

    Abstract: Techniques and solutions are described for compressing data and facilitating access to compressed data. Compression can be applied to proper data subsets of a data set, such as to columns of a table. Using various methods, the proper data subsets can be evaluated to be included in a group of proper data subsets to be compressed using a first compression technique, where unselected proper data subsets are not compressed using the first compression technique. Data in the data set can be reordered based on a reordering sequence for the proper data subsets. Reordering data in the data set can improve compression when at least a portion of the proper data subsets are compressed. A data structure is provided that facilitates accessing specified data stored in a compressed format.

    Workload-aware column imprints
    14.
    发明授权

    公开(公告)号:US11803550B2

    公开(公告)日:2023-10-31

    申请号:US17750188

    申请日:2022-05-20

    Applicant: SAP SE

    CPC classification number: G06F16/24554 G06F9/30029 G06F16/2237 G06F16/248

    Abstract: Techniques and solutions are described that can be used to determine data subsets of a data set that may be responsive to a query. In one aspect, imprints are created for the data subsets, where an imprint indicates whether, for a plurality of bins, a data subset contains values within respective bins. The bins are defined at least in part using data for a set of queries. In another aspect, at least one bin of a plurality of bins is designated to be associated with values of the data set satisfying a threshold frequency. In a further aspect, imprints are created for a plurality of bins, where an imprint indicates whether the plurality of data subsets comprise a value for a given bin. A query imprint can be compared with the data subset imprints or bin imprints to determine data subsets that may be responsive to an associated query.

    Data compression techniques
    15.
    发明授权

    公开(公告)号:US11558067B2

    公开(公告)日:2023-01-17

    申请号:US16878419

    申请日:2020-05-19

    Applicant: SAP SE

    Abstract: Techniques and solutions are described for compressing data and facilitating access to compressed data. Compression can be applied to proper data subsets of a data set, such as to columns of a table. Using various methods, the proper data subsets can be evaluated to be included in a group of proper data subsets to be compressed using a first compression technique, where unselected proper data subsets are not compressed using the first compression technique. Data in the data set can be reordered based on a reordering sequence for the proper data subsets. Reordering data in the data set can improve compression when at least a portion of the proper data subsets are compressed. A data structure is provided that facilitates accessing specified data stored in a compressed format.

    Paged hybrid LOBs
    16.
    发明授权

    公开(公告)号:US11514027B2

    公开(公告)日:2022-11-29

    申请号:US16857982

    申请日:2020-04-24

    Applicant: SAP SE

    Abstract: Disclosed herein are system, method, and computer program product embodiments for storing a large object (LOB) in a database. An embodiment operates by identifying the LOB to be stored in an on-disk store of the database and determining the size of the LOB. Thereafter, an index vector for the on-disk store to include an identifier corresponding to the LOB is created. Additionally, a dictionary for the on-disk store to include a copy of the identifier and corresponding LOB data is also created. The LOB data corresponds to the LOB itself or a LOB location inside of the on-disk store based on the LOB's size.

    Constraint data statistics
    17.
    发明授权

    公开(公告)号:US11468035B2

    公开(公告)日:2022-10-11

    申请号:US15796326

    申请日:2017-10-27

    Applicant: SAP SE

    Abstract: Disclosed herein are system, method, and computer program product embodiments for using a data statistic as a dynamic data integrity constraint. An embodiment operates by defining a data statistic for a column or a set of columns of a partition of a plurality of partitions of a database table. The embodiment creates a constraint data statistics object based on the data statistic. The embodiment receives a query for the database table. The embodiment determines the constraint data statistics object is consistent with a data state of the partition. The embodiment derives an implied constraint based on the constraint data statistics object. The embodiment processes the query for the partition based on the implied constraint.

    FINGERPRINTS FOR COMPRESSED COLUMNAR DATA SEARCH

    公开(公告)号:US20220114181A1

    公开(公告)日:2022-04-14

    申请号:US17645897

    申请日:2021-12-23

    Applicant: SAP SE

    Abstract: The present disclosure involves systems, software, and computer implemented methods for compressed columnar data search using fingerprints. One example method includes compressing columnar data that includes dividing the columnar data into multiple data blocks and generating a fingerprint for each data block, storing the compressed columnar data and the generated fingerprints in an in-memory database, receiving a query for the columnar data, for each in-memory data block stored in the in-memory database, determining whether the in-memory data block satisfies the query and in response to a determination that the in-memory data block does not satisfy the query, pruning the in-memory data block from the multiple data blocks to generate an unpruned set of data blocks, decompressing the unpruned set of data blocks, and performing a query search on the decompressed unpruned set of data blocks for the received query.

    EFFECTIVE PARTITION PRUNING USING GLOBALLY ENCODED PARTITIONS

    公开(公告)号:US20190236193A1

    公开(公告)日:2019-08-01

    申请号:US15964624

    申请日:2018-04-27

    Applicant: SAP SE

    CPC classification number: G06F16/24561 G06F16/2282 G06F16/242 G06F16/278

    Abstract: Disclosed herein are system, method, and computer program product embodiments for partition pruning using globally encoded range partition information. An embodiment operates by partitioning a data table into a plurality of data partitions, determining a plurality of sub-partitions within the data partitions, and determining a global range table based at least in part on value ranges corresponding to the sub-partitions. Further, the global range table may be used to determine bit vectors for the data partitions and query predicates. In some examples, the bit vectors may be used to determine whether to prune data partitions during execution of a query over a large data volume.

Patent Agency Ranking