-
公开(公告)号:US20190236192A1
公开(公告)日:2019-08-01
申请号:US15964631
申请日:2018-04-27
Applicant: SAP SE
Inventor: William Zou , Reza Sherkat , John Smirnios , Mihnea Andrei
IPC: G06F17/30
CPC classification number: G06F16/24554 , G06F16/2282 , G06F16/24556
Abstract: Disclosed herein are system, method, and computer program product embodiments for partition pruning via non-homogenous synopsis information. An embodiment operates by maintaining synopsis information for a data partition in accordance with a first synopsis strategy, monitoring performance of the synopsis information within a partition pruning system, determining that the performance of the synopsis information is insufficient, and updating the synopsis information in accordance with a second synopsis strategy better suited for the attributes of the data partition. In some embodiments, a first data partition of a partitioned data table may employ a first synopsis strategy and a second data partition of the partitioned data table may employ a second synopsis strategy.
-
公开(公告)号:US20170364554A1
公开(公告)日:2017-12-21
申请号:US15186300
申请日:2016-06-17
Applicant: SAP SE
Inventor: Anisoara Nica , Peter Bumbulis , Reza Sherkat , Mihnea Andrei , Anil K. Goel
IPC: G06F17/30
CPC classification number: G06F16/24534 , G06F16/2228 , G06F16/2455
Abstract: Cracking page-loadable columns for in-memory data management is described herein. An embodiment operates by accessing a column according to a received query, determining that the received query requires a non-critical data structure associated with the column, and rebuilding the non-critical data structure from data associated with the column.
-
公开(公告)号:US12047098B2
公开(公告)日:2024-07-23
申请号:US18072979
申请日:2022-12-01
Applicant: SAP SE
Inventor: Jia Shi , Reza Sherkat , John Smirnios
CPC classification number: H03M7/46 , G06F16/2282 , H03M7/3088
Abstract: Techniques and solutions are described for compressing data and facilitating access to compressed data. Compression can be applied to proper data subsets of a data set, such as to columns of a table. Using various methods, the proper data subsets can be evaluated to be included in a group of proper data subsets to be compressed using a first compression technique, where unselected proper data subsets are not compressed using the first compression technique. Data in the data set can be reordered based on a reordering sequence for the proper data subsets. Reordering data in the data set can improve compression when at least a portion of the proper data subsets are compressed. A data structure is provided that facilitates accessing specified data stored in a compressed format.
-
公开(公告)号:US11803550B2
公开(公告)日:2023-10-31
申请号:US17750188
申请日:2022-05-20
Applicant: SAP SE
Inventor: Noah Slavitch , Reza Sherkat
IPC: G06F16/2455 , G06F16/22 , G06F16/248 , G06F9/30
CPC classification number: G06F16/24554 , G06F9/30029 , G06F16/2237 , G06F16/248
Abstract: Techniques and solutions are described that can be used to determine data subsets of a data set that may be responsive to a query. In one aspect, imprints are created for the data subsets, where an imprint indicates whether, for a plurality of bins, a data subset contains values within respective bins. The bins are defined at least in part using data for a set of queries. In another aspect, at least one bin of a plurality of bins is designated to be associated with values of the data set satisfying a threshold frequency. In a further aspect, imprints are created for a plurality of bins, where an imprint indicates whether the plurality of data subsets comprise a value for a given bin. A query imprint can be compared with the data subset imprints or bin imprints to determine data subsets that may be responsive to an associated query.
-
公开(公告)号:US11558067B2
公开(公告)日:2023-01-17
申请号:US16878419
申请日:2020-05-19
Applicant: SAP SE
Inventor: Jia Shi , Reza Sherkat , John Smirnios
Abstract: Techniques and solutions are described for compressing data and facilitating access to compressed data. Compression can be applied to proper data subsets of a data set, such as to columns of a table. Using various methods, the proper data subsets can be evaluated to be included in a group of proper data subsets to be compressed using a first compression technique, where unselected proper data subsets are not compressed using the first compression technique. Data in the data set can be reordered based on a reordering sequence for the proper data subsets. Reordering data in the data set can improve compression when at least a portion of the proper data subsets are compressed. A data structure is provided that facilitates accessing specified data stored in a compressed format.
-
公开(公告)号:US11514027B2
公开(公告)日:2022-11-29
申请号:US16857982
申请日:2020-04-24
Applicant: SAP SE
Inventor: Reza Sherkat , Chaitanya Gottipati , Colin Florendo , Martin Heidel
IPC: G06F16/22 , G06F16/23 , G06F12/02 , G06F3/06 , G06F12/0882 , G06F16/2455 , G06F12/0875 , G06F9/48
Abstract: Disclosed herein are system, method, and computer program product embodiments for storing a large object (LOB) in a database. An embodiment operates by identifying the LOB to be stored in an on-disk store of the database and determining the size of the LOB. Thereafter, an index vector for the on-disk store to include an identifier corresponding to the LOB is created. Additionally, a dictionary for the on-disk store to include a copy of the identifier and corresponding LOB data is also created. The LOB data corresponds to the LOB itself or a LOB location inside of the on-disk store based on the LOB's size.
-
公开(公告)号:US11468035B2
公开(公告)日:2022-10-11
申请号:US15796326
申请日:2017-10-27
Applicant: SAP SE
Inventor: Anisoara Nica , Reza Sherkat , Mihnea Andrei
IPC: G06F16/23 , G06F16/21 , G06F16/27 , G06F16/22 , G06F16/2458 , G06F16/2453
Abstract: Disclosed herein are system, method, and computer program product embodiments for using a data statistic as a dynamic data integrity constraint. An embodiment operates by defining a data statistic for a column or a set of columns of a partition of a plurality of partitions of a database table. The embodiment creates a constraint data statistics object based on the data statistic. The embodiment receives a query for the database table. The embodiment determines the constraint data statistics object is consistent with a data state of the partition. The embodiment derives an implied constraint based on the constraint data statistics object. The embodiment processes the query for the partition based on the implied constraint.
-
公开(公告)号:US20220114181A1
公开(公告)日:2022-04-14
申请号:US17645897
申请日:2021-12-23
Applicant: SAP SE
Inventor: Carmen Kwan , Reza Sherkat
IPC: G06F16/2455 , G06F16/22 , H03M7/30
Abstract: The present disclosure involves systems, software, and computer implemented methods for compressed columnar data search using fingerprints. One example method includes compressing columnar data that includes dividing the columnar data into multiple data blocks and generating a fingerprint for each data block, storing the compressed columnar data and the generated fingerprints in an in-memory database, receiving a query for the columnar data, for each in-memory data block stored in the in-memory database, determining whether the in-memory data block satisfies the query and in response to a determination that the in-memory data block does not satisfy the query, pruning the in-memory data block from the multiple data blocks to generate an unpruned set of data blocks, decompressing the unpruned set of data blocks, and performing a query search on the decompressed unpruned set of data blocks for the received query.
-
公开(公告)号:US20190236193A1
公开(公告)日:2019-08-01
申请号:US15964624
申请日:2018-04-27
Applicant: SAP SE
Inventor: Yu-Jui CHEN , Reza Sherkat , John Smirnios , Mihnea Andrei
IPC: G06F17/30
CPC classification number: G06F16/24561 , G06F16/2282 , G06F16/242 , G06F16/278
Abstract: Disclosed herein are system, method, and computer program product embodiments for partition pruning using globally encoded range partition information. An embodiment operates by partitioning a data table into a plurality of data partitions, determining a plurality of sub-partitions within the data partitions, and determining a global range table based at least in part on value ranges corresponding to the sub-partitions. Further, the global range table may be used to determine bit vectors for the data partitions and query predicates. In some examples, the bit vectors may be used to determine whether to prune data partitions during execution of a query over a large data volume.
-
公开(公告)号:US20170371909A1
公开(公告)日:2017-12-28
申请号:US15361337
申请日:2016-11-25
Applicant: SAP SE
Inventor: Mihnea Andrei , Christian Bensberg , Alexander Boehm , Adrian Draguseanu , Xun Chen , Reza Sherkat , Ani Nica , Martin Heidel , Rolando Blanco
IPC: G06F17/30
CPC classification number: G06F16/282 , G06F16/21
Abstract: Persisting data in logical tiers of an in-memory database based on aging rules of the database system. The database system can be configured to age data by moving the data from a first logical tier supporting high performance data access to a second logical tier supporting lower performance data access, thereby improving the operational efficiency of the database system.
-
-
-
-
-
-
-
-
-