Database object store size estimation

    公开(公告)号:US12019883B2

    公开(公告)日:2024-06-25

    申请号:US17868450

    申请日:2022-07-19

    Applicant: SAP SE

    CPC classification number: G06F3/0631 G06F3/0608 G06F3/0673 G06F16/2272

    Abstract: A system includes detection of a first allocation of a first memory size in an object store for storage of a first logical page, in response to detection of the first allocation, incrementing a count associated with the first memory size by a first data structure associating a respective count with each of a plurality of memory sizes, detection of a first deallocation of the first logical page, in response to detection of the first deallocation, decrementing a count associated with a second one of the plurality of memory sizes by the first data structure, and determination of a memory usage associated with the object store based on the counts associated with each of the plurality of memory sizes by the first data structure wherein the second one of the plurality of memory sizes is different from the first memory size.

    Extended cache for efficient object store access by a database

    公开(公告)号:US11567873B1

    公开(公告)日:2023-01-31

    申请号:US17486206

    申请日:2021-09-27

    Applicant: SAP SE

    Abstract: Disclosed herein are system, method, and computer program product embodiments for utilizing an extended cache to access an object store efficiently. An embodiment operates by executing a database transaction, thereby causing pages to be written from a buffer cache to an extended cache and to an object store. The embodiment determines a transaction type of the database transaction. The transaction type can a read-only transaction or an update transaction. The embodiment determines a phase of the database transaction based on the determined transaction type. The phase can be an execution phase or a commit phase. The embodiment then applies a caching policy to the extended cache for the evicted pages based on the determined transaction type of the database transaction and the determined phase of the database transaction.

    DATASET SUMMARY METADATA PROVIDING IMPROVED QUERY PERFORMANCE

    公开(公告)号:US20250110955A1

    公开(公告)日:2025-04-03

    申请号:US18376369

    申请日:2023-10-03

    Applicant: SAP SE

    Abstract: The present disclosure provides techniques and solutions for determining whether a particular value is in a dataset using summary information. A sorted set of unique values is received. The sorted set of unique values includes gaps between at least certain values. The gaps are determined, and the set of unique values is represented as a gap filter. The gap filter includes a starting value of the set of unique values, a set of gap lengths, and identifiers indicating a number of unique values between respective gaps. The gap filter serves as summary information that can be used to determine whether a value be present in the dataset. In at least some cases, the use of the summary information may provide false positive results. The representation of the gap filter can be modified to improve its compressibility, but may increase the number of false positives produced by the gap filter.

    Efficient sorting techniques facilitating the creation and use of dataset summary metadata

    公开(公告)号:US12265514B1

    公开(公告)日:2025-04-01

    申请号:US18376371

    申请日:2023-10-03

    Applicant: SAP SE

    Abstract: The present disclosure provides techniques and solutions for sorting data. In a particular implementation, a sorting technique is provided that places values in a sorted order by adding an offset value to values that are not in a sorted order. The resulting sorted set of values is not truly sorted, in that the set of modified values is sorted, but the underlying data itself is not sorted. In another implementation, a sorting technique can use multiple streams or sets. When an out of order element is encountered, it can be added to a new stream, if such a stream is available. The sorting techniques can be used for a variety of purposes, including provided sorted data for use in generating summary data, or for providing sorted data to be used in determining an intersection between two datasets.

    APPROXIMATE ANALYTICS WITH QUERY-TIME SAMPLING FOR EXPLORATORY DATA ANALYSIS

    公开(公告)号:US20200286011A1

    公开(公告)日:2020-09-10

    申请号:US16292511

    申请日:2019-03-05

    Applicant: SAP SE

    Abstract: A system and method are disclosed to facilitate exploratory data analytics for an enterprise. A storage area network, for a column-oriented relational database management system, may contain electronic records that store enterprise information. A query engine may receive, from a user via an interactive user interface, query parameters associated with the enterprise information. The query engine may then automatically generate an approximate query for exploratory data analytics using query-time sampling, the approximate query being associated with at least one of: (1) a stratified sampler with randomized row access, and (2) a hash-based, outlier aware join sampler. The approximate query may then be executed in connection with the enterprise information in the storage area network, and results of the executed approximate query may be provided to the user via the user interface.

    EFFICIENT SORTING TECHNIQUES FACILITATING THE CREATION AND USE OF DATASET SUMMARY METADATA

    公开(公告)号:US20250110937A1

    公开(公告)日:2025-04-03

    申请号:US18376371

    申请日:2023-10-03

    Applicant: SAP SE

    Abstract: The present disclosure provides techniques and solutions for sorting data. In a particular implementation, a sorting technique is provided that places values in a sorted order by adding an offset value to values that are not in a sorted order. The resulting sorted set of values is not truly sorted, in that the set of modified values is sorted, but the underlying data itself is not sorted. In another implementation, a sorting technique can use multiple streams or sets. When an out of order element is encountered, it can be added to a new stream, if such a stream is available. The sorting techniques can be used for a variety of purposes, including provided sorted data for use in generating summary data, or for providing sorted data to be used in determining an intersection between two datasets.

    Dataset summary metadata providing improved query performance

    公开(公告)号:US12265535B1

    公开(公告)日:2025-04-01

    申请号:US18376369

    申请日:2023-10-03

    Applicant: SAP SE

    Abstract: The present disclosure provides techniques and solutions for determining whether a particular value is in a dataset using summary information. A sorted set of unique values is received. The sorted set of unique values includes gaps between at least certain values. The gaps are determined, and the set of unique values is represented as a gap filter. The gap filter includes a starting value of the set of unique values, a set of gap lengths, and identifiers indicating a number of unique values between respective gaps. The gap filter serves as summary information that can be used to determine whether a value be present in the dataset. In at least some cases, the use of the summary information may provide false positive results. The representation of the gap filter can be modified to improve its compressibility, but may increase the number of false positives produced by the gap filter.

    Adaptive data prefetch
    10.
    发明授权

    公开(公告)号:US11966393B2

    公开(公告)日:2024-04-23

    申请号:US17840961

    申请日:2022-06-15

    Applicant: SAP SE

    CPC classification number: G06F16/24539 G06F16/2255 G06F16/24542

    Abstract: A system includes reception of a request to fetch a first dataset, the request including a first identifier associated with the first dataset, determination, in response to the request, of a first signature associated with the first dataset based on the first identifier and on respective identifiers of one or more datasets fetched consecutively with the first dataset, and determination, based on the determined first signature, of whether to update a first link associating a second dataset with the first dataset, the link indicating to prefetch the first dataset in response to a request to fetch the second dataset.

Patent Agency Ranking