Missing data-based indexing in database systems

    Publication No.: US11803544B2

    Publication date: 2023-10-31

    Application No.: US17450109

    Filing date: 2021-10-06

    Abstract: A method for execution by at least one processor of a database system includes storing a plurality of column values for a first column of a plurality of rows. Each of a set of missing data-based conditions for the first column is indexed via an indexing scheme. A subset of the set of missing data-based conditions that satisfy a query predicate of a query is identified. An IO pipeline for access of the first column based on the query predicate and further based on the subset of the set of missing data-based conditions is generated. The IO pipeline is applied in conjunction with execution of the query by applying at least one index element to identify a proper subset of the plurality of rows based on index data of the indexing scheme, and by generating a query resultant based on the proper subset.
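The indexing idea in this abstract can be illustrated with a minimal sketch. It assumes two hypothetical missing-data conditions (`is_null` and `is_empty`); the abstract does not enumerate the conditions or describe the index layout, so the names and the dictionary-of-sets structure below are illustrative only.

```python
from typing import Any, Optional

# Hypothetical missing-data conditions; the abstract does not list them.
IS_NULL = "is_null"
IS_EMPTY = "is_empty"  # empty string or empty list


def build_missing_index(column: list[Optional[Any]]) -> dict[str, set[int]]:
    """Map each missing-data condition to the row numbers that meet it."""
    index: dict[str, set[int]] = {IS_NULL: set(), IS_EMPTY: set()}
    for row, value in enumerate(column):
        if value is None:
            index[IS_NULL].add(row)
        elif value == "" or value == []:
            index[IS_EMPTY].add(row)
    return index


def select_rows(index: dict[str, set[int]], conditions: list[str]) -> list[int]:
    """Union the index entries for the conditions that satisfy the predicate."""
    rows: set[int] = set()
    for condition in conditions:
        rows |= index[condition]
    return sorted(rows)


column = ["a", None, "", "b", None]
index = build_missing_index(column)

# A predicate such as "col IS NULL" needs only one indexed condition.
null_rows = select_rows(index, [IS_NULL])

# A predicate such as "col is missing" needs both conditions.
missing_rows = select_rows(index, [IS_NULL, IS_EMPTY])
```

Only the rows returned by `select_rows` would then need to be read from the column, which is the "proper subset" the abstract refers to.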

    Columnar cache query using hybrid query execution plan

    Publication No.: US11782921B2

    Publication date: 2023-10-10

    Application No.: US17521213

    Filing date: 2021-11-08

    Applicant: Google LLC

    Abstract: Aspects of the disclosure are directed to generating a hybrid query execution plan for executing queries on database systems implementing a columnar cache. A hybrid query execution plan combines a query execution plan for querying and retrieving data from a columnar cache and a base table. A columnar cache stores cached data in column-major format, which is logically represented by the database management system in row-major format. A database management system as described herein can scan valid blocks of column data according to a column scan operation. The system can identify invalidated blocks and execute a different sub-execution plan of the hybrid query execution plan to scan corresponding rows in tables corresponding to the location of data for the invalidated blocks.
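A rough sketch of the hybrid scan described above, under stated assumptions: the `CachedBlock` class, the fixed row ranges per block, and the list-of-dicts base table are hypothetical stand-ins, not the structures used in the patent.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass
class CachedBlock:
    """Hypothetical cached block of one column's values."""
    first_row: int       # row id of the first value in the block
    values: list[int]    # cached column values, possibly stale
    valid: bool = True   # False once the underlying rows have changed


def hybrid_scan(
    blocks: list[CachedBlock],
    base_table: list[dict],
    column: str,
    predicate: Callable[[int], bool],
) -> list[int]:
    """Return ids of matching rows, choosing a sub-plan per block."""
    matches: list[int] = []
    for block in blocks:
        if block.valid:
            # Sub-plan 1: column scan over the cached values.
            for offset, value in enumerate(block.values):
                if predicate(value):
                    matches.append(block.first_row + offset)
        else:
            # Sub-plan 2: read the corresponding rows from the base table.
            stop = block.first_row + len(block.values)
            for row_id in range(block.first_row, stop):
                if predicate(base_table[row_id][column]):
                    matches.append(row_id)
    return matches


base_table = [{"price": p} for p in (5, 12, 7, 30, 18, 2)]
blocks = [
    CachedBlock(first_row=0, values=[5, 12, 7]),
    # Row 4 was updated after caching, so this block holds a stale 99.
    CachedBlock(first_row=3, values=[30, 99, 2], valid=False),
]
expensive = hybrid_scan(blocks, base_table, "price", lambda v: v > 10)
```

The stale value in the invalidated block is never consulted; rows 3 to 5 are evaluated against the base table instead.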

    AUTOMATIC BUFFER POOL PAGE FIX PRIORITIZATION FOR DATABASE TUNING

    Publication No.: US20230315697A1

    Publication date: 2023-10-05

    Application No.: US17657347

    Filing date: 2022-03-30

    Inventor: Donnell W. Roach

    Abstract: For each database subsystem in a plurality of database subsystems running in a logical partition (LPAR), the systems and techniques include collecting data from buffer pools on a periodic interval, monitoring real storage availability on the LPAR, calculating an input/output (I/O) intensity score for each buffer pool for the periodic interval, ranking the buffer pools based on the I/O intensity score, calculating a cumulative I/O intensity score for each buffer pool across a cycle of multiple periodic intervals, ranking the buffer pools based on the cumulative I/O intensity score, and selecting a buffer pool candidate for page fixing. The buffer pool candidate for page fixing is a highest-ranked buffer pool from the plurality of buffer pools during the cycle based on the cumulative I/O intensity score that satisfies the real storage availability on the LPAR.
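The ranking and selection steps can be sketched as follows. The abstract does not give the I/O intensity formula, so this sketch assumes a simple ratio of I/O operations to buffer pool size in pages; the function names and sample figures are likewise hypothetical.

```python
from typing import Optional


def io_intensity(io_count: int, pool_pages: int) -> float:
    """Assumed score: I/O operations per buffer pool page in one interval."""
    return io_count / pool_pages


def pick_page_fix_candidate(
    samples: list[dict[str, int]],
    pool_pages: dict[str, int],
    available_pages: int,
) -> Optional[str]:
    """Pick the highest-ranked pool that fits in available real storage.

    samples holds one dict per periodic interval, mapping pool name to the
    I/O count observed in that interval.
    """
    cumulative = {name: 0.0 for name in pool_pages}
    for interval in samples:
        for name, io_count in interval.items():
            cumulative[name] += io_intensity(io_count, pool_pages[name])

    # Rank by cumulative score across the whole cycle, highest first.
    ranked = sorted(cumulative, key=cumulative.get, reverse=True)
    for name in ranked:
        if pool_pages[name] <= available_pages:
            return name
    return None


pool_pages = {"BP0": 1000, "BP1": 4000, "BP2": 500}
samples = [
    {"BP0": 5000, "BP1": 40000, "BP2": 1000},
    {"BP0": 7000, "BP1": 36000, "BP2": 500},
]
candidate = pick_page_fix_candidate(samples, pool_pages, available_pages=2000)
```

Here `BP1` has the highest cumulative score but does not fit in the 2000 available pages, so the next-ranked pool, `BP0`, is selected.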

    METHOD FOR DATA PROCESSING, COMPUTING DEVICE, AND STORAGE MEDIUM

    Publication No.: US20230306031A1

    Publication date: 2023-09-28

    Application No.: US17921620

    Filing date: 2021-12-08

    IPC classification: G06F16/2455 G06F16/22

    Abstract: A method is provided. The method comprises obtaining a first data stream; querying a query table to look up storage information of a second data stream corresponding to the first data stream in a database, wherein a plurality of second data streams are stored in the database, one or more valid second data streams are included in the plurality of second data streams, and storage information of each of the one or more valid second data streams in the database is included in the query table; and in response to having found in the query table the storage information of the second data stream corresponding to the first data stream in the database, determining, in the database, the second data stream to be spliced with the first data stream based on the storage information of the second data stream.

    MEMORY MANAGEMENT FOR KLL SKETCH

    Publication No.: US20230289346A1

    Publication date: 2023-09-14

    Application No.: US17690569

    Filing date: 2022-03-09

    Abstract: The present application relates to managing memory of a KLL sketch for a streaming set. A computer system allocates an initial amount of memory from a computer memory to a buffer of a KLL sketch data structure based on a sparse sketch size independent of a data type of the streaming set. The system allocates an incremental amount of memory greater than or equal to the initial amount of memory from the computer memory to the buffer in response to a size of the streaming set exceeding a supported stream size of the KLL sketch data structure. The system determines a number of levels of the KLL sketch data structure based on a total allocated amount of memory and a size of the data type of the streaming set. The system adds new elements from the streaming set to the KLL sketch data structure based on the number of levels.

    Flat object storage namespace in an object storage system

    Publication No.: US11755557B2

    Publication date: 2023-09-12

    Application No.: US17930214

    Filing date: 2022-09-07

    Applicant: NetApp, Inc.

    Abstract: A system, method, and machine-readable storage medium for maintaining object storage system data are provided. In some embodiments, an object manager may receive a request to perform an operation on an object. The object storage system includes a first database of a first type and a second database of a second type. The object manager may identify a first record stored in the first database. The first record includes a name marker indicating a range of object names covered by the second database and includes a file handle referencing the second database. The range of object names includes the object name. Additionally, the object manager may identify a second record stored in the second database. The second record includes the object name and includes a file handle referencing the object. The object manager may perform the operation on the object in accordance with the request.
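The two-step lookup described above can be sketched as below. This assumes each name marker is the inclusive upper bound of the range of object names its second database covers; the abstract does not say how the marker encodes the range, and the handles and in-memory dictionaries are hypothetical.

```python
import bisect
from typing import Optional


def find_object_handle(
    first_db: list[tuple[str, str]],
    second_dbs: dict[str, dict[str, str]],
    object_name: str,
) -> Optional[str]:
    """Resolve an object name to the file handle of the object.

    first_db is a list of (name_marker, db_handle) records sorted by marker.
    second_dbs maps a db_handle to that database's records, each of which
    maps an object name to the file handle of the object.
    """
    markers = [marker for marker, _ in first_db]
    # First record whose marker is >= object_name covers that name.
    position = bisect.bisect_left(markers, object_name)
    if position == len(first_db):
        return None  # name sorts after every covered range
    _, db_handle = first_db[position]
    # Second record, if present, carries the handle to the object itself.
    return second_dbs[db_handle].get(object_name)


first_db = [("g", "db-a"), ("p", "db-b"), ("\uffff", "db-c")]
second_dbs = {
    "db-a": {"apple.txt": "obj-1"},
    "db-b": {"kiwi.txt": "obj-2"},
    "db-c": {"zebra.txt": "obj-3"},
}
kiwi_handle = find_object_handle(first_db, second_dbs, "kiwi.txt")
```

Once the handle is resolved, the requested operation would be performed against the object it references.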