Evaluating row-store expressions on a column-store database

    公开(公告)号:US12038894B2

    公开(公告)日:2024-07-16

    申请号:US18167134

    申请日:2023-02-10

    申请人: Google LLC

    IPC分类号: G06F16/22 G06F12/02

    摘要: Aspects of the disclosure provide for natively executing row-store expression data structures on column-store databases without rewriting. A database management system (DBMS) configured as described herein can maintain a mapping of row-store results to addresses of where corresponding column data is stored. When executing operators, such as logical operators, comparison operators, and/or function operators of a received query expression, the DBMS can operate on the column data, rather than the individual rows. The DBMS can store the results generated by executing the column operators, for example on a stack, and record the row-store addresses to which the stored results correspond. The DBMS responds with a number of rows corresponding to the processed column data.

    Columnar Cache Query Using Hybrid Query Execution Plan

    公开(公告)号:US20230141902A1

    公开(公告)日:2023-05-11

    申请号:US17521213

    申请日:2021-11-08

    申请人: Google LLC

    摘要: Aspects of the disclosure are directed to generating a hybrid query execution plan for executing queries on database systems implementing a columnar cache. A hybrid query execution plan combines a query execution plan for querying and retrieving data from a columnar cache and a base table. A columnar cache stores cached data in column-major format, which is logically represented by the database management system in row-major format. A database management system as described herein can scan valid blocks of column data according to a column scan operation. The system can identify invalidated blocks and execute a different sub-execution plan of the hybrid query execution plan to scan corresponding rows in tables corresponding to the location of data for the invalidated blocks.

    Late Materialization of Queried Data in Database Cache

    公开(公告)号:US20230141190A1

    公开(公告)日:2023-05-11

    申请号:US17522504

    申请日:2021-11-09

    申请人: Google LLC

    IPC分类号: G06F16/2453 G06F16/2455

    摘要: Aspects of the disclosure are directed to late materialization of attributes in response to queries to a database implementing a database cache. Queried data is materialized in temporary memory before the data is projected as part of generating a result to the query. Instead of materializing all of the attributes referenced in a query before executing the query, a database management system materializes attributes as “late” as possible—when the operation needing the attributes is executed. The operation needing the attributes can be performed sooner, as opposed to materializing all referenced attributes are materialized before executing the query.

    Late materialization of queried data in database cache

    公开(公告)号:US12130814B2

    公开(公告)日:2024-10-29

    申请号:US17522504

    申请日:2021-11-09

    申请人: Google LLC

    IPC分类号: G06F16/2453 G06F16/2455

    摘要: Aspects of the disclosure are directed to late materialization of attributes in response to queries to a database implementing a database cache. Queried data is materialized in temporary memory before the data is projected as part of generating a result to the query. Instead of materializing all of the attributes referenced in a query before executing the query, a database management system materializes attributes as “late” as possible—when the operation needing the attributes is executed. The operation needing the attributes can be performed sooner, as opposed to materializing all referenced attributes are materialized before executing the query.

    Evaluating Row-Store Expressions on a Column-Store Database

    公开(公告)号:US20230367751A1

    公开(公告)日:2023-11-16

    申请号:US18167134

    申请日:2023-02-10

    申请人: Google LLC

    IPC分类号: G06F16/22 G06F12/02

    摘要: Aspects of the disclosure provide for natively executing row-store expression data structures on column-store databases without rewriting. A database management system (DBMS) configured as described herein can maintain a mapping of row-store results to addresses of where corresponding column data is stored. When executing operators, such as logical operators, comparison operators, and/or function operators of a received query expression, the DBMS can operate on the column data, rather than the individual rows. The DBMS can store the results generated by executing the column operators, for example on a stack, and record the row-store addresses to which the stored results correspond. The DBMS responds with a number of rows corresponding to the processed column data.

    Columnar cache query using hybrid query execution plan

    公开(公告)号:US11782921B2

    公开(公告)日:2023-10-10

    申请号:US17521213

    申请日:2021-11-08

    申请人: Google LLC

    摘要: Aspects of the disclosure are directed to generating a hybrid query execution plan for executing queries on database systems implementing a columnar cache. A hybrid query execution plan combines a query execution plan for querying and retrieving data from a columnar cache and a base table. A columnar cache stores cached data in column-major format, which is logically represented by the database management system in row-major format. A database management system as described herein can scan valid blocks of column data according to a column scan operation. The system can identify invalidated blocks and execute a different sub-execution plan of the hybrid query execution plan to scan corresponding rows in tables corresponding to the location of data for the invalidated blocks.

    Database Join Operations With Early Filtering

    公开(公告)号:US20240078237A1

    公开(公告)日:2024-03-07

    申请号:US17939141

    申请日:2022-09-07

    申请人: Google LLC

    发明人: Xiaobin Ma Xun Cheng

    摘要: Aspects of the disclosure are directed to early filtering of candidate rows for a join operator of a query statement before the join operator is evaluated to generate a result set. Early filtering, e.g., before execution of the join operator, reduces the number of candidate rows fetched from a database during a join operator, which can improve the rate at which queries including join operators are executed by a DBMS for the database. One or more Bloom filters or other probabilistic data structures can be produced and consumed according to different join conditions, allowing for early pruning of unqualified rows during a database scan operation.

    Columnar Cache Query Using Hybrid Query Execution Plan

    公开(公告)号:US20230401209A1

    公开(公告)日:2023-12-14

    申请号:US18237490

    申请日:2023-08-24

    申请人: Google LLC

    摘要: Aspects of the disclosure are directed to generating a hybrid query execution plan for executing queries on database systems implementing a columnar cache. A hybrid query execution plan combines a query execution plan for querying and retrieving data from a columnar cache and a base table. A columnar cache stores cached data in column-major format, which is logically represented by the database management system in row-major format. A database management system as described herein can scan valid blocks of column data according to a column scan operation. The system can identify invalidated blocks and execute a different sub-execution plan of the hybrid query execution plan to scan corresponding rows in tables corresponding to the location of data for the invalidated blocks.