JOIN QUERY PROCESSING USING PRUNING INDEX

    公开(公告)号:US20220292098A1

    公开(公告)日:2022-09-15

    申请号:US17804630

    申请日:2022-05-31

    Applicant: Snowflake Inc.

    Abstract: A query directed at a table organized into a set of batch units is received. The query comprises a predicate for which values are unknown prior to runtime. A set of values for the predicate are determined based on the query. An index access plan is created based on the set of values. Based on the index access plan, the set of batch units are pruned using a pruning index associated with the table. The pruning index comprises a set of filters that index distinct values in each column of the table. The pruning of the set of batch units comprises identifying a subset of batch units to scan for data that satisfies the query. The subset of batch units of the table are scanned to identify data that satisfies the query.

    PRUNING TECHNIQUES FOR PROCESSING TOP K QUERIES

    公开(公告)号:US20240168953A1

    公开(公告)日:2024-05-23

    申请号:US18534382

    申请日:2023-12-08

    Applicant: Snowflake Inc.

    CPC classification number: G06F16/24557 G06F16/24578

    Abstract: A top K query directed at a table is received. The table is organized into multiple storage units. The top K query comprises a first clause to sort a result set in order and a second clause that specifies a limit on a number of results provided in response to the query. A table scan operator identifies a first set of rows from the table based on a scan set determined for the table and provides the first set of rows to a top K operator. The top K operator determines a current boundary based on the first set of rows and provides the current boundary to the table scan operator. The table scan operator prunes the scan set based on the current boundary and identifies a second set of rows from the table based on the pruning.

    Pruning data based on state of top K operator

    公开(公告)号:US11880369B1

    公开(公告)日:2024-01-23

    申请号:US18057563

    申请日:2022-11-21

    Applicant: Snowflake Inc.

    CPC classification number: G06F16/24557 G06F16/24578

    Abstract: A top K query directed at a table is received. The table is organized into multiple storage units. The top K query comprises a first clause to sort a result set in order and a second clause that specifies a limit on a number of results provided in response to the query. A table scan operator identifies a first set of rows from the table based on a scan set determined for the table and provides the first set of rows to a top K operator. The top K operator determines a current boundary based on the first set of rows and provides the current boundary to the table scan operator. The table scan operator prunes the scan set based on the current boundary and identifies a second set of rows from the table based on the pruning.

    Join query processing using pruning index

    公开(公告)号:US11593379B2

    公开(公告)日:2023-02-28

    申请号:US17804630

    申请日:2022-05-31

    Applicant: Snowflake Inc.

    Abstract: A query directed at a table organized into a set of batch units is received. The query comprises a predicate for which values are unknown prior to runtime. A set of values for the predicate are determined based on the query. An index access plan is created based on the set of values. Based on the index access plan, the set of batch units are pruned using a pruning index associated with the table. The pruning index comprises a set of filters that index distinct values in each column of the table. The pruning of the set of batch units comprises identifying a subset of batch units to scan for data that satisfies the query. The subset of batch units of the table are scanned to identify data that satisfies the query.

Patent Agency Ranking