Runtime join pruning to improve join performance for database tables

    公开(公告)号:US11995080B1

    公开(公告)日:2024-05-28

    申请号:US18358402

    申请日:2023-07-25

    Applicant: Snowflake Inc.

    CPC classification number: G06F16/24537 G06F16/24542 G06F16/24549

    Abstract: The subject technology receives a query, the query including a statement for performing a join operation on a first table and a second table. The subject technology executes, by an execution node using a generated query plan, the query. The subject technology performs, during execution of the query by the execution node, a runtime range pruning process. The subject technology determines a set of range sets for pruning, each range set including a set of columns from the first table, the set of columns being removed from undergoing a read operation as part of executing the query. The subject technology determines, based on a range bloom vector, a set of rows in a particular range set of the first table to avoid scanning in connection with the read operation. The subject technology performs the read operation based a remaining set of rows.

    ONLINE INDEX CREATION FOR MULTI-VERSION DATABASE WITHOUT TABLE LOCKS

    公开(公告)号:US20240394244A1

    公开(公告)日:2024-11-28

    申请号:US18790664

    申请日:2024-07-31

    Applicant: Snowflake Inc.

    Abstract: The subject technology initiates an online index building process of an index of a hybrid table. The subject technology includes, in the index, a new index record for each record from the hybrid table. The subject technology validates the index. The subject technology retrieves a first metadata version of the hybrid table from a metadata database. The subject technology retrieves a second metadata version of the hybrid table from a distributed data store. The subject technology compares the first metadata version with the second metadata version. The subject technology determines that the second metadata version is less than or equal to the first metadata version. The subject technology, in response to successfully validating the index, indicates in a particular entry of the index that validation of the index has succeeded.

    HYBRID TABLE SECONDARY INDEX FOR LOOKUPS, UNIQUE CHECKS, AND REFERENTIAL INTEGRITY CONSTRAINTS

    公开(公告)号:US20240104116A1

    公开(公告)日:2024-03-28

    申请号:US18524784

    申请日:2023-11-30

    Applicant: Snowflake Inc.

    CPC classification number: G06F16/283 G06F16/2264 G06F16/284

    Abstract: The subject technology generates a nested object based on a set of metadata, the set of metadata including information linking the nested object to a table object associated with a base table. The subject technology generates a second table object associated with the nested object, the second table object representing a secondary index of the base table, the second table object including information linking the second table object to the nested object. The subject technology generates a second nested object based on a particular set of metadata, the particular set of metadata including information linking the second nested object to the table object. The subject technology generates a third table object associated with the second nested object, the third table object representing a particular secondary index of the base table, the third table object including information linking the third table object to the second nested object.

    Deferred constraints support in distributed database systems

    公开(公告)号:US12007990B1

    公开(公告)日:2024-06-11

    申请号:US18447934

    申请日:2023-08-10

    Applicant: Snowflake Inc.

    CPC classification number: G06F16/24537 G06F16/24542 G06F16/27

    Abstract: The subject technology receives a query, the query including a set of statements, the set of statements including a first statement comprising a first data manipulation language operation. The subject technology determines that the set of statements includes at least one statement defining a first constraint, and a second statement indicating that the first constraint is statement deferrable for performing validation of the first constraint. The subject technology generates a key value constraint check operator as a node in a query plan. The subject technology generates a key value DML operator as a second node in the query plan, the key value DML operator corresponding to the first DML operation. The subject technology performs, using the key value constraint check operator, a validation process of the first constraint based at least in part on a set of violating keys of each DML operation from the set of statements.

    Hybrid table secondary index for lookups, unique checks, and referential integrity constraints

    公开(公告)号:US11880388B2

    公开(公告)日:2024-01-23

    申请号:US18171292

    申请日:2023-02-17

    Applicant: Snowflake Inc.

    CPC classification number: G06F16/283 G06F16/2264 G06F16/284

    Abstract: The subject technology receives, from a metadata database, information related to a base table. The subject technology determines a table object associated with the base table, the table object including a first set of metadata. The subject technology generates a nested object based on a second set of metadata, the second set of metadata including information linking the nested object to the table object. The subject technology generates a second table object associated with the nested object, the second table object representing a secondary index of the base table, the second table object including information linking the second table object to the nested object. The subject technology establishes a link between the second table object to the base table based on the nested object. The subject technology stores, in the metadata database, the nested object and the second table object.

    QUERY PROCESSING USING HYBRID TABLE SECONDARY INDEXES

    公开(公告)号:US20230401189A1

    公开(公告)日:2023-12-14

    申请号:US18171296

    申请日:2023-02-17

    Applicant: Snowflake Inc.

    CPC classification number: G06F16/2272 G06F16/283

    Abstract: The subject technology obtains a read timestamp of a first transaction. The subject technology performs a first read operation on a parent table associated with the first transaction to determine a set of committed versions of the parent table. The subject technology determines whether a key exists in the parent table based on the first transaction. The subject technology, in response to the key existing in the parent table, performs a first write operation on a child table. The subject technology determines whether a duplicate key exists in the child table. The subject technology, in response to determining that there is no duplicate key in the child table, determines whether there is a conflict with the key. The subject technology, in response to determining that there is no conflict with the key, performs a second write operation on a secondary index table of the child table.

    RUNTIME JOIN PRUNING TO IMPROVE JOIN PERFORMANCE FOR TABLES

    公开(公告)号:US20250036620A1

    公开(公告)日:2025-01-30

    申请号:US18649509

    申请日:2024-04-29

    Applicant: Snowflake Inc.

    Abstract: The subject technology receives a query, the query including a statement for performing a join operation on a first table and a second table. The subject technology executes, by an execution node using a generated query plan, the query. The subject technology performs, during execution of the query by the execution node, a runtime range pruning process. The subject technology determines a set of range sets for pruning, each range set including a set of columns from the first table, the set of columns being removed from undergoing a read operation as part of executing the query. The subject technology determines, based on a range bloom vector, a set of rows in a particular range set of the first table to avoid scanning in connection with the read operation. The subject technology performs the read operation based a remaining set of rows.

    Query processing using hybrid table secondary indexes

    公开(公告)号:US12061587B2

    公开(公告)日:2024-08-13

    申请号:US18171296

    申请日:2023-02-17

    Applicant: Snowflake Inc.

    CPC classification number: G06F16/2272 G06F16/2264 G06F16/283 G06F16/284

    Abstract: The subject technology obtains a read timestamp of a first transaction. The subject technology performs a first read operation on a parent table associated with the first transaction to determine a set of committed versions of the parent table. The subject technology determines whether a key exists in the parent table based on the first transaction. The subject technology, in response to the key existing in the parent table, performs a first write operation on a child table. The subject technology determines whether a duplicate key exists in the child table. The subject technology, in response to determining that there is no duplicate key in the child table, determines whether there is a conflict with the key. The subject technology, in response to determining that there is no conflict with the key, performs a second write operation on a secondary index table of the child table.

    ADAPTIVE SCAN FOR HYBRID TABLES
    10.
    发明申请

    公开(公告)号:US20250103572A1

    公开(公告)日:2025-03-27

    申请号:US18472819

    申请日:2023-09-22

    Applicant: Snowflake Inc.

    Abstract: The subject technology receives a query. The subject technology performs a secondary index scan operation of a secondary index based on the query. The subject technology determines that a number of rows returned, from the secondary index scan operation, meets a threshold value. The subject technology sends a signal to a primary index probe operation to initiate a blob storage scan using at least one range scan operation. The subject technology extracts a primary key of each row from the rows returned from the secondary index scan operation. The subject technology identifies a corresponding range of each extracted primary key in the blob storage. The subject technology scans each corresponding range of the blob storage to identify a set of rows that satisfy a set of predicates on an index column. The subject technology provides a set of rows as a result of the query.

Patent Agency Ranking