ONLINE INDEX CREATION FOR MULTI-VERSION DATABASE WITHOUT TABLE LOCKS

    公开(公告)号:US20240394244A1

    公开(公告)日:2024-11-28

    申请号:US18790664

    申请日:2024-07-31

    Applicant: Snowflake Inc.

    Abstract: The subject technology initiates an online index building process of an index of a hybrid table. The subject technology includes, in the index, a new index record for each record from the hybrid table. The subject technology validates the index. The subject technology retrieves a first metadata version of the hybrid table from a metadata database. The subject technology retrieves a second metadata version of the hybrid table from a distributed data store. The subject technology compares the first metadata version with the second metadata version. The subject technology determines that the second metadata version is less than or equal to the first metadata version. The subject technology, in response to successfully validating the index, indicates in a particular entry of the index that validation of the index has succeeded.

    HYBRID TABLE SECONDARY INDEX FOR LOOKUPS, UNIQUE CHECKS, AND REFERENTIAL INTEGRITY CONSTRAINTS

    公开(公告)号:US20240104116A1

    公开(公告)日:2024-03-28

    申请号:US18524784

    申请日:2023-11-30

    Applicant: Snowflake Inc.

    CPC classification number: G06F16/283 G06F16/2264 G06F16/284

    Abstract: The subject technology generates a nested object based on a set of metadata, the set of metadata including information linking the nested object to a table object associated with a base table. The subject technology generates a second table object associated with the nested object, the second table object representing a secondary index of the base table, the second table object including information linking the second table object to the nested object. The subject technology generates a second nested object based on a particular set of metadata, the particular set of metadata including information linking the second nested object to the table object. The subject technology generates a third table object associated with the second nested object, the third table object representing a particular secondary index of the base table, the third table object including information linking the third table object to the second nested object.

    SCALABLE COMPACTION FOR A DISTRIBUTED DATABASE

    公开(公告)号:US20230325409A1

    公开(公告)日:2023-10-12

    申请号:US18326683

    申请日:2023-05-31

    Applicant: Snowflake Inc.

    CPC classification number: G06F16/273 G06F16/2343 G06F16/2379

    Abstract: The subject technology performs, using asynchronous threads of execution nodes of a distributed database, asynchronous transformation and compaction of key-value pairs of data that correspond to a plurality of transactional queries, the asynchronous threads transforming the key-value pairs from an initial format to a commit format that includes a commit time of data committed to the distributed database, the asynchronous threads compacting the key-value pairs by deleting the key-value pairs in the initial format. The subject technology processes, using transactional threads in the execution nodes, the plurality of transactional queries to generate results data, the asynchronous threads performing the asynchronous transformation and compaction while the transactional threads generate the results data. The subject technology stores the results data.

    CONCURRENT TRANSACTION PROCESSING IN A DATABASE SYSTEM

    公开(公告)号:US20220092050A1

    公开(公告)日:2022-03-24

    申请号:US17454368

    申请日:2021-11-10

    Applicant: Snowflake Inc.

    Abstract: The subject technology inserts, by a first transaction, a new version of an object, the inserting corresponding to a first statement from the first transaction, the first statement including information indicating a first key, a first transaction identifier, a first statement number, and a first restart count. The subject technology performs, by a second transaction, a range read, the range read including information indicating the first key. The subject technology receives, after performing the range read, a set of conflicting transactions. The subject technology determines that a conflict occurred between the first transaction and a third transaction from the set of conflicting transactions.

    Distributed database that uses hybrid table secondary indexes

    公开(公告)号:US12235872B2

    公开(公告)日:2025-02-25

    申请号:US18171297

    申请日:2023-02-17

    Applicant: Snowflake Inc.

    Abstract: The subject technology receives, by a distributed database, transactional queries against a key-value database comprising data managed by key-value pairs. The subject technology performs, using asynchronous threads of execution nodes of the distributed database, asynchronous transformation and compaction of key-value pairs of data that corresponds to the plurality of transactional queries. The subject technology performs, using asynchronous threads of execution nodes of the distributed database, asynchronous transformation and compaction of a set of secondary indexes that corresponds to the plurality of transactional queries. The subject technology processes, using transactional threads in the execution nodes, the plurality of transactional queries to generate results data, the asynchronous threads performing the asynchronous transformation and compaction while the transactional threads generate the results data.

    QUERY PROCESSING USING HYBRID TABLE SECONDARY INDEXES

    公开(公告)号:US20250005010A1

    公开(公告)日:2025-01-02

    申请号:US18759124

    申请日:2024-06-28

    Applicant: Snowflake Inc

    Abstract: The subject technology determines whether a key exists in a parent table associated with a first transaction. The subject technology performs a first write operation on a child table. The subject technology determines whether a duplicate key exists in the child table based on the key of the first write operation. The subject technology determines whether there is a conflict with the key. The subject technology, in response to determining that there is no conflict with the key, performs a second write operation on a secondary index table of the child table. The subject technology determines whether a particular duplicate key exists in the secondary index table based on a particular key from the second write operation. The subject technology, in response to determining that there is the particular duplicate key in the secondary index table, throws a uniqueness exception.

    HYBRID TABLE SECONDARY INDEX FOR LOOKUPS, UNIQUE CHECKS, AND REFERENTIAL INTEGRITY CONSTRAINTS

    公开(公告)号:US20230401236A1

    公开(公告)日:2023-12-14

    申请号:US18171292

    申请日:2023-02-17

    Applicant: Snowflake Inc.

    CPC classification number: G06F16/283 G06F16/284 G06F16/2264

    Abstract: The subject technology receives, from a metadata database, information related to a base table. The subject technology determines a table object associated with the base table, the table object including a first set of metadata. The subject technology generates a nested object based on a second set of metadata, the second set of metadata including information linking the nested object to the table object. The subject technology generates a second table object associated with the nested object, the second table object representing a secondary index of the base table, the second table object including information linking the second table object to the nested object. The subject technology establishes a link between the second table object to the base table based on the nested object. The subject technology stores, in the metadata database, the nested object and the second table object.

    A DISTRIBUTED DATABASE THAT USES HYBRID TABLE SECONDARY INDEXES

    公开(公告)号:US20230205785A1

    公开(公告)日:2023-06-29

    申请号:US18171297

    申请日:2023-02-17

    Applicant: Snowflake Inc.

    CPC classification number: G06F16/273 G06F9/466

    Abstract: The subject technology receives, by a distributed database, transactional queries against a key-value database comprising data managed by key-value pairs. The subject technology performs, using asynchronous threads of execution nodes of the distributed database, asynchronous transformation and compaction of key-value pairs of data that corresponds to the plurality of transactional queries. The subject technology performs, using asynchronous threads of execution nodes of the distributed database, asynchronous transformation and compaction of a set of secondary indexes that corresponds to the plurality of transactional queries. The subject technology processes, using transactional threads in the execution nodes, the plurality of transactional queries to generate results data, the asynchronous threads performing the asynchronous transformation and compaction while the transactional threads generate the results data.

    RUNTIME JOIN PRUNING TO IMPROVE JOIN PERFORMANCE FOR TABLES

    公开(公告)号:US20250036620A1

    公开(公告)日:2025-01-30

    申请号:US18649509

    申请日:2024-04-29

    Applicant: Snowflake Inc.

    Abstract: The subject technology receives a query, the query including a statement for performing a join operation on a first table and a second table. The subject technology executes, by an execution node using a generated query plan, the query. The subject technology performs, during execution of the query by the execution node, a runtime range pruning process. The subject technology determines a set of range sets for pruning, each range set including a set of columns from the first table, the set of columns being removed from undergoing a read operation as part of executing the query. The subject technology determines, based on a range bloom vector, a set of rows in a particular range set of the first table to avoid scanning in connection with the read operation. The subject technology performs the read operation based a remaining set of rows.

Patent Agency Ranking