INCREMENTAL FEATURE DEVELOPMENT AND WORKLOAD CAPTURE IN DATABASE SYSTEMS

    公开(公告)号:US20220237161A1

    公开(公告)日:2022-07-28

    申请号:US17721938

    申请日:2022-04-15

    Applicant: Snowflake Inc.

    Abstract: Systems, methods, and devices for incremental feature development and workload capture in database systems are disclosed. The method includes determining a workload based on one or more client queries to be rerun for testing the feature release. The method further includes repeatedly executing a test run of the workload to identify whether there is a performance difference in the test run. The method further includes re-execute, in response to identifying the performance difference, the test run using resources with a different concurrency to identify whether the performance difference still exists. The method further includes flagging the performance difference as a false positive in response to identifying that the performance difference no longer exists when the test run is re-executed using the resources with different concurrency.

    TRACKING INTERMEDIATE CHANGES IN DATABASE DATA

    公开(公告)号:US20220222247A1

    公开(公告)日:2022-07-14

    申请号:US17656960

    申请日:2022-03-29

    Applicant: Snowflake Inc.

    Abstract: Systems, methods, and devices for tracking a series of changes to database data are disclosed. A method includes executing a transaction to modify data in a micro-partition of a table of a database by generating a new micro-partition that embodies the transaction. The method includes associating transaction data with the new micro-partition, wherein the transaction data comprises a timestamp when the transaction was fully executed, and further includes associating modification data with the new micro-partition that comprises an indication of one or more rows of the table that were modified by the transaction. The method includes joining the transaction data with the modification data to generate joined data and querying the joined data to determine a listing of intermediate modifications made to the table between a first timestamp and a second timestamp.

    DATA REPLICATION AND DATA FAILOVER IN DATABASE SYSTEMS

    公开(公告)号:US20220215041A1

    公开(公告)日:2022-07-07

    申请号:US17703804

    申请日:2022-03-24

    Applicant: Snowflake Inc.

    Abstract: Replication and failover of database data is disclosed. A method includes copying a first dataset of a first account stored in a primary deployment to a secondary deployment. The method includes determining that the first dataset stored in the primary deployment is unavailable. The method includes executing a first transaction on the first dataset at the secondary deployment in response to determining that the first dataset stored in the primary deployment is unavailable. The method includes executing a second transaction on a second dataset of a second account at the primary deployment while the first dataset of the first account stored in the primary deployment is unavailable.

    Parallel execution of query sub-plans

    公开(公告)号:US11379480B1

    公开(公告)日:2022-07-05

    申请号:US17647629

    申请日:2022-01-11

    Applicant: Snowflake Inc.

    Abstract: Sub-plans are executed in parallel using a plurality of execution nodes, which can be part of a data platform. In particular, sub-plans (e.g., fragments or portions of one or more child operators) of a root operator are identified in a query plan such that the identified sub-plans that are candidates for execution on a single execution node, determine a cost estimate for causing the candidate sub-plans to be executed in parallel using multiple execution nodes, and cause the candidate sub-plans to be executed in parallel based on the cost estimate.

    PROCESSING QUERIES ON SEMI-STRUCTURED DATA COLUMNS

    公开(公告)号:US20220207041A1

    公开(公告)日:2022-06-30

    申请号:US17655124

    申请日:2022-03-16

    Applicant: Snowflake Inc.

    Abstract: A source table organized into a set of batch units is accessed. The source table comprises a column of data corresponding to a semi-structured data type. One or more indexing transformations for an object in the column are generated. The generating of the one or more indexing transformation includes converting the object to one or more stored data types. A pruning index is generated for the source table based in part on the one or more indexing transformations. The pruning index comprises a set of filters that index distinct values in each column of the source table, and each filter corresponds to a batch unit in the set of batch units. The pruning index is stored in a database with an association with the source table.

Patent Agency Ranking