Tracking intermediate changes in database data

    公开(公告)号:US11106661B2

    公开(公告)日:2021-08-31

    申请号:US17219600

    申请日:2021-03-31

    Applicant: Snowflake Inc.

    Abstract: Systems, methods, and devices for tracking a series of changes to database data are disclosed. A method includes executing a transaction to modify data in a micro-partition of a table of a database by generating a new micro-partition that embodies the transaction. The method includes associating transaction data with the new micro-partition, wherein the transaction data comprises a timestamp when the transaction was fully executed, and further includes associating modification data with the new micro-partition that comprises an indication of one or more rows of the table that were modified by the transaction. The method includes joining the transaction data with the modification data to generate joined data and querying the joined data to determine a listing of intermediate modifications made to the table between a first timestamp and a second timestamp.

    Increasing clustering of partitions of a table

    公开(公告)号:US11100142B2

    公开(公告)日:2021-08-24

    申请号:US17030549

    申请日:2020-09-24

    Applicant: Snowflake Inc.

    Abstract: The subject technology creates one or more partitions based on changes to a table, at least one of the one or more partitions overlapping with respect to values of one or more attributes with at least one of another partition and a previous partition, the creating of the one or more partitions resulting in a decrease in a degree of clustering of the table. The subject technology determines, after creating the one or more partitions, that the degree of clustering of the table is below a clustering threshold, the clustering threshold corresponding to a clustering ratio, the clustering ratio determined by at least a proportion of rows in a layout of the table that satisfy an ordering criteria based at least in part a particular attribute of the one or more attributes. The subject technology increases the degree of clustering of the table by reclustering partitions of the table.

    DYNAMIC LINEAGE TRACKING OF DATABASE DATA

    公开(公告)号:US20210248127A1

    公开(公告)日:2021-08-12

    申请号:US17232927

    申请日:2021-04-16

    Applicant: Snowflake Inc.

    Abstract: The subject technology executes a change on an existing micro-partition of a table, the executing of the change comprising generating a new micro-partition that embodies the change and deleting the existing micro-partition. The subject technology receives a request for a delta for the table between a first timestamp and a second timestamp. The subject technology responsively determines the delta based on a table history, the delta including information indicating at least one operation that was performed to at least one row of a set of rows of the table, without including information as to intermediate changes made to at least one row of the set of rows of the table between the first timestamp and the second timestamp, that facilitates a reduction in storage of historical versions of the set of rows of the table.

    INCREMENTAL REFRESH OF A MATERIALIZED VIEW

    公开(公告)号:US20210240695A1

    公开(公告)日:2021-08-05

    申请号:US17238539

    申请日:2021-04-23

    Applicant: Snowflake Inc.

    Abstract: Systems, methods, and devices for incrementally refreshing a materialized view are disclosed. A method includes generating a materialized view based on a source table. The method includes merging the source table and the materialized view to generate a merged table to identify whether an update has been executed on the source table that is not reflected in the materialized view. The method includes, in response to detecting an update made to the source table that is not reflected in the materialized view, applying the update to the materialized view.

    Caching metadata for a database table

    公开(公告)号:US11080270B2

    公开(公告)日:2021-08-03

    申请号:US17086253

    申请日:2020-10-30

    Applicant: Snowflake Inc.

    Abstract: The subject technology caches, in connection with processing a first query, one or more table-metadata files in at least one of a current version set of one or more table-metadata files and a new version set of one or more table-metadata files, the new version set of one or more table-metadata files comprising table metadata for a new version of a database table. The subject technology receives a subsequent query directed to the database table. The subject technology downloads, in connection with processing the subsequent query, at least one uncached table-metadata file in a scan set of table-metadata files for the subsequent query. The subject technology processes the subsequent query using the at least one uncached table-metadata file.

Patent Agency Ranking