MAINTAINING STATES OF PARTITIONS OF A TABLE FOR ADJUSTING CLUSTERING RATIOS

    公开(公告)号:US20210216574A1

    公开(公告)日:2021-07-15

    申请号:US17219609

    申请日:2021-03-31

    Applicant: Snowflake Inc.

    Abstract: The subject technology creates partitions based on changes to a table, at least one of the one or more partitions overlapping with respect to values of one or more attributes with at least one of another partition and a previous partition. The subject technology maintains states for the partitions, each state from the plurality of states representing a particular degree of clustering of the table. The subject technology determines a number of overlapping partitions and a depth of the overlapping partitions, and determines a clustering ratio based at least in part on the number of overlapping partitions and the depth. The subject technology reclusters partitions of the table to increase the clustering ratio, the clustering ratio determined by at least a proportion of rows in a layout of the table that satisfy an ordering criteria based at least in part a particular attribute of the one or more attributes.

    CLONING CATALOG OBJECTS
    312.
    发明申请

    公开(公告)号:US20210205120A1

    公开(公告)日:2021-07-08

    申请号:US17207508

    申请日:2021-03-19

    Abstract: Example systems and methods for cloning catalog objects are described. In one implementation, a method identifies an original catalog object associated with data and creates a duplicate copy of the original catalog object without copying the data itself. The method allows access to the data using the duplicate catalog object and supports modifying the data associated with the original catalog object independently of the duplicate catalog object. The duplicate catalog object can be deleted upon completion of modifying the data associated with the original catalog object.

    Batch data ingestion in database systems

    公开(公告)号:US11055280B2

    公开(公告)日:2021-07-06

    申请号:US16201854

    申请日:2018-11-27

    Applicant: Snowflake Inc.

    Abstract: Systems, methods, and devices for batch ingestion of data into a table of a database. A method includes determining a notification indicating a presence of a user file received from a client account to be ingested into a database. The method includes identifying data in the user file and identifying a target table of the database to receive the data in the user file. The method includes generating an ingest task indicating the data and the target table. The method includes assigning the ingest task to an execution node of an execution platform, wherein the execution platform comprises a plurality of execution nodes operating independent of a plurality of shared storage devices collectively storing database data. The method includes registering metadata concerning the target table in a metadata store after the data has been fully committed to the target table by the execution node.

    TRACKING LINEAGE OF DATABASE DATA
    314.
    发明申请

    公开(公告)号:US20210182268A1

    公开(公告)日:2021-06-17

    申请号:US17162790

    申请日:2021-01-29

    Applicant: Snowflake Inc.

    Abstract: The subject technology executes a change on an existing micro-partition of a table of a database, the executing of the change comprising generating a new micro-partition that embodies the change. The subject technology receives a request for a delta for the table between a first timestamp and a second timestamp. The subject technology queries at least one change tracking column to determine the delta between the first timestamp and the second timestamp, the delta including information indicating at least one database operation that was performed to at least one row of a set of rows of the table, without including information as to intermediate changes made to at least one row of the set of rows of the table between the first timestamp and the second timestamp, that facilitates a reduction in storage of historical versions of the set of rows of the table.

    Querying over external tables in database systems

    公开(公告)号:US11030191B2

    公开(公告)日:2021-06-08

    申请号:US16841831

    申请日:2020-04-07

    Applicant: Snowflake Inc.

    Abstract: Systems, methods, and devices for querying over an external table are disclosed. A method includes connecting a database platform to an external table such that the database platform has read access for the external table and does not have write access for the external table. The method includes receiving a query comprising a predicate, the query directed at least to data in the external table. The method includes determining, based on metadata, one or more partitions in the external table comprising data satisfying the predicate. The method includes pruning, based on the metadata, all partitions in the external table that do not comprise any data satisfying the predicate. The method includes generating a query plan comprising a plurality of discrete subtasks. The method includes assigning, based on the metadata, the plurality of discrete subtasks to one or more nodes in an execution platform.

    Data sharing in database systems
    319.
    发明授权

    公开(公告)号:US10997169B2

    公开(公告)日:2021-05-04

    申请号:US17125453

    申请日:2020-12-17

    Applicant: Snowflake Inc.

    Abstract: Using one or more processors, generating a materialized view over a share object, the share object, including data associated with a first account. Using the one or more processors, merging the materialized view and the share object. Using the one or more processors, updating the data associated with the first account subsequent to generating the materialized view. Using the one or more processors, generating a refreshed materialized view with respect to the share object in response to detecting the updated data. Using the one or more processors, sharing the refreshed materialized view to a second account in response to a request from the second account.

    Systems, methods, and devices for encrypting database data

    公开(公告)号:US10977383B2

    公开(公告)日:2021-04-13

    申请号:US15286112

    申请日:2016-10-05

    Applicant: Snowflake Inc.

    Abstract: A method for encrypting database data includes generating an encryption key for a first file stored in a data store, wherein a table in a database comprises an entry pointing to the first file. The method includes generating a second file by encrypting the data the first file in the data store using the encryption key without modifying the first file. The method includes, in response to generating the second file, modifying the entry in the table to point to the second file, wherein the modification of the entry is performed atomically. A process for rekeying from the first file to the second file may happen in the background without blocking, interfering, or otherwise obstructing user interaction with a database system.

Patent Agency Ranking