Extensible streams for operations on external systems

    公开(公告)号:US11216421B2

    公开(公告)日:2022-01-04

    申请号:US17345422

    申请日:2021-06-11

    Applicant: Snowflake Inc.

    Abstract: The subject technology determines, using a stream object provided by a network-based database system separate from an external system, a set of offsets of each shard of a set of shards, the set of offsets being determined by executing a user defined function (UDF) included in the stream object, the stream object and the UDF being implemented in a particular programming language different than a database query language. The subject technology identifies an operation to perform on the set of shards. The subject technology, based on the set of shards and the set of offsets, performs, using the UDF included in the stream object, the operation on an external data source hosted by the external system. The subject technology increments a property indicating a different version of the stream object after performing the operation on the external data source, the property being included in the stream object.

    Dynamic lineage tracking of database data

    公开(公告)号:US11194782B2

    公开(公告)日:2021-12-07

    申请号:US17232927

    申请日:2021-04-16

    Applicant: Snowflake Inc.

    Abstract: The subject technology executes a change on an existing micro-partition of a table, the executing of the change comprising generating a new micro-partition that embodies the change and deleting the existing micro-partition. The subject technology receives a request for a delta for the table between a first timestamp and a second timestamp. The subject technology responsively determines the delta based on a table history, the delta including information indicating at least one operation that was performed to at least one row of a set of rows of the table, without including information as to intermediate changes made to at least one row of the set of rows of the table between the first timestamp and the second timestamp, that facilitates a reduction in storage of historical versions of the set of rows of the table.

    Querying over external tables in database systems

    公开(公告)号:US11163756B2

    公开(公告)日:2021-11-02

    申请号:US16385774

    申请日:2019-04-16

    Applicant: Snowflake Inc.

    Abstract: Systems, methods, and devices for querying over an external table are disclosed. A method includes connecting a database platform to an external table such that the database platform has read access for the external table and does not have write access for the external table. The method includes receiving a query comprising a predicate, the query directed at least to data in the external table. The method includes determining, based on metadata, one or more partitions in the external table comprising data satisfying the predicate. The method includes pruning, based on the metadata, all partitions in the external table that do not comprise any data satisfying the predicate. The method includes generating a query plan comprising a plurality of discrete subtasks. The method includes assigning, based on the metadata, the plurality of discrete subtasks to one or more nodes in an execution platform.

    Task scheduling in database systems

    公开(公告)号:US11157486B2

    公开(公告)日:2021-10-26

    申请号:US16705533

    申请日:2019-12-06

    Applicant: Snowflake Inc.

    Abstract: Systems, methods, and devices for executing a task on database data in response to a trigger event are disclosed. A method includes executing a transaction on a table comprising database data, wherein executing the transaction comprises generating a new table version. The method includes, in response to the transaction being fully executed, generating a change tracking entry comprising an indication of one or more modifications made to the table by the transaction and storing the change tracking entry in a change tracking stream. The method includes executing a task on the new table version in response to a trigger event.

    Materialized views over external tables in database systems

    公开(公告)号:US11138190B2

    公开(公告)日:2021-10-05

    申请号:US16775110

    申请日:2020-01-28

    Applicant: Snowflake Inc.

    Abstract: Systems, methods, and devices for generating a materialized view over an external table. A method includes connecting a database platform to an external table such that the database platform has read access for the external table and does not have write access for the external table. The method includes generating, by the database platform, a materialized view over the external table. The method includes receiving a notification that a modification has been made to the external table, the modification comprising one or more of an addition of a file, a deletion of a file, or an update to a file in a source location for the external table. The method includes, in response to the external table being modified, refreshing the materialized view such that the materialized view comprises an accurate representation of the external table.

    Real-time streaming data ingestion into database tables

    公开(公告)号:US11100111B1

    公开(公告)日:2021-08-24

    申请号:US17226423

    申请日:2021-04-09

    Applicant: Snowflake Inc.

    Abstract: A streaming ingest platform can improve latency and expense issues related to uploading data into a cloud data system. The streaming ingest platform can organize the data to be ingested into per-table chunks and per-account blobs. This data may be committed and may be made available for query processing before it is ingested into the target source tables. This significantly improves latency issues. The streaming ingest platform can also accommodate uploading data from various sources with different processing and communication capabilities, such as Internet of Things (IOT) devices.

    Transactional streaming of change tracking data

    公开(公告)号:US11086840B2

    公开(公告)日:2021-08-10

    申请号:US16213554

    申请日:2018-12-07

    Applicant: Snowflake Inc.

    Abstract: Systems, methods, and devices for generating a transactional change tracking summary for a database. A method includes executing a transaction on a table of a database, wherein the table includes a micro-partition and the transaction is executed on the micro-partition. The method includes, in response to the transaction being fully executed, generating a change tracking entry comprising an indication of one or modifications made to the table by the transaction. The method includes storing the change tracking entry in a change tracking stream, wherein the change tracking stream includes one or more sequential change tracking entries that each correspond to a different transaction that is fully executed on the table.

    INVOKING EXTERNAL TABLE FUNCTIONS FROM A DATA SYSTEM

    公开(公告)号:US20210240557A1

    公开(公告)日:2021-08-05

    申请号:US17238558

    申请日:2021-04-23

    Applicant: Snowflake Inc.

    Abstract: A query referencing a function associated with a remote software component is received by a network-based data warehouse system. Temporary security credentials corresponding to a role at a cloud computing service platform are obtained. The role has permission to send calls to a web endpoint corresponding to the remote software component. A request comprising input data and electronically signed using the temporary security credentials is sent to a web Application Programming Interface (API) management system of the cloud computing service platform. The request, when received by the web API management system, causes the web API management system to invoke external functionality provided by the remote software component at the web endpoint with respect to the input data. A response comprising a result of invoking the external functionality is received from the web API management system, and the result data is processed according to the query.

    INVOKING EXTERNAL FUNCTIONS FROM A DATA SYSTEM

    公开(公告)号:US20210216385A1

    公开(公告)日:2021-07-15

    申请号:US17219858

    申请日:2021-03-31

    Applicant: Snowflake Inc.

    Abstract: A query referencing a function associated with a remote software component is received by a network-based data warehouse system. Temporary security credentials corresponding to a role at a cloud computing service platform are obtained. The role has permission to send calls to a web endpoint corresponding to the remote software component. A request comprising input data and electronically signed using the temporary security credentials is sent to a web Application Programming Interface (API) management system of the cloud computing service platform. The request, when received by the web API management system, causes the web API management system to invoke external functionality provided by the remote software component at the web endpoint with respect to the input data. A response comprising a result of invoking the external functionality is received from the web API management system, and the result data is processed according to the query.

    Automated maintenance of external tables in database systems

    公开(公告)号:US10997165B2

    公开(公告)日:2021-05-04

    申请号:US16842942

    申请日:2020-04-08

    Applicant: Snowflake Inc.

    Abstract: Systems, methods, and devices for automated maintenance of external tables in database systems are disclosed. A method includes receiving, by a database platform, read access to content in an external data storage platform that is separate from the database platform. The method includes defining an external table based on the content in the external data storage platform. The method includes connecting the database platform to the external table such that the database platform has read access for the external table and does not have write access for the external table. The method includes generating metadata for the external table, the metadata comprising information about data stored in the external table. The method includes receiving a notification that a modification has been made to the content in the external data storage platform, the modification comprising one or more of an addition of a file, a deletion of a file, or an update to a file in a source location for the external table. The method includes refreshing the metadata for the external table in response to the modification being made to the content in the external data storage platform.

Patent Agency Ranking