-
公开(公告)号:US20210326324A1
公开(公告)日:2021-10-21
申请号:US17364562
申请日:2021-06-30
Applicant: Snowflake, Inc.
Inventor: Subramanian Muralidhar , Istvan Cseri , Torsten Grabs , Benoit Dageville
IPC: G06F16/23 , G06F16/27 , G06F16/245 , G06F16/18
Abstract: Systems, methods, and devices for generating a transactional change tracking summary for a database. A method includes executing a transaction on a table of a database, wherein the table includes a micro-partition and the transaction is executed on the micro-partition. The method includes, in response to the transaction being fully executed, generating a change tracking entry comprising an indication of one or modifications made to the table by the transaction. The method includes storing the change tracking entry in a change tracking stream, wherein the change tracking stream includes one or more sequential change tracking entries that each correspond to a different transaction that is fully executed on the table.
-
公开(公告)号:US20210318994A1
公开(公告)日:2021-10-14
申请号:US17345422
申请日:2021-06-11
Applicant: Snowflake Inc.
Inventor: Istvan Cseri , Tyler Jones , Dinesh Chandrakant Kulkarni , Igor Zinkovsky
IPC: G06F16/18 , G06F16/182 , G06F16/2453 , G06F16/2455 , G06F16/23
Abstract: The subject technology determines, using a stream object provided by a network-based database system separate from an external system, a set of offsets of each shard of a set of shards, the set of offsets being determined by executing a user defined function (UDF) included in the stream object, the stream object and the UDF being implemented in a particular programming language different than a database query language. The subject technology identifies an operation to perform on the set of shards. The subject technology, based on the set of shards and the set of offsets, performs, using the UDF included in the stream object, the operation on an external data source hosted by the external system. The subject technology increments a property indicating a different version of the stream object after performing the operation on the external data source, the property being included in the stream object.
-
公开(公告)号:US11086841B1
公开(公告)日:2021-08-10
申请号:US16779045
申请日:2020-01-31
Applicant: Snowflake Inc.
Inventor: Istvan Cseri , Dinesh Chandrakant Kulkarni
IPC: G06F16/00 , G06F16/23 , G06F16/2457 , G06F16/17 , G06F16/18 , G06F16/16 , G06F16/22 , G06F16/182
Abstract: A database platform can track changes using a stream that records changes made by database transactions. Streams on views can be implemented by applying a stream to source data, and applying the view to the streamed data. Users of the database platform can query the stream on the view to review recent changes to the view, such as changes made since the last database transaction.
-
公开(公告)号:US20210240690A1
公开(公告)日:2021-08-05
申请号:US16779045
申请日:2020-01-31
Applicant: Snowflake Inc.
Inventor: Istvan Cseri , Dinesh Chandrakant Kulkarni
IPC: G06F16/23 , G06F16/22 , G06F16/2457 , G06F16/17 , G06F16/18 , G06F16/16 , G06F16/182
Abstract: A database platform can track changes using a stream that records changes made by database transactions. Streams on views can be implemented by applying a stream to source data, and applying the view to the streamed data. Users of the database platform can query the stream on the view to review recent changes to the view, such as changes made since the last database transaction.
-
公开(公告)号:US11080257B2
公开(公告)日:2021-08-03
申请号:US16410695
申请日:2019-05-13
Applicant: Snowflake Inc.
Inventor: Istvan Cseri , Torsten Grabs , Thierry Cruanes , Subramanian Muralidhar , Benoit Dageville
Abstract: Systems, methods, and devices for storing database data in journal tables comprising a snapshot and a log table. A method includes defining a journal table comprising a snapshot and a log table, the snapshot comprising an up-to-date representation of data in the journal table at a point in time. The method includes assigning a timestamp to the snapshot indicating when the snapshot was generated. The method includes receiving a request to execute a transaction on the journal table to modify the data in the journal table, the transaction comprising one or more of an insert, a delete, an update, or a merge. The method includes inserting a new row into the log table in lieu of executing the transaction on the snapshot of the journal table, the new row comprising an indication of a change requested to be made to the journal table based on the transaction.
-
公开(公告)号:US10983980B2
公开(公告)日:2021-04-20
申请号:US16942717
申请日:2020-07-29
Applicant: Snowflake Inc.
Inventor: Istvan Cseri , Torsten Grabs , Benoit Dageville
IPC: G06F16/23 , G06F16/27 , G06F16/2455
Abstract: The subject technology executes a change on an existing micro-partition of a table of a database, the executing of the change comprising generating a new micro-partition that embodies the change. The subject technology receives a request for a delta for the table between a first timestamp and a second timestamp. The subject technology queries at least one change tracking column to determine the delta between the first timestamp and the second timestamp, the delta including information indicating at least one database operation that was performed to at least one row of a set of rows of the table, without including information as to intermediate changes made to at least one row of the set of rows of the table between the first timestamp and the second timestamp, that facilitates a reduction in storage of historical versions of the set of rows of the table.
-
公开(公告)号:US20200167340A1
公开(公告)日:2020-05-28
申请号:US16203322
申请日:2018-11-28
Applicant: Snowflake Inc.
Inventor: Istvan Cseri , Torsten Grabs , Benoit Dageville
Abstract: Systems, methods, and devices for executing a task on database data in response to a trigger event are disclosed. A method includes executing a transaction on a table comprising database data, wherein executing the transaction comprises generating a new table version. The method includes, in response to the transaction being fully executed, generating a change tracking entry comprising an indication of one or more modifications made to the table by the transaction and storing the change tracking entry in a change tracking stream. The method includes executing a task on the new table version in response to a trigger event.
-
公开(公告)号:US12216638B2
公开(公告)日:2025-02-04
申请号:US18158627
申请日:2023-01-24
Applicant: Snowflake Inc.
Inventor: Tyler Arthur Akidau , Istvan Cseri , Fabian Hueske , Tyler Jones , Yevgeniy Kogan , Dzmitry Pauliukevich , Daniel E. Sotolongo
IPC: G06F16/23 , G06F16/22 , G06F16/2455
Abstract: Techniques described herein can accelerate change data capture determinations such as stream reads, which show changes made to a table between two points in time. Three distinct row bitests that mark deleted, updated, inserted, rows in micro-partitions can be added as metadata for the table. These bitsets can be generated during DML operations and then stored as metadata of the new partition generated by the DML operations. The bitsets can then be used to generate streams showing the changes in the table between two points in time (changes interval).
-
公开(公告)号:US12130795B2
公开(公告)日:2024-10-29
申请号:US18359322
申请日:2023-07-26
Applicant: Snowflake Inc.
Inventor: Tyler Arthur Akidau , Istvan Cseri , Tyler Jones , Daniel E. Sotolongo
IPC: G06F16/22 , G06F16/23 , G06F16/2455
CPC classification number: G06F16/2282 , G06F16/2358 , G06F16/24568
Abstract: A system or persistent table may be generated storing changelog information of a primary base table. The system table may then be used to create streams of relevant information. In some examples, the streams may read from the system table for information past a retention period of the primary table while reading from the primary table information in the retention period.
-
公开(公告)号:US11928098B2
公开(公告)日:2024-03-12
申请号:US17809203
申请日:2022-06-27
Applicant: Snowflake Inc.
Inventor: Subramanian Muralidhar , Istvan Cseri , Torsten Grabs , Benoit Dageville
IPC: G06F16/23 , G06F16/18 , G06F16/245 , G06F16/27 , G06F9/46
CPC classification number: G06F16/2358 , G06F16/1865 , G06F16/2322 , G06F16/2365 , G06F16/245 , G06F16/278 , G06F9/466
Abstract: A system includes one or more processors and data storage containing instructions executable by the one or more processors to perform operations. The operations include detecting a first executed transaction causing a first modification to table data stored in a table. The table data is associated with a corresponding metadata file with metadata information of the table. A new metadata file is generated responsive to the first executed transaction. The new metadata file includes the metadata information and additional metadata associated with the first modification. A second executed transaction causing a second modification to the table data is detected. The table data is updated with a change tracking column. The change tracking column includes lineage of executed transactions on the table data. The lineage indicates at least the first transaction and the second transaction.
-
-
-
-
-
-
-
-
-