EXTENSIBLE STREAMS FOR OPERATIONS ON EXTERNAL SYSTEMS

    Publication No.: US20210318994A1

    Publication date: 2021-10-14

    Application No.: US17345422

    Filing date: 2021-06-11

    Applicant: Snowflake Inc.

    Abstract: The subject technology determines, using a stream object provided by a network-based database system separate from an external system, a set of offsets of each shard of a set of shards, the set of offsets being determined by executing a user defined function (UDF) included in the stream object, the stream object and the UDF being implemented in a particular programming language different than a database query language. The subject technology identifies an operation to perform on the set of shards. The subject technology, based on the set of shards and the set of offsets, performs, using the UDF included in the stream object, the operation on an external data source hosted by the external system. The subject technology increments a property indicating a different version of the stream object after performing the operation on the external data source, the property being included in the stream object.
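    The flow in this abstract (look up per-shard offsets through a function carried by the stream object, perform an operation on the external source, then bump a version property) can be illustrated with a rough sketch. The class `StreamObject`, its `run` method, and the dictionary standing in for the external data source are all hypothetical names chosen for illustration; the sketch also passes the operation in as a separate callable, which is a simplification and not something the abstract states.

```python
from typing import Callable, Dict, List


class StreamObject:
    """Hypothetical stand-in for a stream object that carries a UDF."""

    def __init__(self, offsets_udf: Callable[[str], int]):
        self.offsets_udf = offsets_udf  # assumed: returns the offset for a shard
        self.version = 0                # property incremented after each operation

    def run(self, shards: List[str], operation, source: Dict[str, list]) -> Dict[str, list]:
        # Step 1: determine the offset of each shard by executing the UDF.
        offsets = {shard: self.offsets_udf(shard) for shard in shards}
        # Step 2: perform the operation on the external source, per shard and offset.
        result = {shard: operation(source[shard], offsets[shard]) for shard in shards}
        # Step 3: increment the version property once the operation has completed.
        self.version += 1
        return result


# Toy external source: two shards holding ordered records (illustrative data).
external_source = {"shard-0": ["a", "b", "c"], "shard-1": ["x"]}
committed_offsets = {"shard-0": 1, "shard-1": 0}

stream = StreamObject(offsets_udf=lambda shard: committed_offsets[shard])
unread = stream.run(
    shards=["shard-0", "shard-1"],
    operation=lambda records, offset: records[offset:],  # read from the offset onward
    source=external_source,
)
```

    Keeping the version counter on the stream object itself means a caller can tell whether an operation has run since it last looked, without inspecting the external system.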

    Journaled tables in database systems

    Publication No.: US11080257B2

    Publication date: 2021-08-03

    Application No.: US16410695

    Filing date: 2019-05-13

    Applicant: Snowflake Inc.

    IPC classification: G06F16/22 G06F16/21 G06F16/23

    Abstract: Systems, methods, and devices for storing database data in journal tables comprising a snapshot and a log table. A method includes defining a journal table comprising a snapshot and a log table, the snapshot comprising an up-to-date representation of data in the journal table at a point in time. The method includes assigning a timestamp to the snapshot indicating when the snapshot was generated. The method includes receiving a request to execute a transaction on the journal table to modify the data in the journal table, the transaction comprising one or more of an insert, a delete, an update, or a merge. The method includes inserting a new row into the log table in lieu of executing the transaction on the snapshot of the journal table, the new row comprising an indication of a change requested to be made to the journal table based on the transaction.
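    The snapshot-plus-log idea can be made concrete with a small in-memory sketch. `JournalTable`, `apply`, and `read` are hypothetical names, integer timestamps are an assumption, and treating insert, update, and merge uniformly as an upsert is a simplification made here rather than anything the abstract specifies.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional, Tuple

LogRow = Tuple[int, str, int, Optional[str]]  # (timestamp, operation, key, value)


@dataclass
class JournalTable:
    """Hypothetical model: a timestamped snapshot plus an append-only log table."""

    snapshot: Dict[int, str] = field(default_factory=dict)
    snapshot_ts: int = 0
    log: List[LogRow] = field(default_factory=list)

    def apply(self, ts: int, op: str, key: int, value: Optional[str] = None) -> None:
        # The change is recorded as a new log row; the snapshot is left untouched.
        self.log.append((ts, op, key, value))

    def read(self, as_of: Optional[int] = None) -> Dict[int, str]:
        # Current view = snapshot + replay of log rows newer than the snapshot.
        view = dict(self.snapshot)
        for ts, op, key, value in sorted(self.log, key=lambda row: row[0]):
            if ts <= self.snapshot_ts or (as_of is not None and ts > as_of):
                continue
            if op == "delete":
                view.pop(key, None)
            else:  # insert, update, and merge are all treated as upserts here
                view[key] = value
        return view


table = JournalTable(snapshot={1: "a", 2: "b"}, snapshot_ts=10)
table.apply(11, "update", 1, "A")
table.apply(12, "delete", 2)
table.apply(13, "insert", 3, "c")
current = table.read()
```

    Writes stay cheap because they only append to the log, while readers pay the cost of replaying rows recorded after the snapshot timestamp.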

    Tracking changes in database data
    Document type: Granted patent

    Publication No.: US10983980B2

    Publication date: 2021-04-20

    Application No.: US16942717

    Filing date: 2020-07-29

    Applicant: Snowflake Inc.

    Abstract: The subject technology executes a change on an existing micro-partition of a table of a database, the executing of the change comprising generating a new micro-partition that embodies the change. The subject technology receives a request for a delta for the table between a first timestamp and a second timestamp. The subject technology queries at least one change tracking column to determine the delta between the first timestamp and the second timestamp, the delta including information indicating at least one database operation that was performed to at least one row of a set of rows of the table, without including information as to intermediate changes made to at least one row of the set of rows of the table between the first timestamp and the second timestamp, that facilitates a reduction in storage of historical versions of the set of rows of the table.
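    To illustrate what a delta "without intermediate changes" looks like, the sketch below collapses a list of per-row change records into one net change per row for a time window. The flat change list stands in for the change tracking columns described in the abstract, and `net_delta`, the tuple layout, and the half-open window `(t1, t2]` are assumptions made for this example only.

```python
from typing import Dict, List, Optional, Tuple

Change = Tuple[int, str, str, Optional[str]]  # (timestamp, row_id, operation, value)


def net_delta(changes: List[Change], t1: int, t2: int) -> Dict[str, Tuple[str, Optional[str]]]:
    """Return one net change per row for t1 < timestamp <= t2."""
    first_op: Dict[str, str] = {}
    last: Dict[str, Tuple[str, Optional[str]]] = {}
    for ts, row_id, op, value in sorted(changes, key=lambda c: c[0]):
        if not (t1 < ts <= t2):
            continue
        first_op.setdefault(row_id, op)  # remember how the row entered the window
        last[row_id] = (op, value)       # later changes overwrite earlier ones

    delta: Dict[str, Tuple[str, Optional[str]]] = {}
    for row_id, (op, value) in last.items():
        created_in_window = first_op[row_id] == "insert"
        if op == "delete":
            if not created_in_window:
                delta[row_id] = ("delete", None)
            # a row inserted and deleted inside the window has no net effect
        else:
            delta[row_id] = ("insert" if created_in_window else "update", value)
    return delta


changes = [
    (1, "r1", "insert", "a"),
    (5, "r1", "update", "b"),
    (6, "r1", "update", "c"),
    (7, "r2", "insert", "x"),
    (8, "r2", "delete", None),
    (9, "r3", "insert", "n"),
]
delta = net_delta(changes, t1=2, t2=10)
```

    Row `r1` is reported once with its final value, and row `r2`, which was inserted and deleted inside the window, does not appear at all.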

    Task Scheduling In Database Systems
    Document type: Patent application

    Publication No.: US20200167340A1

    Publication date: 2020-05-28

    Application No.: US16203322

    Filing date: 2018-11-28

    Applicant: Snowflake Inc.

    IPC classification: G06F16/23 G06F9/46

    Abstract: Systems, methods, and devices for executing a task on database data in response to a trigger event are disclosed. A method includes executing a transaction on a table comprising database data, wherein executing the transaction comprises generating a new table version. The method includes, in response to the transaction being fully executed, generating a change tracking entry comprising an indication of one or more modifications made to the table by the transaction and storing the change tracking entry in a change tracking stream. The method includes executing a task on the new table version in response to a trigger event.
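    The sequence in this abstract (commit produces a new table version, a change tracking entry is appended to a stream, and a task runs when a trigger fires) might be sketched as follows. `VersionedTable`, `on_trigger`, and `commit` are hypothetical names, the table is append-only for brevity, and modelling the trigger as a predicate over the change stream is an assumption of this example.

```python
from typing import Callable, Dict, List, Tuple


class VersionedTable:
    """Hypothetical table that keeps every version and a change tracking stream."""

    def __init__(self) -> None:
        self.versions: List[list] = [[]]       # version 0 is the empty table
        self.change_stream: List[Dict] = []    # one entry per committed transaction
        self.tasks: List[Tuple[Callable, Callable]] = []

    def on_trigger(self, trigger: Callable[[List[Dict]], bool], task: Callable[[list], None]) -> None:
        # Register a task together with the trigger event that should start it.
        self.tasks.append((trigger, task))

    def commit(self, inserted_rows: list) -> None:
        # Executing the transaction generates a new table version.
        new_version = self.versions[-1] + list(inserted_rows)
        self.versions.append(new_version)
        # Only after the transaction is fully executed is the entry stored.
        self.change_stream.append(
            {"version": len(self.versions) - 1, "inserted": list(inserted_rows)}
        )
        # Run each task whose trigger fires, on the new table version.
        for trigger, task in self.tasks:
            if trigger(self.change_stream):
                task(new_version)


row_counts: List[int] = []
table = VersionedTable()
# Assumed trigger: fire once at least two transactions have been recorded.
table.on_trigger(lambda stream: len(stream) >= 2, lambda rows: row_counts.append(len(rows)))
table.commit([1])
table.commit([2, 3])
```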

    TRACKING INTERMEDIATE CHANGES IN DATABASE DATA

    Publication No.: US20240311369A1

    Publication date: 2024-09-19

    Application No.: US18668993

    Filing date: 2024-05-20

    Applicant: Snowflake Inc.

    Abstract: Systems, methods, and devices for tracking a series of changes to database data are disclosed. A method includes executing a transaction to modify data in a micro-partition of a table of a database by generating a new micro-partition that embodies the transaction. The method includes associating transaction data with the new micro-partition, wherein the transaction data comprises a timestamp when the transaction was fully executed, and further includes associating modification data with the new micro-partition that comprises an indication of one or more rows of the table that were modified by the transaction. The method includes joining the transaction data with the modification data to generate joined data and querying the joined data to determine a listing of intermediate modifications made to the table between a first timestamp and a second timestamp.
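    The join described here can be sketched with two small lists keyed by micro-partition. The function `intermediate_changes`, the tuple layouts, and the choice of a half-open window `(t1, t2]` are assumptions for illustration; the abstract does not specify how the data is stored or which boundary is inclusive.

```python
from typing import List, Tuple

TransactionRow = Tuple[str, int]         # (micro_partition_id, commit_timestamp)
ModificationRow = Tuple[str, str, str]   # (micro_partition_id, row_id, operation)
JoinedRow = Tuple[int, str, str, str]    # (commit_timestamp, partition, row_id, operation)


def intermediate_changes(
    transaction_data: List[TransactionRow],
    modification_data: List[ModificationRow],
    t1: int,
    t2: int,
) -> List[JoinedRow]:
    """Join both data sets on micro-partition ID and list every change in (t1, t2]."""
    ts_by_partition = dict(transaction_data)
    joined = [
        (ts_by_partition[partition], partition, row_id, op)
        for partition, row_id, op in modification_data
        if partition in ts_by_partition
    ]
    # Unlike a net delta, every intermediate modification is kept, in time order.
    return sorted(row for row in joined if t1 < row[0] <= t2)


transaction_data = [("mp1", 5), ("mp2", 8), ("mp3", 12)]
modification_data = [
    ("mp1", "r1", "insert"),
    ("mp2", "r1", "update"),
    ("mp3", "r1", "delete"),
]
history = intermediate_changes(transaction_data, modification_data, t1=4, t2=10)
```

    Here the same row appears twice in the result, once per transaction that touched it inside the window, which is the difference from a delta that reports only the net effect.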

    TRACKING CHANGES IN DATABASE DATA
    Document type: Published application

    Publication No.: US20240095229A1

    Publication date: 2024-03-21

    Application No.: US18520845

    Filing date: 2023-11-28

    Applicant: Snowflake Inc.

    Abstract: A method includes detecting, by at least one hardware processor, a change request for a table of a database, the table comprising a plurality of micro-partitions. A transaction associated with the change request is executed at a first timestamp. The transaction causes replacement of a first micro-partition of the plurality of micro-partitions with a second micro-partition. A change tracking column is generated in the second micro-partition. The change tracking column comprises metadata for the transaction. A delta for the table between the first timestamp and a second timestamp is generated using the metadata in the change tracking column. The delta indicates changes made to one or more rows of the table between the first timestamp and the second timestamp.