Tracking intermediate changes in database data

    公开(公告)号:US12026153B2

    公开(公告)日:2024-07-02

    申请号:US18179762

    申请日:2023-03-07

    申请人: Snowflake Inc.

    摘要: Systems, methods, and devices for tracking a series of changes to database data are disclosed. A method includes executing a transaction to modify data in a micro-partition of a table of a database by generating a new micro-partition that embodies the transaction. The method includes associating transaction data with the new micro-partition, wherein the transaction data comprises a timestamp when the transaction was fully executed, and further includes associating modification data with the new micro-partition that comprises an indication of one or more rows of the table that were modified by the transaction. The method includes joining the transaction data with the modification data to generate joined data and querying the joined data to determine a listing of intermediate modifications made to the table between a first timestamp and a second timestamp.

    Materialized views over external tables in database systems

    公开(公告)号:US11507571B2

    公开(公告)日:2022-11-22

    申请号:US16385720

    申请日:2019-04-16

    申请人: Snowflake Inc.

    摘要: Systems, methods, and devices for generating a materialized view over an external table. A method includes connecting a database platform to an external table such that the database platform has read access for the external table and does not have write access for the external table. The method includes generating, by the database platform, a materialized view over the external table. The method includes receiving a notification that a modification has been made to the external table, the modification comprising one or more of an addition of a file, a deletion of a file, or an update to a file in a source location for the external table. The method includes, in response to the external table being modified, refreshing the materialized view such that the materialized view comprises an accurate representation of the external table.

    SCHEDULING DATA PROCESSING TASKS USING A STREAM OF TRACKING ENTRIES

    公开(公告)号:US20220164339A1

    公开(公告)日:2022-05-26

    申请号:US17650890

    申请日:2022-02-14

    申请人: Snowflake Inc.

    IPC分类号: G06F16/23 G06F9/46

    摘要: Systems, methods, and devices for scheduling of data processing tasks are disclosed. A method includes performing a modification on a first set of immutable partitions storing database data to generate a second set of immutable partitions. The second set is associated with a modified version of the database data. A change tracking entry is entered in a stream of tracking entries based on committing the modification. The change tracking entry includes an indication of the modification on the first set of immutable partitions. A stream offset of the stream of tracking entries is advanced based on the entering of the change tracking entry in the stream of tracking entries. The stream offset indicates a timestamp associated with a latest committed modification to the database data. A data processing task is scheduled for execution on the modified version of the database data based on the advancing of the stream offset.