Tracking intermediate changes in database data

    Publication No.: US11620281B2

    Publication date: 2023-04-04

    Application No.: US17656960

    Filing date: 2022-03-29

    Applicant: Snowflake Inc.

    Abstract: Systems, methods, and devices for tracking a series of changes to database data are disclosed. A method includes executing a transaction to modify data in a micro-partition of a table of a database by generating a new micro-partition that embodies the transaction. The method includes associating transaction data with the new micro-partition, wherein the transaction data comprises a timestamp when the transaction was fully executed, and further includes associating modification data with the new micro-partition that comprises an indication of one or more rows of the table that were modified by the transaction. The method includes joining the transaction data with the modification data to generate joined data and querying the joined data to determine a listing of intermediate modifications made to the table between a first timestamp and a second timestamp.
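The join-then-query step in this abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the record types, field names, integer timestamps, and the inclusive time window are all assumptions made for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class TransactionRecord:
    """Hypothetical transaction data attached to a new micro-partition."""
    partition_id: str
    committed_at: int  # timestamp when the transaction was fully executed


@dataclass(frozen=True)
class ModificationRecord:
    """Hypothetical modification data: one modified row per record."""
    partition_id: str
    row_id: int
    action: str  # e.g. "insert", "update", "delete"


def intermediate_changes(transactions, modifications, first_ts, second_ts):
    """Join both record sets on partition_id, then list every change whose
    commit timestamp lies in [first_ts, second_ts] (inclusive by assumption)."""
    commit_by_partition = {t.partition_id: t.committed_at for t in transactions}
    joined = [
        (commit_by_partition[m.partition_id], m.row_id, m.action)
        for m in modifications
        if m.partition_id in commit_by_partition
    ]
    # Sort by commit time so the listing reads as a change history.
    return sorted(c for c in joined if first_ts <= c[0] <= second_ts)
```

Because every transaction produces its own micro-partition in this model, changes that were later overwritten still appear in the listing, which is what makes the intermediate states recoverable.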

    CONFIGURING PARALLELISM PARAMETERS FOR INVOCATION OF EXTERNAL TABLE FUNCTIONS

    Publication No.: US20220414094A1

    Publication date: 2022-12-29

    Application No.: US17823132

    Filing date: 2022-08-30

    Applicant: Snowflake Inc.

    Abstract: A query referencing an external table function provided by a remote software component is received. Requests to execute the external table function on input data are sent to a proxy service. A first request includes a batch of input rows from the input data. A first response to the first request received from the proxy service includes a first portion of result data and a pagination token. The pagination token indicates that at least a second portion of the result data corresponding to the first batch of input rows is to be obtained from the remote software component. Based on the pagination token, a second request is sent to obtain the second portion of the result data. One or more responses are received from the proxy service that comprise at least the second portion of the result data. The result data is processed according to the query.
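The pagination-token exchange described in this abstract might look roughly like the following Python sketch. The `send_request` callable, the token format, and the in-memory `make_fake_proxy` stand-in are hypothetical; the abstract does not specify a wire protocol.

```python
def fetch_batch_results(send_request, batch):
    """Collect all result rows for one batch of input rows.

    send_request(batch, token) is a hypothetical callable returning
    (rows, next_token); next_token is None once no result data remains.
    """
    rows, token = send_request(batch, None)  # first request carries the batch
    results = list(rows)
    while token is not None:
        # Follow-up requests carry only the pagination token.
        rows, token = send_request(None, token)
        results.extend(rows)
    return results


def make_fake_proxy(page_size):
    """In-memory stand-in for the proxy service, for illustration only.

    The fake table function emits one output row per character of each
    input word, so one input batch can yield several pages of results.
    """
    pending = {}  # token -> rows not yet returned
    counter = 0

    def send_request(batch, token):
        nonlocal counter
        if token is None:
            remaining = [(word, ch) for word in batch for ch in word]
        else:
            remaining = pending.pop(token)
        page, rest = remaining[:page_size], remaining[page_size:]
        if not rest:
            return page, None
        counter += 1
        next_token = f"page-{counter}"
        pending[next_token] = rest
        return page, next_token

    return send_request
```

For example, `fetch_batch_results(make_fake_proxy(2), ["ab", "c"])` issues one initial request and one token-based follow-up before returning all three result rows.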

    TABLE DATA PROCESSING USING A CHANGE TRACKING COLUMN

    Publication No.: US20220327107A1

    Publication date: 2022-10-13

    Application No.: US17809203

    Filing date: 2022-06-27

    Applicant: Snowflake Inc.

    Abstract: A system includes one or more processors and data storage containing instructions executable by the one or more processors to perform operations. The operations include detecting a first executed transaction causing a first modification to table data stored in a table. The table data is associated with a corresponding metadata file with metadata information of the table. A new metadata file is generated responsive to the first executed transaction. The new metadata file includes the metadata information and additional metadata associated with the first modification. A second executed transaction causing a second modification to the table data is detected. The table data is updated with a change tracking column. The change tracking column includes lineage of executed transactions on the table data. The lineage indicates at least the first transaction and the second transaction.
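As a rough illustration of the lineage idea in this abstract, the Python sketch below writes a new metadata record for each transaction and appends the transaction ID to a per-row change tracking field. The `Table` class, the `_change_tracking` column name, and the metadata layout are assumptions for illustration, not details taken from the patent.

```python
from dataclasses import dataclass


@dataclass
class Table:
    """Hypothetical table: rows keyed by row ID plus a list of metadata files."""
    rows: dict            # row_id -> dict of column values
    metadata_files: list  # one dict per table version, oldest first


def apply_transaction(table, txn_id, updates):
    """Apply updates (row_id -> new column values) and record lineage."""
    previous = table.metadata_files[-1]
    # The new metadata file carries the earlier metadata forward and adds
    # information about this modification.
    table.metadata_files.append({
        **previous,
        "version": previous["version"] + 1,
        "last_txn": txn_id,
        "modified_rows": sorted(updates),
    })
    for row_id, values in updates.items():
        row = table.rows.setdefault(row_id, {"_change_tracking": []})
        row.update(values)
        # The change tracking column accumulates the transactions that
        # touched this row, oldest first.
        row["_change_tracking"] = row["_change_tracking"] + [txn_id]
```

After two calls with transaction IDs `"txn-1"` and `"txn-2"` that touch the same row, that row's `_change_tracking` value lists both transactions in order, while the table holds one metadata file per version.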

    Partitioning to support invocation of external table functions on multiple batches of input rows

    Publication No.: US11468079B1

    Publication date: 2022-10-11

    Application No.: US17646200

    Filing date: 2021-12-28

    Applicant: Snowflake Inc.

    Abstract: A query referencing an external table function provided by a remote software component is received. Requests to execute the external table function on input data are sent to a proxy service. A first request includes a batch of input rows from the input data. A first response to the first request received from the proxy service includes a first portion of result data and a pagination token. The pagination token indicates that at least a second portion of the result data corresponding to the first batch of input rows is to be obtained from the remote software component. Based on the pagination token, a second request is sent to obtain the second portion of the result data. One or more responses are received from the proxy service that comprise at least the second portion of the result data. The result data is processed according to the query.
