-
公开(公告)号:US11620281B2
公开(公告)日:2023-04-04
申请号:US17656960
申请日:2022-03-29
Applicant: Snowflake Inc.
Inventor: Torsten Grabs , Istvan Cseri , Benoit Dageville
IPC: G06F16/00 , G06F16/23 , G06F16/22 , G06F16/2455
Abstract: Systems, methods, and devices for tracking a series of changes to database data are disclosed. A method includes executing a transaction to modify data in a micro-partition of a table of a database by generating a new micro-partition that embodies the transaction. The method includes associating transaction data with the new micro-partition, wherein the transaction data comprises a timestamp when the transaction was fully executed, and further includes associating modification data with the new micro-partition that comprises an indication of one or more rows of the table that were modified by the transaction. The method includes joining the transaction data with the modification data to generate joined data and querying the joined data to determine a listing of intermediate modifications made to the table between a first timestamp and a second timestamp.
-
公开(公告)号:US20230070152A1
公开(公告)日:2023-03-09
申请号:US18049325
申请日:2022-10-25
Applicant: Snowflake Inc.
Inventor: Tyler Arthur Akidau , Istvan Cseri , Tyler Jones , Daniel E. Sotolongo , Zhuo Zhang
IPC: G06F16/2455 , G06F16/2453 , G06F16/25 , G06F16/22
Abstract: A streaming ingest platform can improve latency and expense issues related to uploading data into a cloud data system. The streaming ingest platform can organize the data to be ingested into per-table chunks and per-account blobs. This data may be committed and may be made available for query processing before it is ingested into the target source tables. This significantly improves latency issues. The streaming ingest platform can also accommodate uploading data from various sources with different processing and communication capabilities, such as Internet of Things (IOT) devices.
-
公开(公告)号:US11593345B1
公开(公告)日:2023-02-28
申请号:US17648621
申请日:2022-01-21
Applicant: Snowflake Inc.
Inventor: Tyler Arthur Akidau , Istvan Cseri , Fabian Hueske , Tyler Jones , Yevgeniy Kogan , Dzmitry Pauliukevich , Daniel E. Sotolongo
IPC: G06F16/23 , G06F16/2455 , G06F16/22
Abstract: Techniques described herein can accelerate change data capture determinations such as stream reads, which show changes made to a table between two points in time. Three distinct row bitests that mark deleted, updated, inserted, rows in micro-partitions can be added as metadata for the table. These bitsets can be generated during DML operations and then stored as metadata of the new partition generated by the DML operations. The bitsets can then be used to generate streams showing the changes in the table between two points in time (changes interval).
-
公开(公告)号:US20220414106A1
公开(公告)日:2022-12-29
申请号:US17930150
申请日:2022-09-07
Applicant: Snowflake Inc.
Inventor: Tyler Arthur Akidau , Thierry Cruanes , Istvan Cseri , Benoit Dageville , Tyler Jones , Dinesh Chandrakant Kulkarni
IPC: G06F16/2455 , G06F16/2453
Abstract: Hybrid tables can be used in different use-case scenarios. Hybrid tables provide a flexible mechanism to support files and data in different formats while providing access to the different types of data as part of one table. This flexibility can allow the use of hybrid tables in data lake or other similar environments.
-
公开(公告)号:US20220414094A1
公开(公告)日:2022-12-29
申请号:US17823132
申请日:2022-08-30
Applicant: Snowflake Inc.
Inventor: Srilakshmi Chintala , Istvan Cseri , Albert L. Hu , Isaac Kunen , Nitya Kumar Sharma , Igor Zinkovsky
IPC: G06F16/2458 , G06F16/22
Abstract: A query referencing an external table function provided by a remote software component is received. Requests to execute the external table function on input data are sent to a proxy service. A first request includes a batch of input rows from the input data. A first response to the first request received from the proxy service includes a first portion of result data and a pagination token. The pagination token indicates that at least a second portion of the result data corresponding to the first batch of input rows is to be obtained from the remote software component. Based on the pagination token, a second request is sent to obtain the second portion of the result data. One or more responses are received from the proxy service that comprise at least the second portion of the result data. The result data is processed according to the query.
-
公开(公告)号:US11514022B2
公开(公告)日:2022-11-29
申请号:US17364772
申请日:2021-06-30
Applicant: Snowflake Inc.
Inventor: Istvan Cseri , Dinesh Chandrakant Kulkarni
IPC: G06F16/00 , G06F16/23 , G06F16/2457 , G06F16/17 , G06F16/18 , G06F16/16 , G06F16/22 , G06F16/182
Abstract: A database platform can track changes using a stream that records changes made by database transactions. Streams on views can be implemented by applying a stream to source data, and applying the view to the streamed data. Users of the database platform can query the stream on the view to review recent changes to the view, such as changes made since the last database transaction.
-
167.
公开(公告)号:US11474799B2
公开(公告)日:2022-10-18
申请号:US16945561
申请日:2020-07-31
Applicant: Snowflake Inc.
Inventor: Istvan Cseri , Isaac Kunen , Bing Li , Edward Ma , Srinath Shankar
IPC: G06F8/51 , G06F16/2452 , G06F16/2455 , G06F16/242 , G06F16/901 , G06F16/27
Abstract: The subject technology receives first code in a first programming language, the first code including a reference to a first dataset. The subject technology converting the first code to: second code in a second programming language different than the first programming language, and at least one data query in a second format different than the first programming language. Further the subject technology sends the converted first code and the at least one data query to a data system for execution.
-
公开(公告)号:US20220327131A1
公开(公告)日:2022-10-13
申请号:US17455594
申请日:2021-11-18
Applicant: Snowflake Inc.
Inventor: Tyler Arthur Akidau , Thierry Cruanes , Istvan Cseri , Benoit Dageville , Tyler Jones , Dinesh Chandrakant Kulkarni
IPC: G06F16/2455 , G06F16/2453
Abstract: Hybrid tables can be used in different use-case scenarios. Hybrid tables provide a flexible mechanism to support files and data in different formats while providing access to the different types of data as part of one table. This flexibility can allow the use of hybrid tables in data lake or other similar environments.
-
公开(公告)号:US20220327107A1
公开(公告)日:2022-10-13
申请号:US17809203
申请日:2022-06-27
Applicant: Snowflake Inc.
Inventor: Subramanian Muralidhar , Istvan Cseri , Torsten Grabs , Benoit Dageville
IPC: G06F16/23 , G06F16/27 , G06F16/245 , G06F16/18
Abstract: A system includes one or more processors and data storage containing instructions executable by the one or more processors to perform operations. The operations include detecting a first executed transaction causing a first modification to table data stored in a table. The table data is associated with a corresponding metadata file with metadata information of the table. A new metadata file is generated responsive to the first executed transaction. The new metadata file includes the metadata information and additional metadata associated with the first modification. A second executed transaction causing a second modification to the table data is detected. The table data is updated with a change tracking column. The change tracking column includes lineage of executed transactions on the table data. The lineage indicates at least the first transaction and the second transaction.
-
170.
公开(公告)号:US11468079B1
公开(公告)日:2022-10-11
申请号:US17646200
申请日:2021-12-28
Applicant: Snowflake Inc.
Inventor: Srilakshmi Chintala , Istvan Cseri , Albert L. Hu , Isaac Kunen , Nitya Kumar Sharma , Igor Zinkovsky
IPC: G06F16/2458 , G06F16/22
Abstract: A query referencing an external table function provided by a remote software component is received. Requests to execute the external table function on input data are sent to a proxy service. A first request includes a batch of input rows from the input data. A first response to the first request received from the proxy service includes a first portion of result data and a pagination token. The pagination token indicates that at least a second portion of the result data corresponding to the first batch of input rows is to be obtained from the remote software component. Based on the pagination token, a second request is sent to obtain the second portion of the result data. One or more responses are received from the proxy service that comprise at least the second portion of the result data. The result data is processed according to the query.
-
-
-
-
-
-
-
-
-