-
公开(公告)号:US10997148B2
公开(公告)日:2021-05-04
申请号:US17078437
申请日:2020-10-23
Applicant: Snowflake Inc.
Inventor: Istvan Cseri , Torsten Grabs , Thierry Cruanes , Subramanian Muralidhar , Benoit Dageville
Abstract: The subject technology receives one or more requests to execute one or more requested transactions on a journal table of a database, the journal table comprising a snapshot and a log table. The subject technology, based on the one or more requests, inserts a new row into the log table for each requested transaction, each new row reflecting a corresponding requested transaction. The subject technology receives, after the one or more requested transactions have been executed, a second request to execute a second requested transaction on the journal table. The subject technology generates, prior to executing the second requested transaction, a second snapshot, the second snapshot comprising a second representation of data in the journal table after the one or more requested transactions have been executed, the second snapshot stored in a third micro-partition different than the first micro-partition and the second micro-partition.
-
公开(公告)号:US10990576B2
公开(公告)日:2021-04-27
申请号:US17078397
申请日:2020-10-23
Applicant: Snowflake Inc.
Inventor: Istvan Cseri , Torsten Grabs , Thierry Cruanes , Subramanian Muralidhar , Benoit Dageville
Abstract: The subject technology defines a journal table of a database, the journal table comprising a snapshot and a log table, the snapshot comprising a representation of data in the journal table at a particular time, the log table comprising a listing of requested changes to the journal table since the particular time, the snapshot stored in a first micro-partition, the log table stored in a second micro-partition. The subject technology receives, after at least one first requested transaction has been executed, a request to execute a second requested transaction on the journal table. The subject technology generates, prior to executing the second requested transaction, a second snapshot, the second snapshot comprising a second representation of data in the journal table after the at least one first requested transaction has been executed, the second snapshot stored in a third micro-partition different than the first micro-partition and the second micro-partition.
-
公开(公告)号:US20210089560A1
公开(公告)日:2021-03-25
申请号:US17118201
申请日:2020-12-10
Applicant: SNOWFLAKE INC.
Inventor: Florian Andreas Funke , Peter Povinec , Thierry Cruanes , Benoit Dageville
IPC: G06F16/28 , H04L29/08 , G06F16/2455 , H04L12/24
Abstract: A method for a multi-cluster warehouse includes allocating a plurality of compute clusters as part of a virtual warehouse. The compute clusters are used to access and perform queries against one or more databases in one or more cloud storage resources. The method includes providing queries for the virtual warehouse to each of the plurality of compute clusters. Each of the plurality of compute clusters of the virtual warehouse receives a plurality of queries so that the computing load is spread across the different clusters. The method also includes dynamically adding compute clusters to and removing compute clusters from the virtual warehouse as needed based on a workload of the plurality of compute clusters.
-
公开(公告)号:US20210089554A1
公开(公告)日:2021-03-25
申请号:US17111781
申请日:2020-12-04
Applicant: SNOWFLAKE INC.
Inventor: Benoit Dageville , Thierry Cruanes , Marcin Zukowski
IPC: G06F16/27 , G06F9/50 , G06F16/14 , G06F16/21 , G06F16/22 , G06F16/951 , G06F16/182 , G06F16/23 , G06F16/2455 , G06F16/2458 , G06F16/9535 , G06F16/2453 , G06F9/48 , H04L29/08
Abstract: Example resource management systems and methods are described. In one implementation, a resource manager is configured to manage data processing tasks associated with multiple data elements. An execution platform is coupled to the resource manager and includes multiple execution nodes configured to store data retrieved from multiple remote storage devices. Each execution node includes a cache and a processor, where the cache and processor are independent of the remote storage devices. A metadata manager is configured to access metadata associated with at least a portion of the multiple data elements.
-
585.
公开(公告)号:US20210064594A1
公开(公告)日:2021-03-04
申请号:US16555970
申请日:2019-08-29
Applicant: Snowflake Inc.
Inventor: Benoit Dageville , Johan Harjono , Kunal Prafulla Nabar , Steven James Pelley
IPC: G06F16/23 , G06F16/2455
Abstract: Systems, methods, and devices for retrying a query. A method includes receiving a query directed to database data and assigning execution of the query to one or more execution nodes of an execution platform, the one or more execution nodes configured to execute the query on a first version of a database platform. The method includes determining that execution of the query was unsuccessful. The method includes assigning a first retry execution of the query to the one or more execution nodes of the execution platform and determining whether a regression or an intermittent fault caused the execution of the query to be unsuccessful based at least in part on whether the first retry execution of the query was successful or unsuccessful.
-
公开(公告)号:US20210064499A1
公开(公告)日:2021-03-04
申请号:US17082993
申请日:2020-10-28
Applicant: Snowflake Inc.
Inventor: Benoit Dageville , Johan Harjono , Simon Holm Jensen , Kunal Prafulla Nabar , Steven James Pelley
IPC: G06F11/30 , G06F16/248 , G06F11/32 , G06F16/2458 , G06F11/34 , G06F16/242
Abstract: Systems, methods, and devices for automatically retrying a query. A method includes receiving a query directed to database data and assigning execution of the query to one or more execution nodes of a database platform. The method includes determining that execution of the query was unsuccessful. The method includes assigning a first retry execution of the query on the first version of the database platform and assigning a second retry execution of the query on a second version of the database platform.
-
公开(公告)号:US20210049179A1
公开(公告)日:2021-02-18
申请号:US17086279
申请日:2020-10-30
Applicant: Snowflake Inc.
Inventor: Benoit Dageville , Yi Fang , Martin Hentschel , Ashish Motivala , Spyros Triantafyllis , Yizhi Zhu
IPC: G06F16/2455 , G06F16/23 , G06F16/2457 , G06F16/22 , G06F16/2458 , G06F16/27
Abstract: The subject technology receives first metadata corresponding to a set of micro-partitions. The subject technology stores a first data structure and a second data structure in storage as a first file and a second file, first data structure including the first metadata and a second data structure including second metadata, the first metadata corresponding to a set of micro-partitions, the second metadata for a grouping of the first metadata, the second data structure including information associating the second metadata to the first metadata. The subject technology stores third metadata for a table, the third metadata comprising: cumulative table metadata comprising global information about a plurality of micro-partitions of the table, the cumulative table metadata being stored in a metadata micro-partition associated with the table.
-
公开(公告)号:US20210042326A1
公开(公告)日:2021-02-11
申请号:US17080219
申请日:2020-10-26
Applicant: SNOWFLAKE INC.
Inventor: Benoit Dageville , Thierry Cruanes , Marcin Zukowski , Allison Waingold Lee , Philipp Thomas Unterbrunner
IPC: G06F16/27 , G06F9/50 , G06F16/14 , G06F16/21 , G06F16/22 , G06F16/951 , G06F16/182 , G06F16/23 , G06F16/2455 , G06F16/2458 , G06F16/9535 , G06F16/2453 , G06F9/48 , H04L29/08
Abstract: A method, apparatus, and system for join operations of a plurality of relations that are distributed over a plurality of storage locations over a network of computing components.
-
公开(公告)号:US20210042282A1
公开(公告)日:2021-02-11
申请号:US17078397
申请日:2020-10-23
Applicant: Snowflake Inc.
Inventor: Istvan Cseri , Torsten Grabs , Thierry Cruanes , Subramanian Muralidhar , Benoit Dageville
Abstract: The subject technology defines a journal table of a database, the journal table comprising a snapshot and a log table, the snapshot comprising a representation of data in the journal table at a particular time, the log table comprising a listing of requested changes to the journal table since the particular time, the snapshot stored in a first micro-partition, the log table stored in a second micro-partition. The subject technology receives, after at least one first requested transaction has been executed, a request to execute a second requested transaction on the journal table. The subject technology generates, prior to executing the second requested transaction, a second snapshot, the second snapshot comprising a second representation of data in the journal table after the at least one first requested transaction has been executed, the second snapshot stored in a third micro-partition different than the first micro-partition and the second micro-partition.
-
公开(公告)号:US20210019336A1
公开(公告)日:2021-01-21
申请号:US17030565
申请日:2020-09-24
Applicant: Snowflake Inc.
Inventor: Thierry Cruanes , Marcin Zukowski , Benoit Dageville , Jiaqi Yan
IPC: G06F16/28 , G06F16/21 , G06F16/245 , G06F16/22
Abstract: The subject technology creates partitions based on changes to a table, at least one of the one or more partitions overlapping with respect to values of one or more attributes with at least one of another partition and a previous partition. The subject technology maintains states for the partitions, each state from the plurality of states representing a particular degree of clustering of the table. The subject technology determines a number of overlapping partitions and a depth of the overlapping partitions, and determines a clustering ratio based at least n part on the number of overlapping partitions and the depth. The subject technology reclusters partitions of the table to increase the clustering ratio, the clustering ratio determined by at least a proportion of rows in a layout of the table that satisfy an ordering criteria based at least in part a particular attribute of the one or more attributes.
-
-
-
-
-
-
-
-
-