-
公开(公告)号:US20210200754A1
公开(公告)日:2021-07-01
申请号:US17200751
申请日:2021-03-12
Applicant: Snowflake Inc.
Inventor: Subramanian Muralidhar , Benoit Dageville , Thierry Cruanes , Nileema Shingte , Saurin Shah , Torsten Grabs , Istvan Cseri
IPC: G06F16/242 , G06F3/06 , G06F16/2453 , G06F16/25 , G06F16/23
Abstract: Systems, methods, and devices for querying over an external table are disclosed. A method includes connecting a database platform to an external table such that the database platform has read access for the external table and does not have write access for the external table. The method includes receiving a query comprising a predicate, the query directed at least to data in the external table. The method includes determining, based on metadata, one or more partitions in the external table comprising data satisfying the predicate. The method includes pruning, based on the metadata, all partitions in the external table that do not comprise any data satisfying the predicate. The method includes generating a query plan comprising a plurality of discrete subtasks. The method includes assigning, based on the metadata, the plurality of discrete subtasks to one or more nodes in an execution platform.
-
公开(公告)号:US20210191954A1
公开(公告)日:2021-06-24
申请号:US17194182
申请日:2021-03-05
Applicant: Snowflake Inc.
Inventor: Thierry Cruanes , Benoit Dageville , Allison Waingold Lee
IPC: G06F16/27 , G06F9/50 , G06F16/14 , G06F16/2455 , G06F16/21 , G06F16/951 , G06F16/23 , G06F16/2453 , H04L29/08 , G06F16/9535 , G06F16/182 , G06F16/22 , G06F16/2458 , G06F9/48
Abstract: A system and method for managing data storage and data access with querying data in a distributed system without buffering the results on intermediate operations in disk storage.
-
公开(公告)号:US20210157785A1
公开(公告)日:2021-05-27
申请号:US17143054
申请日:2021-01-06
Applicant: Snowflake Inc.
Inventor: Subramanian Muralidhar , Istvan Cseri , Torsten Grabs , Benoit Dageville
IPC: G06F16/23 , G06F16/27 , G06F16/245 , G06F16/18
Abstract: Systems, methods, and devices for generating a transactional change tracking summary for a database. A method includes executing a transaction on a table of a database, wherein the table includes a micro-partition and the transaction is executed on the micro-partition. The method includes, in response to the transaction being fully executed, generating a change tracking entry comprising an indication of one or modifications made to the table by the transaction. The method includes storing the change tracking entry in a change tracking stream, wherein the change tracking stream includes one or more sequential change tracking entries that each correspond to a different transaction that is fully executed on the table.
-
公开(公告)号:US11010407B2
公开(公告)日:2021-05-18
申请号:US16905599
申请日:2020-06-18
Applicant: Snowflake Inc.
Inventor: Thierry Cruanes , Benoit Dageville , Marcin Zukowski
IPC: G06F16/27 , G06F9/50 , G06F16/14 , G06F16/21 , G06F16/22 , G06F16/951 , G06F16/182 , G06F16/23 , G06F16/2455 , G06F16/2458 , G06F16/9535 , G06F16/2453 , G06F9/48 , H04L29/08
Abstract: A method and apparatus managing a set of processors for a set of queries is described. In an exemplary embodiment, a device receives a set of queries for a data warehouse, the set of queries including one or more queries to be processed by the data warehouse. The device further provisions a set of processors from a first plurality of processors, where the set of processors to process the set of queries, and a set of storage resources to store data for the set of queries. In addition, the device monitors a utilization of the set of processors as the set of processors processes the set of queries. The device additionally updates a number of the processors in the set of processors provisioned based on the utilization/Furthermore, the device processes the set of queries using the updated set of processors.
-
公开(公告)号:US10999355B1
公开(公告)日:2021-05-04
申请号:US16814875
申请日:2020-03-10
Applicant: Snowflake Inc.
Inventor: Pui Kei Johnston Chu , Benoit Dageville , Matthew Glickman , Christian Kleinerman , Prasanna Krishnan , Justin Langseth
IPC: G06F15/173 , H04L29/08 , H04L29/06 , H04L12/24
Abstract: Sharing data in a data exchange across multiple cloud computing platforms and/or cloud computing platform regions is described. An example computer-implemented method can include receiving data sharing information from a data provider for sharing a data set in a data exchange from a first cloud computing entity to a set of second cloud computing entities. In response to receiving the data sharing information, the method may also include creating an account with each of the set of second cloud computing entities. The method may also further include sharing the data set from the first cloud computing entity with the set of second cloud computing entities using at least the corresponding account of that second cloud computing entity.
-
公开(公告)号:US10997215B2
公开(公告)日:2021-05-04
申请号:US17030565
申请日:2020-09-24
Applicant: Snowflake Inc.
Inventor: Thierry Cruanes , Marcin Zukowski , Benoit Dageville , Jiaqi Yan
Abstract: The subject technology creates partitions based on changes to a table, at least one of the one or more partitions overlapping with respect to values of one or more attributes with at least one of another partition and a previous partition. The subject technology maintains states for the partitions, each state from the plurality of states representing a particular degree of clustering of the table. The subject technology determines a number of overlapping partitions and a depth of the overlapping partitions, and determines a clustering ratio based at least in part on the number of overlapping partitions and the depth. The subject technology reclusters partitions of the table to increase the clustering ratio, the clustering ratio determined by at least a proportion of rows in a layout of the table that satisfy an ordering criteria based at least in part a particular attribute of the one or more attributes.
-
公开(公告)号:US10997162B2
公开(公告)日:2021-05-04
申请号:US16824676
申请日:2020-03-19
Applicant: Snowflake Inc.
Inventor: Torsten Grabs , Istvan Cseri , Benoit Dageville
IPC: G06F17/00 , G06F16/23 , G06F16/22 , G06F16/2455
Abstract: Systems, methods, and devices for tracking a series of changes to database data are disclosed. A method includes executing a transaction to modify data in a micro-partition of a table of a database by generating a new micro-partition that embodies the transaction. The method includes associating transaction data with the new micro-partition, wherein the transaction data comprises a timestamp when the transaction was fully executed, and further includes associating modification data with the new micro-partition that comprises an indication of one or more rows of the table that were modified by the transaction. The method includes joining the transaction data with the modification data to generate joined data and querying the joined data to determine a listing of intermediate modifications made to the table between a first timestamp and a second timestamp.
-
公开(公告)号:US10997151B2
公开(公告)日:2021-05-04
申请号:US16705566
申请日:2019-12-06
Applicant: Snowflake Inc.
Inventor: Subramanian Muralidhar , Istvan Cseri , Torsten Grabs , Benoit Dageville
IPC: G06F16/23 , G06F9/46 , G06F16/245 , G06F16/27 , G06F16/18
Abstract: Systems, methods, and devices for generating a transactional change tracking summary for a database. A method includes executing a transaction on a table of a database, wherein the table includes a micro-partition and the transaction is executed on the micro-partition. The method includes, in response to the transaction being fully executed, generating a change tracking entry comprising an indication of one or modifications made to the table by the transaction. The method includes storing the change tracking entry in a change tracking stream, wherein the change tracking stream includes one or more sequential change tracking entries that each correspond to a different transaction that is fully executed on the table.
-
公开(公告)号:US20210124664A1
公开(公告)日:2021-04-29
申请号:US17142962
申请日:2021-01-06
Applicant: Snowflake Inc.
Inventor: Benoit Dageville , Johan Harjono , Simon Holm Jensen , Kunal Prafulla Nabar , Steven James Pelley
IPC: G06F11/30 , G06F16/2458 , G06F16/242 , G06F11/32 , G06F11/14 , G06F16/248 , G06F11/34
Abstract: Disclosed herein are systems and methods for query-attempt assignment in a database environment. In an embodiment, a database platform includes first and second database query managers respectively configured to manage external and internal tasks of the database platform. The first database query manager receives a query directed to database data in a client account of the database platform, and an attempt to execute the query is assigned to one or more execution nodes. Based on determining that the attempt was unsuccessful, the database platform transfers the query to the second database manager, which assigns a first retry attempt to execute the query to one or more execution nodes.
-
公开(公告)号:US10977241B2
公开(公告)日:2021-04-13
申请号:US16915435
申请日:2020-06-29
Applicant: Snowflake Inc.
Inventor: Benoit Dageville , Martin Hentschel , William Waddington
IPC: G06F16/00 , G06F16/23 , G06F21/60 , G06F16/22 , G06F16/2455
Abstract: A method for a database system includes storing table data for a database, the table data including information in rows and columns of one or more database tables. The method includes storing metadata on immutable storage, the metadata including information about the table data for the database. In one embodiment, mutable metadata may be periodically consolidated in the background to create new versions of metadata files and which allows for deletions of old metadata files and old data files.
-
-
-
-
-
-
-
-
-