-
Publication No.: US20210019335A1
Publication Date: 2021-01-21
Application No.: US17030549
Filing Date: 2020-09-24
Applicant: Snowflake Inc.
Inventor: Thierry Cruanes, Marcin Zukowski, Benoit Dageville, Jiaqi Yan
IPC: G06F16/28, G06F16/21, G06F16/245, G06F16/22
Abstract: The subject technology creates one or more partitions based on changes to a table, at least one of the one or more partitions overlapping with respect to values of one or more attributes with at least one of another partition and a previous partition, the creating of the one or more partitions resulting in a decrease in a degree of clustering of the table. The subject technology determines, after creating the one or more partitions, that the degree of clustering of the table is below a clustering threshold, the clustering threshold corresponding to a clustering ratio, the clustering ratio determined by at least a proportion of rows in a layout of the table that satisfy an ordering criterion based at least in part on a particular attribute of the one or more attributes. The subject technology increases the degree of clustering of the table by reclustering partitions of the table.
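The threshold check in this abstract can be illustrated with a minimal Python sketch. It is not the patented method: the abstract does not define the ordering test, so the sketch assumes a row counts as ordered when its partition's [min, max] range on the clustering attribute overlaps no other partition. The names `clustering_ratio`, `recluster`, and `maintain`, the threshold of 0.5, and the partition size are all hypothetical.

```python
from typing import List

# Each partition is modeled as the list of clustering-attribute values of its rows.
Partition = List[int]


def clustering_ratio(partitions: List[Partition]) -> float:
    """Fraction of rows in partitions whose value range overlaps no other partition.

    Assumption: "non-overlapping range" stands in for the ordering criterion,
    which the abstract does not spell out.
    """
    non_empty = [p for p in partitions if p]
    total = sum(len(p) for p in non_empty)
    if total == 0:
        return 1.0
    ranges = [(min(p), max(p)) for p in non_empty]
    ordered_rows = 0
    for i, (lo, hi) in enumerate(ranges):
        overlaps = any(
            lo <= other_hi and other_lo <= hi
            for j, (other_lo, other_hi) in enumerate(ranges)
            if j != i
        )
        if not overlaps:
            ordered_rows += len(non_empty[i])
    return ordered_rows / total


def recluster(partitions: List[Partition], rows_per_partition: int) -> List[Partition]:
    """Sort all rows on the attribute and cut them into fixed-size partitions."""
    rows = sorted(v for p in partitions for v in p)
    return [rows[i:i + rows_per_partition] for i in range(0, len(rows), rows_per_partition)]


def maintain(partitions: List[Partition], threshold: float, rows_per_partition: int) -> List[Partition]:
    """Recluster only when the clustering ratio has dropped below the threshold."""
    if clustering_ratio(partitions) < threshold:
        return recluster(partitions, rows_per_partition)
    return partitions


# A change to the table adds a partition that overlaps two existing ones.
before = [[10, 20, 30], [40, 50, 60], [70, 80, 90]]
after_insert = before + [[25, 45]]
repaired = maintain(after_insert, threshold=0.5, rows_per_partition=3)
```

In this toy layout the new partition overlaps two of the three existing ones, so the ratio falls below the assumed threshold and `maintain` rewrites the table into non-overlapping partitions.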
-
Publication No.: US10891306B2
Publication Date: 2021-01-12
Application No.: US16995599
Filing Date: 2020-08-17
Applicant: Snowflake Inc.
Inventor: Thierry Cruanes, Benoit Dageville, Allison Waingold Lee
IPC: G06F15/16, G06F16/27, G06F16/2453, G06F9/50, G06F16/22, G06F16/21, G06F16/182, H04L29/08, G06F16/14, G06F16/2455, G06F16/2458, G06F9/48, G06F16/9535, G06F16/951, G06F16/23
Abstract: A system and method for managing data storage and data access, including querying data in a distributed system without buffering the results of intermediate operations in disk storage.
-
Publication No.: US10860450B1
Publication Date: 2020-12-08
Application No.: US16846964
Filing Date: 2020-04-13
Applicant: Snowflake Inc.
Inventor: Benoit Dageville, Johan Harjono, Simon Holm Jensen, Kunal Prafulla Nabar, Steven James Pelley
IPC: G06F11/30, G06F11/32, G06F16/248, G06F16/2458, G06F16/242, G06F11/34
Abstract: Systems, methods, and devices for automatically retrying a query. A method includes receiving a query directed to database data and assigning execution of the query to one or more execution nodes of a database platform. The method includes determining that execution of the query was unsuccessful. The method includes assigning a first retry execution of the query on the first version of the database platform and assigning a second retry execution of the query on a second version of the database platform.
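The retry flow in this abstract can be sketched in a few lines of Python. This is an illustration under assumptions, not the patented implementation: each platform version is modeled as a plain callable that returns a result or raises, and the names `run_with_retries`, `flaky_v1`, and `stable_v2` are hypothetical.

```python
from typing import Callable, Dict, Tuple

# Hypothetical executor type: runs a query on one platform version,
# returning a result string or raising an exception on failure.
Executor = Callable[[str], str]


def run_with_retries(query: str, first_version: Executor, second_version: Executor) -> Dict:
    """Run the query once; if it fails, retry it on both platform versions."""
    try:
        return {"status": "ok", "result": first_version(query)}
    except Exception as original_error:
        retries: Dict[str, Tuple[str, str]] = {}
        for name, executor in (("first_version", first_version),
                               ("second_version", second_version)):
            try:
                retries[name] = ("ok", executor(query))
            except Exception as exc:
                retries[name] = ("failed", str(exc))
        return {"status": "retried",
                "original_error": str(original_error),
                "retries": retries}


def flaky_v1(query: str) -> str:
    # Stand-in for a version that fails on this query every time.
    raise RuntimeError("regression")


def stable_v2(query: str) -> str:
    # Stand-in for a version that executes the query successfully.
    return "42"


outcome = run_with_retries("SELECT 1", flaky_v1, stable_v2)
```

Retrying on both versions makes the two outcomes comparable: a query that fails again on the first version but succeeds on the second points at a version-specific problem rather than a transient fault.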
-
Publication No.: US20200380008A1
Publication Date: 2020-12-03
Application No.: US16797970
Filing Date: 2020-02-21
Applicant: Snowflake Inc.
Inventor: Pui Kei Johnston Chu, Benoit Dageville, Matthew J. Glickman, Christian Kleinerman, Prasanna Krishnan, Justin Langseth
IPC: G06F16/25, G06F21/62, G06F16/2455
Abstract: A data warehouse storing databases for a plurality of users, including service providers hosting data for other users of the data warehouse, may implement a data exchange. The data warehouse verifies the identity of users and executes instructions with respect to databases of the data warehouse.
-
Publication No.: US20200379997A1
Publication Date: 2020-12-03
Application No.: US16944150
Filing Date: 2020-07-31
Applicant: Snowflake Inc.
Inventor: Prasanna Rajaperumal, Thierry Cruanes, Allison Waingold Lee, Igor Demura, Jiaqi Yan, Benoit Dageville
IPC: G06F16/2453, G06F16/27, G06F16/23, G06F16/22, G06F16/2455
Abstract: The subject technology defines a materialized view over a source table that is associated with a first account of a database. The subject technology defines cross-account access rights to the materialized view for a second account. The subject technology modifies the source table for the materialized view. The subject technology identifies that the materialized view is stale with respect to the source table by merging the materialized view and the source table. The subject technology causes the materialized view to be refreshed with respect to the source table.
-
Publication No.: US20200379993A1
Publication Date: 2020-12-03
Application No.: US16428367
Filing Date: 2019-05-31
Applicant: Snowflake Inc.
Inventor: Prasanna Rajaperumal, Thierry Cruanes, Allison Waingold Lee, Igor Demura, Jiaqi Yan, Benoit Dageville
IPC: G06F16/2453
Abstract: Systems, methods, and devices for generating and updating cross-account materialized views in multiple tenant database systems. A method includes defining a share object in a first account wherein the share object includes data associated with the first account. The method includes granting cross-account access rights to the share object to a second account such that the second account has access to the share object without copying the share object. The method includes generating a materialized view over the share object. The method includes updating the data associated with the first account. The method includes identifying whether the materialized view is stale with respect to the share object by merging the materialized view and the share object.
-
Publication No.: US10831781B2
Publication Date: 2020-11-10
Application No.: US16913047
Filing Date: 2020-06-26
Applicant: Snowflake Inc.
Inventor: Thierry Cruanes, Benoit Dageville, Allison Waingold Lee
IPC: G06F15/16, G06F16/27, G06F16/182, G06F16/23, G06F16/2455, H04L29/08, G06F16/2453, G06F16/21, G06F16/2458, G06F9/50, G06F9/48, G06F16/951, G06F16/9535, G06F16/14, G06F16/22
Abstract: A system and method for managing data storage and data access, including querying data in a distributed system without buffering the results of intermediate operations in disk storage.
-
Publication No.: US10817536B2
Publication Date: 2020-10-27
Application No.: US16778817
Filing Date: 2020-01-31
Applicant: Snowflake Inc.
Inventor: Eric Robinson, Benoit Dageville, Shigeyuki Takeda, Kristopher Wagner
Abstract: Systems, methods, and devices for transitioning a client connection for a database from a first deployment of the database to a second deployment of the database. A method includes maintaining the client connection between a client and the first deployment such that database traffic occurs at the first deployment. The method includes generating a unique identification that references a first connection object of the first deployment and a second connection object of a second deployment. The method includes receiving a notification that the first deployment is unavailable and providing an external connection group unique identification to the second deployment for the second deployment to determine whether the client should be connected to the second deployment. The method includes receiving a uniform resource locator from the second deployment if the client should be connected to the second deployment.
-
Publication No.: US20200334242A1
Publication Date: 2020-10-22
Application No.: US16842942
Filing Date: 2020-04-08
Applicant: Snowflake Inc.
Inventor: Subramanian Muralidhar, Benoit Dageville, Thierry Cruanes, Nileema Shingte, Saurin Shah, Torsten Grabs, Istvan Cseri
Abstract: Systems, methods, and devices for automated maintenance of external tables in database systems are disclosed. A method includes receiving, by a database platform, read access to content in an external data storage platform that is separate from the database platform. The method includes defining an external table based on the content in the external data storage platform. The method includes connecting the database platform to the external table such that the database platform has read access for the external table and does not have write access for the external table. The method includes generating metadata for the external table, the metadata comprising information about data stored in the external table. The method includes receiving a notification that a modification has been made to the content in the external data storage platform, the modification comprising one or more of an addition of a file, a deletion of a file, or an update to a file in a source location for the external table. The method includes refreshing the metadata for the external table in response to the modification being made to the content in the external data storage platform.
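The notification-driven refresh in this abstract can be sketched as a small Python function. This is a simplified model, not the patented implementation: metadata is a dictionary keyed by file path, and the event fields (`kind`, `path`, `size`, `etag`) and the function name `refresh_metadata` are hypothetical.

```python
from typing import Dict

# Hypothetical metadata shape: file path -> facts recorded about that file.
Metadata = Dict[str, Dict[str, object]]


def refresh_metadata(metadata: Metadata, event: Dict[str, object]) -> Metadata:
    """Return new metadata reflecting one change in the external storage location.

    The external files are never written; only the platform's own
    metadata about them is updated.
    """
    updated = dict(metadata)
    kind, path = event["kind"], str(event["path"])
    if kind in ("added", "updated"):
        updated[path] = {"size": event["size"], "etag": event["etag"]}
    elif kind == "deleted":
        updated.pop(path, None)
    else:
        raise ValueError(f"unknown event kind: {kind!r}")
    return updated


# Three notifications for the same source location, applied in order.
m0: Metadata = {}
m1 = refresh_metadata(m0, {"kind": "added", "path": "logs/a.parquet", "size": 100, "etag": "v1"})
m2 = refresh_metadata(m1, {"kind": "updated", "path": "logs/a.parquet", "size": 140, "etag": "v2"})
m3 = refresh_metadata(m2, {"kind": "deleted", "path": "logs/a.parquet"})
```

Returning a new dictionary for each event keeps earlier metadata snapshots intact, which is one simple way to model a refresh that readers can observe atomically.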
-
Publication No.: US20200334241A1
Publication Date: 2020-10-22
Application No.: US16841831
Filing Date: 2020-04-07
Applicant: Snowflake Inc.
Inventor: Subramanian Muralidhar, Benoit Dageville, Thierry Cruanes, Nileema Shingte, Saurin Shah, Torsten Grabs, Istvan Cseri
IPC: G06F16/242, G06F3/06, G06F16/23, G06F16/25, G06F16/2453
Abstract: Systems, methods, and devices for querying over an external table are disclosed. A method includes connecting a database platform to an external table such that the database platform has read access for the external table and does not have write access for the external table. The method includes receiving a query comprising a predicate, the query directed at least to data in the external table. The method includes determining, based on metadata, one or more partitions in the external table comprising data satisfying the predicate. The method includes pruning, based on the metadata, all partitions in the external table that do not comprise any data satisfying the predicate. The method includes generating a query plan comprising a plurality of discrete subtasks. The method includes assigning, based on the metadata, the plurality of discrete subtasks to one or more nodes in an execution platform.
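Metadata-based pruning followed by subtask assignment, as outlined in this abstract, can be sketched in Python. The sketch makes several assumptions: the metadata is a per-partition minimum and maximum of one column, the predicate is a closed range, and subtasks are assigned round-robin. The names `PartitionMeta`, `prune`, and `plan` are hypothetical.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass(frozen=True)
class PartitionMeta:
    """Assumed metadata: the value range of one column within a partition."""
    name: str
    min_value: int
    max_value: int


def prune(metadata: List[PartitionMeta], lo: int, hi: int) -> List[PartitionMeta]:
    """Keep only partitions whose range can contain a value in [lo, hi]."""
    return [m for m in metadata if m.max_value >= lo and m.min_value <= hi]


def plan(metadata: List[PartitionMeta], lo: int, hi: int, nodes: List[str]) -> Dict[str, List[str]]:
    """Create one scan subtask per surviving partition and assign it round-robin."""
    assignments: Dict[str, List[str]] = {node: [] for node in nodes}
    for i, part in enumerate(prune(metadata, lo, hi)):
        assignments[nodes[i % len(nodes)]].append(part.name)
    return assignments


metadata = [
    PartitionMeta("p0", 0, 9),
    PartitionMeta("p1", 10, 19),
    PartitionMeta("p2", 20, 29),
    PartitionMeta("p3", 30, 39),
]
# Predicate: column BETWEEN 12 AND 25, so only p1 and p2 need to be read.
assignments = plan(metadata, 12, 25, ["node-a", "node-b"])
```

Because pruning consults only the metadata, partitions p0 and p3 are excluded without reading any file in the external location.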