-
公开(公告)号:US10997212B2
公开(公告)日:2021-05-04
申请号:US17062859
申请日:2020-10-05
Applicant: Snowflake Inc.
Inventor: Ashish Motivala , Yi Fang , Martin Hentschel , Benoit Dageville , Spyridon Triantafyllis , Yizhi Zhu
IPC: G06F16/27 , G06F16/245 , G06F16/22
Abstract: The subject technology stores database data in a table, the table comprising a plurality of micro-partitions each having rows and columns. The subject technology stores metadata for the table in a data structure, the metadata comprising a first level of the data structure comprising a column expression property comprising information about data stored in a column of a micro-partition of the table, a micro-partition expression property comprising information about data stored in the micro-partition of the table, and a second level of the data structure comprising a grouping expression property comprising information about data stored in the micro-partition and one or more additional micro-partitions of the table.
-
公开(公告)号:US10997207B2
公开(公告)日:2021-05-04
申请号:US17064888
申请日:2020-10-07
Applicant: Snowflake Inc.
Inventor: Eric Robinson , Benoit Dageville , Shigeyuki Takeda , Kristopher Wagner
Abstract: The subject technology generates a unique identification that references a first connection object of a first deployment and a second connection object of a second deployment of a database. The subject technology receives a notification that the first deployment is unavailable, the notification that the first deployment is unavailable comprises an indication of a downtime for the first deployment. The subject technology provides an external connection group unique identification to the second deployment for the second deployment to determine whether a client is to be connected to the second deployment. The subject technology receives a uniform resource locator from the second deployment when a client device is to be connected to the second deployment, the uniform resource locator being provided to the client device for connecting to the second deployment.
-
公开(公告)号:US10997165B2
公开(公告)日:2021-05-04
申请号:US16842942
申请日:2020-04-08
Applicant: Snowflake Inc.
Inventor: Subramanian Muralidhar , Benoit Dageville , Thierry Cruanes , Nileema Shingte , Saurin Shah , Torsten Grabs , Istvan Cseri
Abstract: Systems, methods, and devices for automated maintenance of external tables in database systems are disclosed. A method includes receiving, by a database platform, read access to content in an external data storage platform that is separate from the database platform. The method includes defining an external table based on the content in the external data storage platform. The method includes connecting the database platform to the external table such that the database platform has read access for the external table and does not have write access for the external table. The method includes generating metadata for the external table, the metadata comprising information about data stored in the external table. The method includes receiving a notification that a modification has been made to the content in the external data storage platform, the modification comprising one or more of an addition of a file, a deletion of a file, or an update to a file in a source location for the external table. The method includes refreshing the metadata for the external table in response to the modification being made to the content in the external data storage platform.
-
公开(公告)号:US10977278B2
公开(公告)日:2021-04-13
申请号:US16283431
申请日:2019-02-22
Applicant: Snowflake Inc.
Inventor: Ashish Motivala , Yi Fang , Martin Hentschel , Benoit Dageville , Spyros Triantafyllis , Yizhi Zhu
IPC: G06F16/27 , G06F16/245 , G06F16/22
Abstract: Systems, methods, and devices for multiple level metadata organization in database systems. A system includes a plurality of shared storage devices collectively storing database data across a plurality of tables that each comprise one or more micro-partitions constituting immutable storage devices. The system includes a resource manager configured to manage the storage of database data stored in a table across one or more of the plurality of shared storage devices. The resource manager is configured to manage the storage of metadata for the table. The metadata includes a column expression property with information about data stored in a column, a micro-partition expression property with information about data stored in a micro-partition, a grouping expression property with information about data stored in two or more micro-partitions, and cumulative table metadata with global information about all micro-partitions of the table.
-
公开(公告)号:US10977268B2
公开(公告)日:2021-04-13
申请号:US16428592
申请日:2019-05-31
Applicant: Snowflake Inc.
Inventor: Justin Langseth , Pui Kei Johnston Chu , Benoit Dageville , Matthew J. Glickman , Christian Kleinerman , Prasanna Krishnan
IPC: G06F16/2455 , G06F16/25 , G06F21/62
Abstract: A data warehouse storing databases for a plurality of users, including service providers hosting data for other users of the data warehouse may implement a data exchange. A provider of data creates a listing in the exchange including metadata. A consumer browses the metadata and select a listing to add to consumed shares of the consumer. Browsing may be filtered to implement a private exchange. Adding of data may be limited by access controls requiring verification of the consumer's identity. The added data may be filtered to include only data that relates to the consumer. Sharing may be bi-directional such that a request to add a listing for first data of a provider may also reference second data of the consumer such that the first data is added to the consumed shares of the consumer and the second data is added to the consumed shares of the provider.
-
公开(公告)号:US10977244B2
公开(公告)日:2021-04-13
申请号:US16182216
申请日:2018-11-06
Applicant: Snowflake Inc.
Inventor: Torsten Grabs , Istvan Cseri , Benoit Dageville
IPC: G06F17/00 , G06F16/23 , G06F16/22 , G06F16/2455
Abstract: Systems, methods, and devices for tracking a series of changes to database data are disclosed. A method includes executing a transaction to modify data in a micro-partition of a table of a database by generating a new micro-partition that embodies the transaction. The method includes associating transaction data with the new micro-partition, wherein the transaction data comprises a timestamp when the transaction was fully executed, and further includes associating modification data with the new micro-partition that comprises an indication of one or more rows of the table that were modified by the transaction. The method includes joining the transaction data with the modification data to generate joined data and querying the joined data to determine a listing of intermediate modifications made to the table between a first timestamp and a second timestamp.
-
公开(公告)号:US20210103602A1
公开(公告)日:2021-04-08
申请号:US17125524
申请日:2020-12-17
Applicant: Snowflake Inc.
Inventor: Thierry Cruanes , Benoit Dageville , Allison Waingold Lee
IPC: G06F16/27 , G06F9/50 , G06F16/14 , G06F16/21 , G06F16/22 , G06F16/951 , G06F16/182 , G06F16/23 , G06F16/2455 , G06F16/2458 , G06F16/9535 , G06F16/2453 , G06F9/48 , H04L29/08
Abstract: A system and method for managing data storage and data access with querying data in a distributed system without buffering the results on intermediate operations in disk storage.
-
公开(公告)号:US20210103600A1
公开(公告)日:2021-04-08
申请号:US17124240
申请日:2020-12-16
Applicant: SNOWFLAKE INC.
Inventor: Benoit Dageville , Thierry Cruanes , Marcin Zukowski
IPC: G06F16/27 , G06F9/50 , G06F16/14 , G06F16/21 , G06F16/22 , G06F16/951 , G06F16/182 , G06F16/23 , G06F16/2455 , G06F16/2458 , G06F16/9535 , G06F16/2453 , G06F9/48 , H04L29/08
Abstract: Caching systems and methods are described. In one implementation, a method identifies multiple files used to process a query and distributes each of the multiple files to a particular execution node to execute the query. Each execution node determines whether the distributed file is stored in the execution node's cache. If the execution node determines that the file is stored in the cache, it processes the query using the cached file. If the file is not stored in the cache, the execution node retrieves the file from a remote storage device, stores the file in the execution node's cache, and processes the query using the file.
-
公开(公告)号:US20210097076A1
公开(公告)日:2021-04-01
申请号:US17120546
申请日:2020-12-14
Applicant: Snowflake Inc.
Inventor: Prasanna Rajaperumal , Thierry Cruanes , Allison Waingold Lee , Igor Demura , Jiaqi Yan , Benoit Dageville
IPC: G06F16/2453 , G06F16/27 , G06F16/23 , G06F16/22 , G06F16/2455
Abstract: The subject technology generates a materialized view over a share object, the share object including data associated with a first account. The subject technology merges the materialized view and the share object. The subject technology determines that the materialized view is stale with respect to the share object based at least in part on the merging. The subject technology generates a refreshed materialized view with respect to the share object in response to the determining. The subject technology shares the refreshed materialized view to a second account in response to a request from the second account.
-
公开(公告)号:US20210089559A1
公开(公告)日:2021-03-25
申请号:US17116625
申请日:2020-12-09
Applicant: SNOWFLAKE INC.
Inventor: Florian Andreas Funke , Peter Povinec , Thierry Cruanes , Benoit Dageville
IPC: G06F16/28 , H04L29/08 , G06F16/2455 , H04L12/24
Abstract: A method for a multi-cluster warehouse includes allocating a plurality of compute clusters as part of a virtual warehouse. The compute clusters are used to access and perform queries against one or more databases in one or more cloud storage resources. The method includes providing queries for the virtual warehouse to each of the plurality of compute clusters. Each of the plurality of compute clusters of the virtual warehouse receives a plurality of queries so that the computing load is spread across the different clusters. The method also includes dynamically adding compute clusters to and removing compute clusters from the virtual warehouse as needed based on a workload of the plurality of compute clusters.
-
-
-
-
-
-
-
-
-