-
公开(公告)号:US20200218736A1
公开(公告)日:2020-07-09
申请号:US16823124
申请日:2020-03-18
Applicant: Snowflake Inc.
Inventor: Thierry CRUANES , Benoit DAGEVILLE , Florian Andreas FUNKE , Peter POVINEC
IPC: G06F16/28 , H04L29/08 , G06F16/2455 , H04L12/24
Abstract: A method for a multi-cluster warehouse includes allocating processing units as part of a data warehouse. The processing units access data within one or more databases in cloud storage resources. The method also includes providing one or more queries to each processing unit within the data warehouse. In response to the queries, each processing unit performs database operations on a particular portion of a database table. The method also includes monitoring a workload of the processing units to determine that a processing capacity of the processing units has reached a threshold processing capacity. The method also includes dynamically adding additional processing units to and removing processing units from the data warehouse as needed based on a workload of the processing units.
-
公开(公告)号:US20230297589A1
公开(公告)日:2023-09-21
申请号:US18202502
申请日:2023-05-26
Applicant: Snowflake Inc.
Inventor: Thierry CRUANES , Benoit DAGEVILLE , Marcin ZUKOWSKI
IPC: G06F16/27 , G06F9/50 , G06F16/14 , G06F16/21 , G06F16/22 , G06F16/951 , G06F16/182 , G06F16/23 , G06F16/2455 , G06F16/2458 , G06F16/9535 , G06F16/2453 , H04L67/568 , G06F16/28 , G06F16/25 , A61F5/56 , G06F9/48 , H04L67/1095 , H04L67/1097
CPC classification number: G06F16/27 , G06F9/5016 , G06F16/148 , G06F16/211 , G06F16/221 , G06F16/951 , G06F16/1827 , G06F16/2365 , G06F16/2456 , G06F16/2471 , G06F16/9535 , G06F16/24532 , G06F16/24545 , G06F16/24552 , H04L67/568 , G06F16/283 , G06F16/254 , A61F5/566 , G06F9/4881 , G06F9/5088 , H04L67/1095 , G06F9/5044 , H04L67/1097 , G06F9/5083
Abstract: Example caching systems and methods are described. In one implementation, a method identifies multiple files used to process a query and distributes each of the multiple files to a particular execution node to execute the query. Each execution node determines whether the distributed file is stored in the execution node's cache. If the execution node determines that the file is stored in the cache, it processes the query using the cached file. If the file is not stored in the cache, the execution node retrieves the file from a remote storage device, stores the file in the execution node's cache, and processes the query using the file.
-
公开(公告)号:US20230289367A1
公开(公告)日:2023-09-14
申请号:US18321564
申请日:2023-05-22
Applicant: Snowflake Inc.
Inventor: Benoit DAGEVILLE , Thierry CRUANES , Marcin ZUKOWSKI
IPC: G06F16/27 , G06F9/50 , G06F16/14 , G06F16/21 , G06F16/22 , G06F16/951 , G06F16/182 , G06F16/23 , G06F16/2455 , G06F16/2458 , G06F16/9535 , G06F16/2453 , H04L67/568 , G06F16/28 , G06F16/25 , A61F5/56 , G06F9/48 , H04L67/1095 , H04L67/1097
CPC classification number: G06F16/27 , G06F9/5016 , G06F16/148 , G06F16/211 , G06F16/221 , G06F16/951 , G06F16/1827 , G06F16/2365 , G06F16/2456 , G06F16/2471 , G06F16/9535 , G06F16/24532 , G06F16/24545 , G06F16/24552 , H04L67/568 , G06F16/283 , G06F16/254 , A61F5/566 , G06F9/4881 , G06F9/5088 , H04L67/1095 , G06F9/5044 , H04L67/1097 , G06F9/5083
Abstract: A system and method of scaling capacity of data warehouses to user-defined levels. The method includes monitoring a processing of one or more queries by a data warehouse, the data warehouse including one or more processing resources and a storage platform. The method includes receiving a request to process database data stored on the storage platform. The method includes determining a performance of the processed request as observed by a user device. The method includes adjusting, by a processing device, a timing associated with the processing of the one or more queries based on the performance of the processed request.
-
公开(公告)号:US20210124761A1
公开(公告)日:2021-04-29
申请号:US17141220
申请日:2021-01-04
Applicant: Snowflake Inc.
Inventor: Thierry CRUANES , Benoit DAGEVILLE , Marcin ZUKOWSKI
IPC: G06F16/27 , G06F9/50 , G06F16/14 , G06F16/21 , G06F16/22 , G06F16/951 , G06F16/182 , G06F16/23 , G06F16/2455 , G06F16/2458 , G06F16/9535 , G06F16/2453 , G06F9/48 , H04L29/08
Abstract: A method and apparatus managing a set of processors for a set of queries is described. In an exemplary embodiment, a device receives a set of queries for a data warehouse, the set of queries including one or more queries to be processed by the data warehouse. The device further provisions a set of processors from a first plurality of processors, where the set of processors to process the set of queries, and a set of storage resources to store data for the set of queries. In addition, the device monitors a utilization of the set of processors as the set of processors processes the set of queries. The device additionally updates a number of the processors in the set of processors provisioned based on the utilization/ Furthermore, the device processes the set of queries using the updated set of processors.
-
公开(公告)号:US20200257703A1
公开(公告)日:2020-08-13
申请号:US16861790
申请日:2020-04-29
Applicant: Snowflake Inc.
Inventor: Benoit DAGEVILLE , Thierry CRUANES , Marcin ZUKOWSKI
IPC: G06F16/27 , G06F16/14 , G06F16/2453 , H04L29/08 , G06F16/951 , G06F16/182 , G06F16/9535 , G06F16/21 , G06F16/2455 , G06F9/50 , G06F16/23 , G06F16/22 , G06F16/2458 , G06F9/48
Abstract: Example resource provisioning systems and methods are described. In one implementation, multiple processing resources are provided within a data warehouse. The processing resources include at least one processor and at least one storage device. At least one query to process database data is received. At least some of the processing resources may process the database data. When a processing capacity of the processing resources has reached a threshold processing capacity, the processing capacity is automatically scaled by adding at least one additional processor to the data warehouse.
-
公开(公告)号:US20200226148A1
公开(公告)日:2020-07-16
申请号:US16827103
申请日:2020-03-23
Applicant: Snowflake Inc.
Inventor: Benoit DAGEVILLE , Thierry CRUANES , Marcin ZUKOWSKI
IPC: G06F16/27 , G06F16/951 , G06F9/48 , G06F9/50 , H04L29/08 , G06F16/182 , G06F16/9535 , G06F16/21 , G06F16/2455 , G06F16/23 , G06F16/22 , G06F16/2453 , G06F16/2458 , G06F16/14
Abstract: Example resource provisioning systems and methods are described. In one implementation, multiple processing resources are provided within a data warehouse. The processing resources include at least one processor and at least one storage device. At least one query to process database data is received. At least some of the processing resources may process the database data. When a processing capacity of the processing resources has reached a threshold processing capacity, the processing capacity is automatically scaled by adding at least one additional processor to the data warehouse.
-
公开(公告)号:US20200218733A1
公开(公告)日:2020-07-09
申请号:US16823287
申请日:2020-03-18
Applicant: SNOWFLAKE INC.
Inventor: Benoit DAGEVILLE , Thierry CRUANES , Marcin Zukowski , Allison Waingold LEE , Phillip Thomas Unterbrunner
IPC: G06F16/27 , G06F9/50 , G06F16/14 , G06F16/21 , G06F16/22 , G06F16/951 , G06F16/182 , G06F16/23 , G06F16/2455 , G06F16/2458 , G06F16/9535 , G06F16/2453 , G06F9/48 , H04L29/08
Abstract: A method, apparatus, and system for join operations of a plurality of relations that are distributed over a plurality of storage locations over a network of computing components.
-
公开(公告)号:US20220374451A1
公开(公告)日:2022-11-24
申请号:US17882439
申请日:2022-08-05
Applicant: Snowflake Inc.
Inventor: Benoit DAGEVILLE , Thierry CRUANES , Marcin ZUKOWSKI
Abstract: A system and method of scaling capacity of data warehouses to user-defined levels. The method includes provisioning a data warehouse including a plurality of processing resources, the plurality of processing resources including at least one processor and at least one storage device. The method includes receiving a request to process database data stored on a storage platform including a plurality of shared storage devices in association with the data warehouse, wherein the request indicates a performance level for processing the request. The method includes determining that a capacity of the plurality of processing resources of the data warehouse would reach a threshold capacity when processing the request according to the performance level. The method includes increasing the capacity of the data warehouse for the data warehouse to process the request according to the performance level.
-
公开(公告)号:US20220206992A1
公开(公告)日:2022-06-30
申请号:US17696620
申请日:2022-03-16
Applicant: Snowflake Inc.
Inventor: Marcin ZUKOWSKI , Benoit DAGEVILLE , Thierry CRUANES , Ashish MOTIVALA
IPC: G06F16/16 , G06F16/22 , G06F16/215 , G06F16/9535 , G06F16/2453
Abstract: A system and method for pruning data based on metadata. The method may include receiving a query that includes a plurality of predicates and identifying one or more applicable files including database data satisfying at least one of the plurality of predicates. The identifying the one or more applicable files including reading metadata stored in a metadata store that is separate from the database data. The method further includes pruning inapplicable files including database data that does not satisfy at least one of the plurality of predicates to create a reduced set of files and reading the reduced set of files to execute the query.
-
公开(公告)号:US20210124717A1
公开(公告)日:2021-04-29
申请号:US17141165
申请日:2021-01-04
Applicant: Snowflake Inc.
Inventor: Thierry CRUANES , Benoit DAGEVILLE , Ashish MOTIVALA , Marcin ZUKOWSKI
IPC: G06F16/16 , G06F16/22 , G06F16/215 , G06F16/9535 , G06F16/2453
Abstract: A system and method for pruning data based on metadata. The method may include receiving a query comprising a plurality of predicates and identifying one or more applicable files comprising database data satisfying at least one of the plurality of predicates. The identifying the one or more applicable files including reading metadata stored in a metadata store that is separate from the database data. The method further includes pruning inapplicable files comprising database data that does not satisfy at least one of the plurality of predicates to create a reduced set of files and reading the reduced set of files to execute the query.
-
-
-
-
-
-
-
-
-