-
公开(公告)号:US20200257706A1
公开(公告)日:2020-08-13
申请号:US16863758
申请日:2020-04-30
Applicant: Snowflake Inc.
Inventor: Thierry CRUANES , Benoit DAGEVILLE , Florian Andreas FUNKE , Peter POVINEC
IPC: G06F16/28 , H04L29/08 , G06F16/2455 , H04L12/24
Abstract: A method for a multi-cluster warehouse includes allocating processing units as part of a data warehouse. The processing units access data within one or more databases in cloud storage resources. The method also includes providing one or more queries to each processing unit within the data warehouse. In response to the queries, each processing unit performs database operations on a particular portion of a database table. The method also includes monitoring a workload of the processing units to determine that a processing capacity of the processing units has reached a threshold processing capacity. The method also includes dynamically adding additional processing units to and removing processing units from the data warehouse as needed based on a workload of the processing units.
-
公开(公告)号:US20200265066A1
公开(公告)日:2020-08-20
申请号:US16860976
申请日:2020-04-28
Applicant: Snowflake Inc.
Inventor: Thierry CRUANES , Benoit DAGEVILLE , Marcin ZUKOWSKI
IPC: G06F16/27 , G06F9/50 , H04L29/08 , G06F9/48 , G06F16/2455 , G06F16/2453 , G06F16/9535 , G06F16/2458 , G06F16/23 , G06F16/182 , G06F16/951 , G06F16/22 , G06F16/21 , G06F16/14
Abstract: Example caching systems and methods are described. In one implementation, a method identifies multiple files used to process a query and distributes each of the multiple files to a particular execution node to execute the query. Each execution node determines whether the distributed file is stored in the execution node's cache. If the execution node determines that the file is stored in the cache, it processes the query using the cached file. If the file is not stored in the cache, the execution node retrieves the file from a remote storage device, stores the file in the execution node's cache, and processes the query using the file.
-
公开(公告)号:US20200257705A1
公开(公告)日:2020-08-13
申请号:US16862140
申请日:2020-04-29
Applicant: Snowflake Inc.
Inventor: Thierry CRUANES , Benoit DAGEVILLE , Florian Andreas FUNKE , Peter POVINEC
IPC: G06F16/28 , H04L12/24 , H04L29/08 , G06F16/2455
Abstract: A method for a multi-cluster warehouse includes allocating processing units as part of a data warehouse. The processing units access data within one or more databases in cloud storage resources. The method also includes providing one or more queries to each processing unit within the data warehouse. In response to the queries, each processing unit performs database operations on a particular portion of a database table. The method also includes monitoring a workload of the processing units to determine that a processing capacity of the processing units has reached a threshold processing capacity. The method also includes dynamically adding additional processing units to and removing processing units from the data warehouse as needed based on a workload of the processing units.
-
公开(公告)号:US20240419687A1
公开(公告)日:2024-12-19
申请号:US18780180
申请日:2024-07-22
Applicant: Snowflake Inc.
Inventor: Benoit DAGEVILLE , Thierry CRUANES , Marcin ZUKOWSKI
IPC: G06F16/27 , A61F5/56 , G06F9/48 , G06F9/50 , G06F16/14 , G06F16/182 , G06F16/21 , G06F16/22 , G06F16/23 , G06F16/2453 , G06F16/2455 , G06F16/2458 , G06F16/25 , G06F16/28 , G06F16/951 , G06F16/9535 , G06F16/9538 , H04L67/1095 , H04L67/1097 , H04L67/568
Abstract: A system and method of scaling capacity of data warehouses to user-defined levels. The method includes determining a performance associated with processing one or more queries. The method includes receiving a request to process data stored on a storage platform. The method includes adjusting, by a processing device based on the performance, a timing associated with the processing of the one or more queries to adjust a different timing associated with processing the request.
-
公开(公告)号:US20240256570A1
公开(公告)日:2024-08-01
申请号:US18587151
申请日:2024-02-26
Applicant: Snowflake Inc.
Inventor: Thierry CRUANES , Benoit DAGEVILLE , Marcin ZUKOWSKI
IPC: G06F16/27 , A61F5/56 , G06F9/48 , G06F9/50 , G06F16/14 , G06F16/182 , G06F16/21 , G06F16/22 , G06F16/23 , G06F16/2453 , G06F16/2455 , G06F16/2458 , G06F16/25 , G06F16/28 , G06F16/951 , G06F16/9535 , G06F16/9538 , H04L67/1095 , H04L67/1097 , H04L67/568
CPC classification number: G06F16/273 , A61F5/566 , G06F9/4881 , G06F9/5016 , G06F9/5044 , G06F9/5083 , G06F9/5088 , G06F16/148 , G06F16/1827 , G06F16/211 , G06F16/221 , G06F16/2365 , G06F16/24532 , G06F16/24545 , G06F16/24552 , G06F16/2456 , G06F16/2471 , G06F16/254 , G06F16/27 , G06F16/283 , G06F16/951 , G06F16/9535 , G06F16/9538 , H04L67/1095 , H04L67/1097 , H04L67/568
Abstract: Example caching systems and methods are described. In one implementation, a method receives a set of queries to be processed by a set of virtual warehouses. The method distributes the set of queries to the set of virtual warehouses to be executed and creates, during the processing of the set of queries by the set of virtual warehouses, a new virtual warehouse, wherein cache resources associated with the new virtual warehouse are populated with data files associated with the set of queries at the time the virtual warehouse is created and the cache resources vary among the processors, wherein a first subset of the processors comprises minimal cache resources and a second subset of processors comprises cache resources providing faster input-output operations. The method redistributes the set of queries across the set of virtual warehouses.
-
公开(公告)号:US20200257702A1
公开(公告)日:2020-08-13
申请号:US16858518
申请日:2020-04-24
Applicant: SNOWFLAKE INC.
Inventor: Benoit DAGEVILLE , Thierry CRUANES , Marcin Zukowski , Allison Waingold LEE , Philipp Thomas Unterbrunner
IPC: G06F16/27 , G06F9/50 , G06F16/14 , G06F16/21 , G06F16/22 , G06F16/951 , G06F16/182 , G06F16/23 , G06F16/2455 , G06F16/2458 , G06F16/9535 , G06F16/2453 , G06F9/48 , H04L29/08
Abstract: A method, apparatus, and system for join operations of a plurality of relations that are distributed over a plurality of storage locations over a network of computing components.
-
公开(公告)号:US20200257701A1
公开(公告)日:2020-08-13
申请号:US16858510
申请日:2020-04-24
Applicant: SNOWFLAKE INC.
Inventor: Benoit DAGEVILLE , Thierry CRUANES , Marcin Zukowski , Allison Waingold LEE , Philipp Thomas Unterbrunner
IPC: G06F16/27 , G06F9/50 , G06F16/14 , G06F16/21 , G06F16/22 , G06F16/951 , G06F16/182 , G06F16/23 , G06F16/2455 , G06F16/2458 , G06F16/9535 , G06F16/2453 , G06F9/48 , H04L29/08
Abstract: A method, apparatus, and system for join operations of a plurality of relations that are distributed over a plurality of storage locations over a network of computing components.
-
公开(公告)号:US20200257658A1
公开(公告)日:2020-08-13
申请号:US16861516
申请日:2020-04-29
Applicant: Snowflake Inc.
Inventor: Thierry CRUANES , Benoit DAGEVILLE , Ashish MOTIVALA , Marcin ZUKOWSKI
IPC: G06F16/16 , G06F16/2453 , G06F16/9535 , G06F16/215 , G06F16/22
Abstract: A system and method for pruning data based on metadata. The method may include receiving a query comprising a plurality of predicates and identifying one or more applicable files comprising database data satisfying at least one of the plurality of predicates. The identifying the one or more applicable files including reading metadata stored in a metadata store that is separate from the database data. The method further includes pruning inapplicable files comprising database data that does not satisfy at least one of the plurality of predicates to create a reduced set of files and reading the reduced set of files to execute the query.
-
公开(公告)号:US20200218736A1
公开(公告)日:2020-07-09
申请号:US16823124
申请日:2020-03-18
Applicant: Snowflake Inc.
Inventor: Thierry CRUANES , Benoit DAGEVILLE , Florian Andreas FUNKE , Peter POVINEC
IPC: G06F16/28 , H04L29/08 , G06F16/2455 , H04L12/24
Abstract: A method for a multi-cluster warehouse includes allocating processing units as part of a data warehouse. The processing units access data within one or more databases in cloud storage resources. The method also includes providing one or more queries to each processing unit within the data warehouse. In response to the queries, each processing unit performs database operations on a particular portion of a database table. The method also includes monitoring a workload of the processing units to determine that a processing capacity of the processing units has reached a threshold processing capacity. The method also includes dynamically adding additional processing units to and removing processing units from the data warehouse as needed based on a workload of the processing units.
-
公开(公告)号:US20220374451A1
公开(公告)日:2022-11-24
申请号:US17882439
申请日:2022-08-05
Applicant: Snowflake Inc.
Inventor: Benoit DAGEVILLE , Thierry CRUANES , Marcin ZUKOWSKI
Abstract: A system and method of scaling capacity of data warehouses to user-defined levels. The method includes provisioning a data warehouse including a plurality of processing resources, the plurality of processing resources including at least one processor and at least one storage device. The method includes receiving a request to process database data stored on a storage platform including a plurality of shared storage devices in association with the data warehouse, wherein the request indicates a performance level for processing the request. The method includes determining that a capacity of the plurality of processing resources of the data warehouse would reach a threshold capacity when processing the request according to the performance level. The method includes increasing the capacity of the data warehouse for the data warehouse to process the request according to the performance level.
-
-
-
-
-
-
-
-
-