-
公开(公告)号:US20230394009A1
公开(公告)日:2023-12-07
申请号:US18448512
申请日:2023-08-11
Applicant: Snowflake Inc.
Inventor: Marcin ZUKOWSKI , Benoit DAGEVILLE , Thierry Cruanes , Ashish MOTIVALA
IPC: G06F16/16 , G06F16/22 , G06F16/215 , G06F16/9535 , G06F16/2453
CPC classification number: G06F16/162 , G06F16/22 , G06F16/215 , G06F16/9535 , G06F16/24544
Abstract: A system and method for pruning data based on metadata. The method may include receiving a query that includes a plurality of predicates and identifying one or more applicable files including database data satisfying at least one of the plurality of predicates. The identifying the one or more applicable files including reading metadata stored in a metadata store that is separate from the database data. The method further includes pruning inapplicable files including database data that does not satisfy at least one of the plurality of predicates to create a reduced set of files and reading the reduced set of files to execute the query.
-
公开(公告)号:US20200265066A1
公开(公告)日:2020-08-20
申请号:US16860976
申请日:2020-04-28
Applicant: Snowflake Inc.
Inventor: Thierry CRUANES , Benoit DAGEVILLE , Marcin ZUKOWSKI
IPC: G06F16/27 , G06F9/50 , H04L29/08 , G06F9/48 , G06F16/2455 , G06F16/2453 , G06F16/9535 , G06F16/2458 , G06F16/23 , G06F16/182 , G06F16/951 , G06F16/22 , G06F16/21 , G06F16/14
Abstract: Example caching systems and methods are described. In one implementation, a method identifies multiple files used to process a query and distributes each of the multiple files to a particular execution node to execute the query. Each execution node determines whether the distributed file is stored in the execution node's cache. If the execution node determines that the file is stored in the cache, it processes the query using the cached file. If the file is not stored in the cache, the execution node retrieves the file from a remote storage device, stores the file in the execution node's cache, and processes the query using the file.
-
公开(公告)号:US20240419687A1
公开(公告)日:2024-12-19
申请号:US18780180
申请日:2024-07-22
Applicant: Snowflake Inc.
Inventor: Benoit DAGEVILLE , Thierry CRUANES , Marcin ZUKOWSKI
IPC: G06F16/27 , A61F5/56 , G06F9/48 , G06F9/50 , G06F16/14 , G06F16/182 , G06F16/21 , G06F16/22 , G06F16/23 , G06F16/2453 , G06F16/2455 , G06F16/2458 , G06F16/25 , G06F16/28 , G06F16/951 , G06F16/9535 , G06F16/9538 , H04L67/1095 , H04L67/1097 , H04L67/568
Abstract: A system and method of scaling capacity of data warehouses to user-defined levels. The method includes determining a performance associated with processing one or more queries. The method includes receiving a request to process data stored on a storage platform. The method includes adjusting, by a processing device based on the performance, a timing associated with the processing of the one or more queries to adjust a different timing associated with processing the request.
-
公开(公告)号:US20240256570A1
公开(公告)日:2024-08-01
申请号:US18587151
申请日:2024-02-26
Applicant: Snowflake Inc.
Inventor: Thierry CRUANES , Benoit DAGEVILLE , Marcin ZUKOWSKI
IPC: G06F16/27 , A61F5/56 , G06F9/48 , G06F9/50 , G06F16/14 , G06F16/182 , G06F16/21 , G06F16/22 , G06F16/23 , G06F16/2453 , G06F16/2455 , G06F16/2458 , G06F16/25 , G06F16/28 , G06F16/951 , G06F16/9535 , G06F16/9538 , H04L67/1095 , H04L67/1097 , H04L67/568
CPC classification number: G06F16/273 , A61F5/566 , G06F9/4881 , G06F9/5016 , G06F9/5044 , G06F9/5083 , G06F9/5088 , G06F16/148 , G06F16/1827 , G06F16/211 , G06F16/221 , G06F16/2365 , G06F16/24532 , G06F16/24545 , G06F16/24552 , G06F16/2456 , G06F16/2471 , G06F16/254 , G06F16/27 , G06F16/283 , G06F16/951 , G06F16/9535 , G06F16/9538 , H04L67/1095 , H04L67/1097 , H04L67/568
Abstract: Example caching systems and methods are described. In one implementation, a method receives a set of queries to be processed by a set of virtual warehouses. The method distributes the set of queries to the set of virtual warehouses to be executed and creates, during the processing of the set of queries by the set of virtual warehouses, a new virtual warehouse, wherein cache resources associated with the new virtual warehouse are populated with data files associated with the set of queries at the time the virtual warehouse is created and the cache resources vary among the processors, wherein a first subset of the processors comprises minimal cache resources and a second subset of processors comprises cache resources providing faster input-output operations. The method redistributes the set of queries across the set of virtual warehouses.
-
公开(公告)号:US20200257658A1
公开(公告)日:2020-08-13
申请号:US16861516
申请日:2020-04-29
Applicant: Snowflake Inc.
Inventor: Thierry CRUANES , Benoit DAGEVILLE , Ashish MOTIVALA , Marcin ZUKOWSKI
IPC: G06F16/16 , G06F16/2453 , G06F16/9535 , G06F16/215 , G06F16/22
Abstract: A system and method for pruning data based on metadata. The method may include receiving a query comprising a plurality of predicates and identifying one or more applicable files comprising database data satisfying at least one of the plurality of predicates. The identifying the one or more applicable files including reading metadata stored in a metadata store that is separate from the database data. The method further includes pruning inapplicable files comprising database data that does not satisfy at least one of the plurality of predicates to create a reduced set of files and reading the reduced set of files to execute the query.
-
公开(公告)号:US20220374451A1
公开(公告)日:2022-11-24
申请号:US17882439
申请日:2022-08-05
Applicant: Snowflake Inc.
Inventor: Benoit DAGEVILLE , Thierry CRUANES , Marcin ZUKOWSKI
Abstract: A system and method of scaling capacity of data warehouses to user-defined levels. The method includes provisioning a data warehouse including a plurality of processing resources, the plurality of processing resources including at least one processor and at least one storage device. The method includes receiving a request to process database data stored on a storage platform including a plurality of shared storage devices in association with the data warehouse, wherein the request indicates a performance level for processing the request. The method includes determining that a capacity of the plurality of processing resources of the data warehouse would reach a threshold capacity when processing the request according to the performance level. The method includes increasing the capacity of the data warehouse for the data warehouse to process the request according to the performance level.
-
公开(公告)号:US20220206992A1
公开(公告)日:2022-06-30
申请号:US17696620
申请日:2022-03-16
Applicant: Snowflake Inc.
Inventor: Marcin ZUKOWSKI , Benoit DAGEVILLE , Thierry CRUANES , Ashish MOTIVALA
IPC: G06F16/16 , G06F16/22 , G06F16/215 , G06F16/9535 , G06F16/2453
Abstract: A system and method for pruning data based on metadata. The method may include receiving a query that includes a plurality of predicates and identifying one or more applicable files including database data satisfying at least one of the plurality of predicates. The identifying the one or more applicable files including reading metadata stored in a metadata store that is separate from the database data. The method further includes pruning inapplicable files including database data that does not satisfy at least one of the plurality of predicates to create a reduced set of files and reading the reduced set of files to execute the query.
-
公开(公告)号:US20210124717A1
公开(公告)日:2021-04-29
申请号:US17141165
申请日:2021-01-04
Applicant: Snowflake Inc.
Inventor: Thierry CRUANES , Benoit DAGEVILLE , Ashish MOTIVALA , Marcin ZUKOWSKI
IPC: G06F16/16 , G06F16/22 , G06F16/215 , G06F16/9535 , G06F16/2453
Abstract: A system and method for pruning data based on metadata. The method may include receiving a query comprising a plurality of predicates and identifying one or more applicable files comprising database data satisfying at least one of the plurality of predicates. The identifying the one or more applicable files including reading metadata stored in a metadata store that is separate from the database data. The method further includes pruning inapplicable files comprising database data that does not satisfy at least one of the plurality of predicates to create a reduced set of files and reading the reduced set of files to execute the query.
-
公开(公告)号:US20200320096A1
公开(公告)日:2020-10-08
申请号:US16905599
申请日:2020-06-18
Applicant: Snowflake Inc.
Inventor: Thierry CRUANES , Benoit DAGEVILLE , Marcin ZUKOWSKI
IPC: G06F16/27 , G06F9/50 , G06F16/14 , G06F16/21 , G06F16/22 , G06F16/951 , G06F16/182 , G06F16/23 , G06F16/2455 , G06F16/2458 , G06F16/9535 , G06F16/2453 , G06F9/48 , H04L29/08
Abstract: A method and apparatus managing a set of processors for a set of queries is described. In an exemplary embodiment, a device receives a set of queries for a data warehouse, the set of queries including one or more queries to be processed by the data warehouse. The device further provisions a set of processors from a first plurality of processors, where the set of processors to process the set of queries, and a set of storage resources to store data for the set of queries. In addition, the device monitors a utilization of the set of processors as the set of processors processes the set of queries. The device additionally updates a number of the processors in the set of processors provisioned based on the utilization/ Furthermore, the device processes the set of queries using the updated set of processors.
-
公开(公告)号:US20230297589A1
公开(公告)日:2023-09-21
申请号:US18202502
申请日:2023-05-26
Applicant: Snowflake Inc.
Inventor: Thierry CRUANES , Benoit DAGEVILLE , Marcin ZUKOWSKI
IPC: G06F16/27 , G06F9/50 , G06F16/14 , G06F16/21 , G06F16/22 , G06F16/951 , G06F16/182 , G06F16/23 , G06F16/2455 , G06F16/2458 , G06F16/9535 , G06F16/2453 , H04L67/568 , G06F16/28 , G06F16/25 , A61F5/56 , G06F9/48 , H04L67/1095 , H04L67/1097
CPC classification number: G06F16/27 , G06F9/5016 , G06F16/148 , G06F16/211 , G06F16/221 , G06F16/951 , G06F16/1827 , G06F16/2365 , G06F16/2456 , G06F16/2471 , G06F16/9535 , G06F16/24532 , G06F16/24545 , G06F16/24552 , H04L67/568 , G06F16/283 , G06F16/254 , A61F5/566 , G06F9/4881 , G06F9/5088 , H04L67/1095 , G06F9/5044 , H04L67/1097 , G06F9/5083
Abstract: Example caching systems and methods are described. In one implementation, a method identifies multiple files used to process a query and distributes each of the multiple files to a particular execution node to execute the query. Each execution node determines whether the distributed file is stored in the execution node's cache. If the execution node determines that the file is stored in the cache, it processes the query using the cached file. If the file is not stored in the cache, the execution node retrieves the file from a remote storage device, stores the file in the execution node's cache, and processes the query using the file.
-
-
-
-
-
-
-
-
-