-
公开(公告)号:US20200151192A1
公开(公告)日:2020-05-14
申请号:US16741660
申请日:2020-01-13
Applicant: Snowflake Inc.
Inventor: Benoit Dageville , Thierry Cruanes , Marcin Zukowski
IPC: G06F16/27 , G06F9/50 , H04L29/08 , G06F9/48 , G06F16/2455 , G06F16/2453 , G06F16/9535 , G06F16/2458 , G06F16/23 , G06F16/182 , G06F16/951 , G06F16/22 , G06F16/21 , G06F16/14
Abstract: A system is described inkling a cache memory and a hardware processor. The hardware processor to determine that at least one of additional data storage capacity and additional processing capacity are needed based on data processing requests of a plurality of computerized query sources to be executed by an execution platform comprising a plurality of nodes, where the data processing requests are directed to database data stored on a plurality of shared storage devices collectively storing the database data. The hardware processor further to increase, in response to the determining, an amount of the data cached by the execution platform by adding one or more nodes to the plurality of nodes.
-
公开(公告)号:US20200089655A1
公开(公告)日:2020-03-19
申请号:US16688888
申请日:2019-11-19
Applicant: Snowflake Inc.
Inventor: Marcin Zukowski , Benoit Dageville , Thierry Cruanes , Ashish Motivala
IPC: G06F16/16 , G06F16/2453 , G06F16/9535 , G06F16/215 , G06F16/22
Abstract: A system and method for pruning data based on metadata. The method may include receiving a query comprising a plurality of predicates and identifying one or more applicable files comprising database data satisfying at least one of the plurality of predicates. The identifying the one or more applicable files including reading metadata stored in a metadata. store that is separate from the database data. The method further includes pruning inapplicable files comprising database data that does not satisfy at least one of the plurality of predicates to create a reduced set of files and reading the reduced set of files to execute the query.
-
公开(公告)号:US10534793B2
公开(公告)日:2020-01-14
申请号:US15171859
申请日:2016-06-02
Applicant: SNOWFLAKE INC.
Inventor: Ashish Motivala , Benoit Dageville
IPC: G06F16/00 , G06F16/27 , G06F9/50 , G06F16/14 , G06F16/21 , G06F16/22 , G06F16/951 , G06F16/182 , G06F16/23 , G06F16/2455 , G06F16/2458 , G06F16/9535 , G06F16/2453 , G06F9/48 , H04L29/08
Abstract: Example systems and methods for cloning catalog objects are described. In one implementation, a method identifies an original catalog object associated with a set of data and creates a duplicate copy of the original catalog object without copying the data itself. The method allows access to the set of data using the duplicate catalog object and supports modifying the data associated with the original catalog object independently of the duplicate catalog object. The duplicate catalog object can be deleted upon completion of modifying the data associated with the original catalog object.
-
公开(公告)号:US20190303389A1
公开(公告)日:2019-10-03
申请号:US16447416
申请日:2019-06-20
Applicant: Snowflake Inc.
Inventor: Benoit Dageville , Thierry Cruanes , Marcin Zukowski
IPC: G06F16/27 , G06F16/2455 , G06F9/48 , G06F16/2453 , G06F16/9535 , G06F16/2458 , G06F16/23 , G06F16/182 , G06F16/951 , G06F16/22 , G06F16/21 , G06F16/14 , G06F9/50 , H04L29/08
Abstract: Example resource management systems and methods are described. In one implementation, a resource manager is configured to manage data processing tasks associated with multiple data elements. An execution platform is coupled to the resource manager and includes multiple execution nodes configured to store data retrieved from multiple remote storage devices. Each execution node includes a cache and a processor, where the cache and processor are independent of the remote storage devices. A metadata manager is configured to access metadata associated with at least a portion of the multiple data elements.
-
公开(公告)号:US12267390B2
公开(公告)日:2025-04-01
申请号:US18617083
申请日:2024-03-26
Applicant: Snowflake Inc.
Inventor: Thierry Cruanes , Benoit Dageville , Florian Andreas Funke , Peter Povinec
IPC: H04L67/1097 , G06F9/50 , G06F16/2455 , G06F16/28 , H04L41/0896 , H04L41/5025 , H04L43/0817 , H04L67/1008
Abstract: A method implementing a fault-tolerant data warehouse including allocating a plurality of processing units to a data warehouse, the processing units located in different availability zones, an availability zone comprising one or more data centers. The method further includes, as a result of monitoring a number of queries running at an input degree of parallelism on the plurality of processing units of the data warehouse, determining that the number of queries is serviceable by one fewer processing units. The method further includes routing a query from a first processing unit to a second processing unit within the data warehouse, the query having a common session identifier with another query previously provided to the second processing unit, the second processing unit determined to be caching a data segment associated with a cloud storage resource, usable by the query, and removing the first processing unit from the data warehouse.
-
公开(公告)号:US12210527B2
公开(公告)日:2025-01-28
申请号:US18128152
申请日:2023-03-29
Applicant: Snowflake Inc.
Inventor: Benoit Dageville , Lyuping Du , Martin Hentschel , Elkhan Mammedov , Lukas Moll , Vikram Wakade
IPC: G06F7/00 , G06F16/23 , G06F16/2455 , G06F17/00
Abstract: Techniques for active invalidation of cache entries are described. A first timestamp and a second timestamp are stored by a compute instance, the first timestamp corresponding to an initial time the computing instance read from a version update log in a database and the second timestamp corresponding to a last time the computing instance read from the version update log in the database. The compute instance can read records from the version update log that are associated with version timestamps greater than the second timestamp. The compute instance can invalidate a first set of cache entries with older versions stored in a cache based on reading the records, retaining a second set of cache entries, and then execute a query using the second set of cache entries.
-
公开(公告)号:US12189614B2
公开(公告)日:2025-01-07
申请号:US18477834
申请日:2023-09-29
Applicant: Snowflake Inc.
Inventor: Khaled Yagoub , Wumengjian Zhu , Benoit Dageville , William Waddington
Abstract: The subject technology serializes, by at least one hardware processor, non-primary key data of column-organized data into compressed serialized value data that is in a row-organized sequence, the compressed serialized value data compressed using at least one bitmap, the non-primary key data comprising a schema identifier, the column-organized data being stored in a columnar database system, the column-organized data comprising primary key data and the non-primary key data. The subject technology stores the compressed serialized value data in a key-value data store of a key-value database system, the key-value database system processing key-value data in a key-value format. The subject technology receives a query by the columnar database system. The subject technology deserializes a portion of the compressed serialized value data that corresponds to the query. The subject technology processes the query using the columnar database system.
-
公开(公告)号:US20240419817A1
公开(公告)日:2024-12-19
申请号:US18478539
申请日:2023-09-29
Applicant: Snowflake Inc.
Inventor: Karol Pawel Bienkowski , Benoit Dageville , Scott C. Gray , Unmesh Jagtap , Anton Yafremau , Alan Jinsoo Yu
Abstract: A data platform grants privileges to applications accessing resources of the data platform in a secure and efficient way. The data platform determines whether a privilege has been granted to an application and, if not, generates a validation of the request to grant the privilege using a manifest of the application. The data platform generates a grant privilege request user interface, presents the grant privilege user interface to a consumer of the data platform, receives a privilege grant authorization from the consumer, and grants the privilege to the application. The granted privilege is then used by the application to access the resource.
-
公开(公告)号:US20240411777A1
公开(公告)日:2024-12-12
申请号:US18800912
申请日:2024-08-12
Applicant: Snowflake Inc.
Inventor: Thierry Cruanes , Benoit Dageville , Allison Waingold Lee
IPC: G06F16/27 , A61F5/56 , G06F9/48 , G06F9/50 , G06F16/14 , G06F16/182 , G06F16/21 , G06F16/22 , G06F16/23 , G06F16/2453 , G06F16/2455 , G06F16/2458 , G06F16/25 , G06F16/28 , G06F16/951 , G06F16/9535 , G06F16/9538 , H04L67/1095 , H04L67/1097 , H04L67/568
Abstract: A system and method for managing data storage and data access with querying data in a distributed system without buffering the results of intermediate operations in storage. The method includes providing, by a first processor, as a result of executing a query plan, an intermediate result to a plurality of secondary processors. Processing of the intermediate result by the plurality of secondary processors generates a plurality of secondary results. A final result is stored to a storage device.
-
公开(公告)号:US20240346044A1
公开(公告)日:2024-10-17
申请号:US18754836
申请日:2024-06-26
Applicant: Snowflake Inc.
Inventor: Ashish Motivala , Benoit Dageville
IPC: G06F16/27 , A61F5/56 , G06F9/48 , G06F9/50 , G06F16/14 , G06F16/182 , G06F16/21 , G06F16/22 , G06F16/23 , G06F16/2453 , G06F16/2455 , G06F16/2458 , G06F16/25 , G06F16/28 , G06F16/951 , G06F16/9535 , G06F16/9538 , H04L67/1095 , H04L67/1097 , H04L67/568
CPC classification number: G06F16/273 , A61F5/566 , G06F9/4881 , G06F9/5016 , G06F9/5044 , G06F9/5083 , G06F9/5088 , G06F16/148 , G06F16/1827 , G06F16/211 , G06F16/221 , G06F16/2365 , G06F16/24532 , G06F16/24545 , G06F16/24552 , G06F16/2456 , G06F16/2471 , G06F16/254 , G06F16/27 , G06F16/283 , G06F16/951 , G06F16/9535 , G06F16/9538 , H04L67/1095 , H04L67/1097 , H04L67/568
Abstract: A system and method of using a stateless node to process data of a catalog object. The method includes accessing a catalog object comprising metadata associated with a dataset. The method includes distributing, by one or more processors, a task to a stateless node to cause the stateless node to process the dataset without storing information indicative of a particular state of the stateless node.
-
-
-
-
-
-
-
-
-