-
381.
公开(公告)号:US11709808B1
公开(公告)日:2023-07-25
申请号:US17656558
申请日:2022-03-25
Applicant: Snowflake Inc.
Inventor: Benoit Dageville , Adrian Hamza , William Waddington , Khaled Yagoub , Wumengjian Zhu , Lishi Jiang
CPC classification number: G06F16/213 , G06F16/221
Abstract: The subject technology receives a statement to perform an operation to add a new column into a table. The subject technology generates a schema hash value for a new schema version associated with a new schema version value. The subject technology stores a mapping of the schema hash value to the new schema version value for the table in a metadata database. The subject technology stores a new schema entry based on the schema hash value, the new schema version value, and the new column for the table in the metadata database. The subject technology performs an operation to add the new column to the table.
-
公开(公告)号:US20230195922A1
公开(公告)日:2023-06-22
申请号:US18109191
申请日:2023-02-13
Applicant: Snowflake Inc.
Inventor: Damien Carru , Pui Kei Johnston Chu , Benoit Dageville , Shreyas Narendra Desai , Subramanian Muralidhar , Bowen Zhang
CPC classification number: G06F21/6218 , G06F16/256 , G06F16/21 , G06F2221/2141
Abstract: Embodiments of the present disclosure relate to sharing database roles using hidden roles. A database role may be generated within a database container having a plurality of data objects, wherein the database role exists exclusively within the database container. A set of grants to a particular subset of the plurality of data objects of the database container may be assigned to the database role and the database role may be granted to the share object. The share object is mounted within a consumer account to generate an imported database container within the consumer account, the imported database container including an imported copy of the database role. The imported copy of the database role may be granted to each of one or more account level roles of the consumer account to share the particular subset of the plurality of data objects without creating proxy objects in the consumer account that represent the particular subset of the plurality of data objects.
-
公开(公告)号:US20230185824A1
公开(公告)日:2023-06-15
申请号:US18165666
申请日:2023-02-07
Applicant: SNOWFLAKE INC.
Inventor: Benoit Dageville , Thierry Cruanes , Marcin Zukowski
IPC: G06F16/27 , G06F9/50 , G06F16/14 , G06F16/21 , G06F16/22 , G06F16/951 , G06F16/182 , G06F16/23 , G06F16/2455 , G06F16/2458 , G06F16/9535 , G06F16/2453 , H04L67/568 , G06F16/28 , G06F16/25 , A61F5/56 , G06F9/48 , H04L67/1095 , H04L67/1097
CPC classification number: G06F16/27 , A61F5/566 , G06F9/4881 , G06F9/5016 , G06F9/5044 , G06F9/5083 , G06F9/5088 , G06F16/148 , G06F16/211 , G06F16/221 , G06F16/254 , G06F16/283 , G06F16/951 , G06F16/1827 , G06F16/2365 , G06F16/2456 , G06F16/2471 , G06F16/9535 , G06F16/24532 , G06F16/24545 , G06F16/24552 , H04L67/568 , H04L67/1095 , H04L67/1097
Abstract: A method and apparatus for managing a set of storage resources for a set of queries is described. In an exemplary embodiment, a method provisions processing resources of an execution platform and provisions storage resources of a storage platform. The execution platform uses the storage platform, which is shared with the execution platform, to process the set of queries. The method changes a number of the storage resources provisioned for the storage platform based on a storage capacity utilization by the set of queries of the storage resources. The method changes the number of the storage resources independently to a change of the processing resources in the execution platform. The method processes the set of queries using the changed number of the storage resources provisioned for the storage platform.
-
公开(公告)号:US20230169095A1
公开(公告)日:2023-06-01
申请号:US18102681
申请日:2023-01-27
Applicant: Snowflake Inc.
Inventor: Thierry Cruanes , Benoit Dageville , Florian Andreas Funke , Peter Povinec
IPC: G06F16/28 , H04L67/1097 , G06F16/2455 , H04L41/0896 , H04L67/1008 , H04L41/5025 , G06F9/50
CPC classification number: G06F16/283 , H04L67/1097 , G06F16/2455 , H04L41/0896 , H04L67/1008 , H04L41/5025 , G06F9/5072 , H04L43/0817
Abstract: A method for a multi-cluster warehouse includes allocating processing units as part of a data warehouse. The processing units access data within one or more databases in cloud storage resources. The method also includes providing one or more queries to each processing unit within the data warehouse. In response to the queries, each processing unit performs database operations on a particular portion of a database table. The method also includes monitoring a workload of the processing units to determine that a processing capacity of the processing units has reached a threshold processing capacity. The method also includes dynamically adding additional processing units to and removing processing units from the data warehouse as needed based on a workload of the processing units.
-
公开(公告)号:US20230169068A1
公开(公告)日:2023-06-01
申请号:US17538818
申请日:2021-11-30
Applicant: Snowflake Inc.
Inventor: Khaled Yagoub , Wumengjian Zhu , Benoit Dageville , William Waddington
CPC classification number: G06F16/2379 , G06F16/221 , G06F11/1458 , G06F16/283
Abstract: A distributed database system can implement a column-based database system and a row-based database system for processing data. The row-based database system can store data organized into key value pairs, and data to be processed by the row-based database system is converted to a key-value format compressing keys that correspond to values. The distributed database system can perform serialization and compression in converting the data to the key-value format for efficient data storage performance. The distributed database system can unpack portions of the converted serialized compressed data in response to queries that process a portion of serialized compressed data without unpacking the entire converted dataset.
-
公开(公告)号:US11645262B2
公开(公告)日:2023-05-09
申请号:US17162790
申请日:2021-01-29
Applicant: Snowflake Inc.
Inventor: Istvan Cseri , Torsten Grabs , Benoit Dageville
IPC: G06F16/23 , G06F16/27 , G06F16/2455
CPC classification number: G06F16/2358 , G06F16/2455 , G06F16/278
Abstract: The subject technology executes a change on an existing micro-partition of a table of a database, the executing of the change comprising generating a new micro-partition that embodies the change. The subject technology receives a request for a delta for the table between a first timestamp and a second timestamp. The subject technology queries at least one change tracking column to determine the delta between the first timestamp and the second timestamp, the delta including information indicating at least one database operation that was performed to at least one row of a set of rows of the table, without including information as to intermediate changes made to at least one row of the set of rows of the table between the first timestamp and the second timestamp, that facilitates a reduction in storage of historical versions of the set of rows of the table.
-
公开(公告)号:US11599440B2
公开(公告)日:2023-03-07
申请号:US17663253
申请日:2022-05-13
Applicant: Snowflake Inc.
Inventor: Benoit Dageville , Johan Harjono , Simon Holm Jensen , Kunal Prafulla Nabar , Steven James Pelley
IPC: G06F11/30 , G06F16/2458 , G06F11/34 , G06F16/242
Abstract: Techniques for automated query retry in a database platform include assigning by at least one hardware processor a first execution of a query directed to database data to a first execution node of a plurality of execution nodes of an execution platform. The first execution node uses a first set of configurations during the first execution. The techniques further include determining that the first execution of the query by the first execution node results in a failed execution. The query is transferred to a second execution node of the plurality of execution nodes. A second execution of the query at the second execution node is caused. The second execution node uses a second set of configurations during the second execution. A cause of the failed execution at the first execution node is determined based on a result of the second execution of the query at the second execution node.
-
公开(公告)号:US20230004552A1
公开(公告)日:2023-01-05
申请号:US17930177
申请日:2022-09-07
Applicant: Snowflake Inc.
Inventor: Thierry Cruanes , Benoit Dageville , Prasanna Rajaperumal , Jiaqi Yan
Abstract: Systems, methods, and devices for incrementally refreshing a materialized view are disclosed. A method includes generating a materialized view based on a source table. The method includes merging the source table and the materialized view to generate a merged table to identify whether an update has been executed on the source table that is not reflected in the materialized view. The method includes, in response to detecting an update made to the source table that is not reflected in the materialized view, applying the update to the materialized view.
-
公开(公告)号:US20220405252A1
公开(公告)日:2022-12-22
申请号:US17895823
申请日:2022-08-25
Applicant: SNOWFLAKE INC.
Inventor: Pui Kei Johnston Chu , Benoit Dageville , Shreyas Narendra Desai , German Alberto Gil Echeverri , Prasanna Krishnan , Vishnu Dutt Paladugu , Bowen Zhang
IPC: G06F16/182 , G06F16/11 , G06F9/54 , G06F16/17
Abstract: Provided herein are systems and methods for an efficient method of replicating share objects to remote deployments. An example method includes receiving, at a first deployment in a data exchange, a refresh message from a second deployment in the data exchange, wherein the first deployment comprises a database that stores a database object. The method also includes, in response to receiving the refresh message, retrieving, at the first deployment, a share grant from a share object of the first deployment and transmitting, by a processing device of the first deployment, a message including the share grant to the second deployment. The share grant includes a reference to the database object and allows a consumer to use the database object.
-
公开(公告)号:US11507570B2
公开(公告)日:2022-11-22
申请号:US17650890
申请日:2022-02-14
Applicant: Snowflake Inc.
Inventor: Istvan Cseri , Torsten Grabs , Benoit Dageville
Abstract: Systems, methods, and devices for scheduling of data processing tasks are disclosed. A method includes performing a modification on a first set of immutable partitions storing database data to generate a second set of immutable partitions. The second set is associated with a modified version of the database data. A change tracking entry is entered in a stream of tracking entries based on committing the modification. The change tracking entry includes an indication of the modification on the first set of immutable partitions. A stream offset of the stream of tracking entries is advanced based on the entering of the change tracking entry in the stream of tracking entries. The stream offset indicates a timestamp associated with a latest committed modification to the database data. A data processing task is scheduled for execution on the modified version of the database data based on the advancing of the stream offset.
-
-
-
-
-
-
-
-
-