-
公开(公告)号:US20230244655A1
公开(公告)日:2023-08-03
申请号:US17649737
申请日:2022-02-02
Applicant: Snowflake Inc.
Inventor: Chen Luo , Alexander Miller , William Waddington , Zhaohui Zhang
CPC classification number: G06F16/2379 , G06F16/2365 , G06F9/524
Abstract: The subject technology performs a search for a key in a regular space to locate a first visible version of the key. The subject technology determines that the first visible version of the key is not one of a N number of newest versions of the key. The subject technology performs a search of an undo space to locate a second visible version of the key. The subject technology determines whether the first visible version or the second visible version of the key is newer. The subject technology provides a newer version of the key between the first visible version and the second visible version of the key.
-
公开(公告)号:US11709866B2
公开(公告)日:2023-07-25
申请号:US17463353
申请日:2021-08-31
Applicant: Snowflake Inc.
Inventor: Chen Luo , Alexander Miller , William Waddington , Zhaohui Zhang
CPC classification number: G06F16/273 , G06F16/2343 , G06F16/2379
Abstract: A distributed database compaction system can perform compaction for transactional queries in an asynchronous matter without affecting completion of the queries. The compaction system can implement asynchronous transformation of key pairs in the database, and older keys can be periodically purged using a scheduled compactor. Subsequent queries use the compacted stored keys to perform efficient queries with direct reads of committed transactions and more efficient access to key values stores of the distributed database.
-
公开(公告)号:US11645281B1
公开(公告)日:2023-05-09
申请号:US17823310
申请日:2022-08-30
Applicant: Snowflake Inc.
Inventor: Thierry Cruanes , Xuelai Cui , Sangyong Hwang , Allison Waingold Lee , Boyung Lee , Nicola Dan Onose , William Waddington , Jiaqi Yan , Li Yan , Yongsik Yoon
IPC: G06F16/2453
CPC classification number: G06F16/24542 , G06F16/24539
Abstract: The subject technology receives a query, the query including a set of statements for performing the query. The subject technology populates a compilation context based at least in part the query. The subject technology invokes a compiler to perform a compilation process based on the compilation context. The subject technology performs a lookup operation on a stored plan cache for an exact match based on information from the compilation context. The subject technology, in response to determining an exact match, determines whether the particular query plan requires re-compilation based on a data dependent optimization. The subject technology determines whether a plan cache entry corresponding to the particular query plan includes a data property constraint. The subject technology determines whether the data property constraint still holds based on a set of data properties.
-
公开(公告)号:US20230011046A1
公开(公告)日:2023-01-12
申请号:US17821670
申请日:2022-08-23
Applicant: Snowflake Inc.
Inventor: Alexander Miller , William Waddington
IPC: G06F16/23 , G06F16/248 , G06F16/2455 , G06F16/22
Abstract: The subject technology receives a first transaction, the first transaction to be executed on linearizable storage. The subject technology assigns, prior to performing a read operation, a first read version to the first transaction, the first read version indicating a first version of the linearizable storage, the first read version corresponding to a particular timestamp of a current time. The subject technology performs the read operation from the first transaction on a table in a database. The subject technology determines a first commit version identifier corresponding to first data resulting from the read operation. The subject technology proceeds to perform a particular write operation from the first transaction. The subject technology determines that the first commit version identifier fails to match a second commit version identifier corresponding to a second transaction. The subject technology aborts the particular write operation from the first transaction.
-
公开(公告)号:US11544244B2
公开(公告)日:2023-01-03
申请号:US17654296
申请日:2022-03-10
Applicant: Snowflake Inc.
Inventor: Jiaqi Yan , Thierry Cruanes , Jeffrey Rosen , William Waddington , Prasanna Rajaperumal , Abdul Munir
Abstract: Disclosed herein are embodiments of systems and methods for selecting partitions for reclustering based on distribution of overlapping partitions. In an example, a database platform makes a determination to at least partially recluster a database table that includes data stored across a plurality of partitions. The database platform responsively selects a subset of the partitions. The selecting of the subset includes identifying a point on a domain of a clustering key that corresponds to a local maximum of overlapping partitions, and also includes selecting the subset from among a group of overlapping partitions. The group includes at least one partition that overlaps the identified point on the domain of the clustering key. Each partition in the selected subset is above a reduction goal of overlapping partitions. The database platform at least partially reclusters the selected subset based on the clustering key.
-
公开(公告)号:US11461327B1
公开(公告)日:2022-10-04
申请号:US17658535
申请日:2022-04-08
Applicant: Snowflake Inc.
Inventor: Thierry Cruanes , Xuelai Cui , Sangyong Hwang , Allison Waingold Lee , Boyung Lee , Nicola Dan Onose , William Waddington , Jiaqi Yan , Li Yan , Yongsik Yoon
IPC: G06F16/2453
Abstract: The subject technology receives a query, the query including a set of statements for performing the query. The subject technology populates a compilation context based at least in part the query. The subject technology provides the compilation context to a compiler. The subject technology invokes the compiler to perform a compilation process based on the compilation context, the compilation process comprising performing a lookup operation on a stored plan cache for an exact match based on information from the compilation context, the stored plan cache including a set of stored query plans, and determining whether the exact match of a particular query plan is found in the stored plan cache to avoid compiling the query using the compilation context.
-
公开(公告)号:US11379492B2
公开(公告)日:2022-07-05
申请号:US17477663
申请日:2021-09-17
Applicant: Snowflake Inc.
Inventor: Jeffrey Rosen , Abdul Munir , Jiaqi Yan , William Waddington , Prasanna Rajaperumal , Thierry Cruanes
IPC: G06F16/2458 , G06F16/2453 , G06F9/50 , G06F16/2455
Abstract: Resource provisioning systems and methods are described. In an embodiment, a system includes a plurality of shared storage devices collectively storing database data, an execution platform, and a compute service manager. The compute service manager is configured to determine a task to be executed in response to a trigger event and determine a query plan for executing the task, wherein the query plan comprises a plurality of discrete subtasks. The compute service manager is further configured to assign the plurality of discrete subtasks to one or more nodes of a plurality of nodes of the execution platform, determine whether execution of the task is complete, and in response to determining the execution of the task is complete, store a record in the plurality of shared storage devices indicating the task was completed.
-
公开(公告)号:US20220188325A1
公开(公告)日:2022-06-16
申请号:US17249598
申请日:2021-03-05
Applicant: Snowflake Inc.
Inventor: Leonidas Galanis , Alexander Miller , William Waddington , Khaled Yagoub
IPC: G06F16/25 , G06F16/27 , G06F16/28 , G06F16/2455 , G06F16/2452
Abstract: A hybrid network-based database system for handling OLTP and OLAP queries using decoupled compute and storage devices. A set of decoupled compute instances perform transactions on an OLTP database, and the data is replicated to an OLAP database, which is managed by another set of decoupled compute instances. Further, in response to queries, the database system can retrieve data from the OLTP and OLAP database for merging and processing according to the query.
-
公开(公告)号:US20200233856A1
公开(公告)日:2020-07-23
申请号:US16838301
申请日:2020-04-02
Applicant: Snowflake Inc.
Inventor: Benoit Dageville , Martin Hentschel , William Waddington
IPC: G06F16/23 , G06F16/2455 , G06F16/22 , G06F21/60
Abstract: A method for a database system includes storing table data for a database, the table data including information in rows and columns of one or more database tables. The method includes storing metadata on immutable storage, the metadata including information about the table data for the database. In one embodiment, mutable metadata may be periodically consolidated in the background to create new versions of metadata files and which allows for deletions of old metadata files and old data files.
-
公开(公告)号:US20200026695A1
公开(公告)日:2020-01-23
申请号:US16514877
申请日:2019-07-17
Applicant: Snowflake Inc.
Inventor: Jiaqi Yan , Thierry Cruanes , Jeffrey Rosen , William Waddington , Prasanna Rajaperumal , Abdul Munir
Abstract: Automatic clustering of a database table is disclosed. A method for automatic clustering of a database table includes receiving an indication that a data modification task has been executed on a table and determining whether the table is sufficiently clustered. The method includes, in response to determining the table is not sufficiently clustered, selecting one or more micro-partitions of the table to be reclustered. The method includes assigning each of the one or more micro-partitions to an execution node to be reclustered.
-
-
-
-
-
-
-
-
-