-
公开(公告)号:US20210019334A1
公开(公告)日:2021-01-21
申请号:US17062859
申请日:2020-10-05
Applicant: Snowflake Inc.
Inventor: Ashish Motivala , Yi Fang , Martin Hentschel , Benoit Dageville , Spyros Triantafyllis , Yizhi Zhu
IPC: G06F16/27 , G06F16/245 , G06F16/22
Abstract: The subject technology stores database data in a table, the table comprising a plurality of micro-partitions each having rows and columns. The subject technology stores metadata for the table in a data structure, the metadata comprising a first level of the data structure comprising a column expression property comprising information about data stored in a column of a micro-partition of the table, a micro-partition expression property comprising information about data stored in the micro-partition of the table, and a second level of the data structure comprising a grouping expression property comprising information about data stored in the micro-partition and one or more additional micro-partitions of the table.
-
公开(公告)号:US20200327115A1
公开(公告)日:2020-10-15
申请号:US16915435
申请日:2020-06-29
Applicant: Snowflake Inc.
Inventor: Benoit Dageville , Martin Hentschel , William Waddington
IPC: G06F16/23 , G06F21/60 , G06F16/22 , G06F16/2455
Abstract: A method for a database system includes storing table data for a database, the table data including information in rows and columns of one or more database tables. The method includes storing metadata on immutable storage, the metadata including information about the table data for the database. In one embodiment, mutable metadata may be periodically consolidated in the background to create new versions of metadata files and which allows for deletions of old metadata files and old data files.
-
公开(公告)号:US10798100B1
公开(公告)日:2020-10-06
申请号:US16746758
申请日:2020-01-17
Applicant: Snowflake Inc.
Inventor: Pui Kei Johnston Chu , Benoit Dageville , Matthew Glickman , Christian Kleinerman , Prasanna Krishnan , Justin Langseth
Abstract: Systems and methods for managing membership in a private data exchange are provided herein. In one embodiment, the method comprises adding a new member to a data exchange, wherein the data exchange comprises a set of listings, each listing comprising data from one or more datasets stored on a cloud computing service. An account type is assigned to the new member, the account type comprising at least one of a consumer type, provider type, or exchange administrator type. Each of the account types associated with one or more rights with respect to the set of listings, such that the new member has a set of rights based on the account type. The set of rights of the new member are then modified with respect to one or more listings from the set of listings based on a set of visibility and access rules for each of the set of listings.
-
公开(公告)号:US10776390B2
公开(公告)日:2020-09-15
申请号:US16823287
申请日:2020-03-18
Applicant: SNOWFLAKE INC.
Inventor: Benoit Dageville , Thierry Cruanes , Marcin Zukowski , Allison Waingold Lee , Phillip Thomas Unterbrunner
IPC: G06F16/00 , G06F16/27 , G06F9/50 , G06F16/14 , G06F16/21 , G06F16/22 , G06F16/951 , G06F16/182 , G06F16/23 , G06F16/2455 , G06F16/2458 , G06F16/9535 , G06F16/2453 , G06F9/48 , H04L29/08
Abstract: A method, apparatus, and system for join operations of a plurality of relations that are distributed over a plurality of storage locations over a network of computing components.
-
公开(公告)号:US10776388B2
公开(公告)日:2020-09-15
申请号:US16741660
申请日:2020-01-13
Applicant: Snowflake Inc.
Inventor: Benoit Dageville , Thierry Cruanes , Marcin Zukowski
IPC: G06F9/50 , G06F9/455 , G06F15/173 , H04L29/08 , H04L12/911 , G06F16/27 , G06F16/14 , G06F16/21 , G06F16/22 , G06F16/951 , G06F16/182 , G06F16/23 , G06F16/2455 , G06F16/2458 , G06F16/9535 , G06F16/2453 , G06F9/48
Abstract: A system is described inkling a cache memory and a hardware processor. The hardware processor to determine that at least one of additional data storage capacity and additional processing capacity are needed based on data processing requests of a plurality of computerized query sources to be executed by an execution platform comprising a plurality of nodes, where the data processing requests are directed to database data stored on a plurality of shared storage devices collectively storing the database data. The hardware processor further to increase, in response to the determining, an amount of the data cached by the execution platform by adding one or more nodes to the plurality of nodes.
-
公开(公告)号:US10769150B1
公开(公告)日:2020-09-08
申请号:US16727315
申请日:2019-12-26
Applicant: Snowflake Inc.
Inventor: Thierry Cruanes , Benoit Dageville , Ismail Oukid , Stefan Richter
IPC: G06F16/00 , G06F16/2455 , G06F16/9035 , G06F16/28 , G06F17/18 , G06F16/22
Abstract: A source table organized into a set of micro-partitions is accessed by a network-based data warehouse. A pruning index is generated based on the source table. The pruning index comprises a set of filters that indicate locations of distinct values in each column of the source table. A query directed at the source table is received at the network-based data warehouse. The query is processed using the pruning index. The processing of the query comprises pruning the set of micro-partitions of the source table to scan for data matching the query, the pruning of the plurality of micro-partitions comprising identifying, using the pruning index, a sub-set of micro-partitions to scan for the data matching the query.
-
公开(公告)号:US20200272638A1
公开(公告)日:2020-08-27
申请号:US16775092
申请日:2020-01-28
Applicant: Snowflake Inc
Inventor: Ashish Motivala , Yi Fang , Martin Hentschel , Benoit Dageville , Spyros Triantafylis , Yizhi Zhu
IPC: G06F16/27 , G06F16/22 , G06F16/245
Abstract: Systems, methods, and devices for multiple level metadata organization in database systems. A system includes a plurality of shared storage devices collectively storing database data across a plurality of tables that each comprise one or more micro-partitions including immutable storage devices. The system includes a resource manager configured to manage the storage of database data stored in a table across one or more of the plurality of shared storage devices. The resource manager is configured to manage the storage of metadata for the table. The metadata includes a column expression property with information about data stored in a column, a micro-partition expression property with information about data stored in a micro-partition, a grouping expression property with information about data stored in two or more micro-partitions, and cumulative table metadata with global information about all micro-partitions of the table.
-
公开(公告)号:US20200272628A1
公开(公告)日:2020-08-27
申请号:US16863831
申请日:2020-04-30
Applicant: Snowflake Inc.
Inventor: Benoit Dageville , Yi Fang , Martin Hentschel , Ashish Motivala , Spyridon Triantafyllis , Yizhi Zhu
IPC: G06F16/2455 , G06F16/23
Abstract: The subject technology receives first metadata corresponding to a set of micro-partitions. The subject technology generates second metadata for a grouping of the first metadata. The subject technology generates a first data structure including the first metadata and a second data structure including the second metadata, the second data structure including information associating the second metadata to the first metadata. The subject technology stores the first data structure and the second data structure in persistent storage as a first file and a second file. The subject technology receives a query on a table. Further, the subject technology analyzes the query against cumulative table metadata to determine whether data stored in the table matches the query.
-
公开(公告)号:US20200210449A1
公开(公告)日:2020-07-02
申请号:US16814711
申请日:2020-03-10
Applicant: Snowflake Inc.
Inventor: Ashish Motivala , Benoit Dageville
IPC: G06F16/27 , G06F16/951 , G06F9/50 , H04L29/08 , G06F16/22 , G06F16/2455 , G06F9/48 , G06F16/182 , G06F16/2453 , G06F16/21 , G06F16/2458 , G06F16/23 , G06F16/14 , G06F16/9535
Abstract: Example systems and methods for cloning catalog objects are described. In one implementation, a method includes creating a copy of a catalog object without copying a data associated with the catalog object by only coping metadata associated with the object. The method further includes modifying, by one or more processors, the data associated with the catalog object independently of the copy of the catalog object such that modified data is not visible to the copy of the catalog object and wherein the modified data is not visible to the catalog object in a database. The method further includes deleting the copy of the catalog object following completion of the modification of the data associated with the catalog object.
-
公开(公告)号:US10678753B2
公开(公告)日:2020-06-09
申请号:US16688888
申请日:2019-11-19
Applicant: Snowflake Inc.
Inventor: Marcin Zukowski , Benoit Dageville , Thierry Cruanes , Ashish Motivala
IPC: G06F16/00 , G06F16/16 , G06F16/22 , G06F16/215 , G06F16/9535 , G06F16/2453
Abstract: A system and method for pruning data based on metadata. The method may include receiving a query that includes a plurality of predicates and identifying one or more applicable files including database data satisfying at least one of the plurality of predicates. The identifying the one or more applicable files including reading metadata stored in a metadata store that is separate from the database data. The method further includes pruning inapplicable files comprising database data that does not satisfy at least one of the plurality of predicates to create a reduced set of files and reading the reduced set of files to execute the query.
-
-
-
-
-
-
-
-
-