-
公开(公告)号:US20220358141A1
公开(公告)日:2022-11-10
申请号:US17815375
申请日:2022-07-27
Applicant: Snowflake Inc.
Inventor: Pui Kei Johnston Chu , Benoit Dageville , Shreyas Narendra Desai , Sameer Deshpande , Subramanian Muralidhar , Di Wu
IPC: G06F16/27
Abstract: Techniques for database replication using replication preparation objects include storing, by at least one hardware processor, at least one task object at a remote deployment of a data provider. A replication request originating from an account of a data consumer is detected at the remote deployment using the at least one task object. The replication request is associated with a database available at a primary deployment of the data provider. Responsive to the detecting of the replication request, a local instance of the database is generated at the remote deployment. The account of the data consumer at the remote deployment is configured to access the local instance of the database.
-
392.
公开(公告)号:US11494369B2
公开(公告)日:2022-11-08
申请号:US17536180
申请日:2021-11-29
Applicant: Snowflake Inc.
Inventor: Benoit Dageville , Johan Harjono , Kunal Prafulla Nabar , Steven James Pelley
IPC: G06F16/24 , G06F16/23 , G06F16/2455
Abstract: Systems, methods, and devices for retrying a query. A method includes receiving a query directed to database data and assigning execution of the query to one or more execution nodes of an execution platform, the one or more execution nodes configured to execute the query on a first version of a database platform. The method includes determining that execution of the query was unsuccessful. The method includes assigning a first retry execution of the query to the one or more execution nodes of the execution platform and determining whether a regression or an intermittent fault caused the execution of the query to be unsuccessful based at least in part on whether the first retry execution of the query was successful or unsuccessful.
-
公开(公告)号:US20220327040A1
公开(公告)日:2022-10-13
申请号:US17809780
申请日:2022-06-29
Applicant: Snowflake Inc.
Inventor: Benoit Dageville , Johan Harjono , Simon Holm Jensen , Kunal Prafulla Nabar , Steven James Pelley
IPC: G06F11/30 , G06F16/2458 , G06F11/34 , G06F16/242
Abstract: Techniques for automated query retry in a database platform include decoding, by at least one hardware processor, a query directed to database data from a client account of a database platform. The method further includes decoding, by the at least one hardware processor, an indication that execution of the query on at least one computing node of the database platform results in a failed execution. The method further includes configuring a processing loop with continuous retry executions of the query on the at least one computing node based on the indication. The method further includes exiting the processing loop based on detecting a retry execution of the continuous retry executions results in at least one successful execution of the query. The method includes logging each attempt to execute the query during the continuous retry executions in a query status log until the at least one successful execution of the query.
-
公开(公告)号:US11429639B2
公开(公告)日:2022-08-30
申请号:US17570638
申请日:2022-01-07
Applicant: SNOWFLAKE INC.
Inventor: Thierry Cruanes , Benoit Dageville , Allison Waingold Lee
IPC: G06F16/27 , G06F9/50 , G06F16/14 , G06F16/21 , G06F16/22 , G06F16/951 , G06F16/182 , G06F16/23 , G06F16/2455 , G06F16/2458 , G06F16/9535 , G06F16/2453 , H04L67/568 , A61F5/56 , G06F9/48 , H04L67/1095 , H04L67/1097
Abstract: A system and method for managing data storage and data access with querying data in a distributed system without buffering the results on intermediate operations in disk storage.
-
公开(公告)号:US20220269674A1
公开(公告)日:2022-08-25
申请号:US17650462
申请日:2022-02-09
Applicant: Snowflake Inc.
Inventor: Subramanian Muralidhar , Benoit Dageville , Thierry Cruanes , Nileema Shingte , Saurin Shah , Torsten Grabs , Istvan Cseri
IPC: G06F16/242 , G06F3/06 , G06F16/2453 , G06F16/25 , G06F16/23
Abstract: Disclosed herein are embodiments of systems and methods for partition-based scanning of external tables for query processing. In an example embodiment, a database platform receives a query that includes one or more predicates, where the query is directed at least to data in an external table that is stored in an external storage platform that is external to the database platform. The database platform identifies, based on metadata that summarizes the data in the external table, one or more partitions of the external table that potentially include data that satisfies the one or more predicates. The database platform also identifies, from the one or more identified partitions, data that satisfies the one or more predicates. The database platform sends a response to the query to the client, the response comprising the data satisfying the one or more predicates.
-
公开(公告)号:US11409768B2
公开(公告)日:2022-08-09
申请号:US17524439
申请日:2021-11-11
Applicant: SNOWFLAKE INC.
Inventor: Benoit Dageville , Thierry Cruanes , Marcin Zukowski
IPC: G06F15/167 , G06F16/27 , G06F9/50 , G06F16/14 , G06F16/21 , G06F16/22 , G06F16/951 , G06F16/182 , G06F16/23 , G06F16/2455 , G06F16/2458 , G06F16/9535 , G06F16/2453 , H04L67/568 , A61F5/56 , G06F9/48 , H04L67/1095 , H04L67/1097
Abstract: Example resource management systems and methods are described. In one implementation, a processing device is to: provide a plurality of virtual processors, each virtual processor comprising a cache memory to cache a portion of database data accessed from a storage platform; receive a query referencing the database data stored in the storage platform; determine at least one task to process the received query; and allocate the at least one task to a subset of the plurality of virtual processors to process the at least one task. The allocation may be based at least in part on the respective portions of the database data that are cached within the respective cache memories of the subset of the plurality of virtual processors.
-
公开(公告)号:US20220222161A1
公开(公告)日:2022-07-14
申请号:US17657393
申请日:2022-03-31
Applicant: Snowflake, Inc.
Inventor: Benoit Dageville , Johan Harjono , Simon Holm Jensen , Kunal Prafulla Nabar , Steven James Pelley
IPC: G06F11/30 , G06F11/32 , G06F16/248 , G06F16/2458 , G06F16/242 , G06F11/34 , G06F11/14
Abstract: Disclosed herein are embodiments of systems and methods for query-attempt processing in a database environment. In an embodiment, a database platform receives a query, and operates a first query manager of the database platform to manage a first attempt to execute the query. Based on determining that the attempt to execute the query was unsuccessful, the database platform identifies a second query manager of the database platform as implementing a same version of the database platform as the first query manager, and operates the second query manager to manage a first retry attempt to execute the query.
-
公开(公告)号:US20220215027A1
公开(公告)日:2022-07-07
申请号:US17657019
申请日:2022-03-29
Applicant: Snowflake Inc.
Inventor: Thierry Cruanes , Benoit Dageville , Ismail Oukid , Stefan Richter
IPC: G06F16/2455 , G06F16/9035 , G06F16/28 , G06F16/22 , G06F17/18
Abstract: A query directed at a source table organized into a set of batch units is received. The query includes a pattern matching predicate that specifies a search pattern. A set of N-grams are generated based on the search pattern. A pruning index associated with the source table is accessed. The pruning index comprises a set of filters that index distinct N-grams in each column of the source table. The pruning index is used to identify a subset of batch units to scan for matching data based on the set of N-grams generated for the search pattern. The query is processed by scanning the subset of batch units.
-
公开(公告)号:US20220188297A1
公开(公告)日:2022-06-16
申请号:US17653491
申请日:2022-03-04
Applicant: Snowflake Inc.
Inventor: Istvan Cseri , Torsten Grabs , Benoit Dageville
Abstract: A method includes generating a task using a plurality of logical statements embedded in a database, the plurality of logical statements corresponding to a data modification. Database data is ingested into a staging table that is configured within the database. The task is executed based on applying the data modification to a first set of partitions storing the database data and generating a second set of partitions. The second set of partitions store modified data corresponding to the database data. A stream of committed transactions is advanced at least in part by adding an entry into the stream. The entry corresponds to committed transactions performed on the first set of partitions during the data modification. A data processing task is scheduled for execution on the modified data based on the advancing of the stream offset.
-
公开(公告)号:US20220179846A1
公开(公告)日:2022-06-09
申请号:US17652504
申请日:2022-02-25
Applicant: Snowflake Inc.
Inventor: Jiansheng Huang , Jiaxing Liang , Scott Ziegler , Haowei Yu , Benoit Dageville , Varun Ganesh
Abstract: Systems, methods, and devices for batch ingestion of data into a table of a database. A method includes determining a notification indicating a presence of a user file received from a client account to be ingested into a database. The method includes identifying data in the user file and identifying a target table of the database to receive the data in the user file. The method includes generating an ingest task indicating the data and the target table. The method includes assigning the ingest task to an execution node of an execution platform, wherein the execution platform comprises a plurality of execution nodes operating independent of a plurality of shared storage devices collectively storing database data. The method includes registering metadata concerning the target table in a metadata store after the data has been fully committed to the target table by the execution node.
-
-
-
-
-
-
-
-
-