-
公开(公告)号:US11550793B1
公开(公告)日:2023-01-10
申请号:US17721599
申请日:2022-04-15
Applicant: Snowflake Inc.
Inventor: Florian Andreas Funke , Megha Thakkar
IPC: G06F16/24 , G06F16/2455
Abstract: Systems and methods for spilling data for hash joins are described. An example method includes determining an amount of available space in a first memory used by a set of relational queries is insufficient for a first relational join query. The first relational join query comprises a join operation. The method also includes determining a set of build memory sizes and a set of probe memory sizes for a set of partitions for the set of relational queries. The method further includes identifying a first partition of the set of partitions based on the set of probe memory sizes and the set of build memory sizes. The method further includes copying the first partition from the first memory to a second memory, wherein the first partition comprises a first build portion and a first probe portion.
-
公开(公告)号:US20230004672A1
公开(公告)日:2023-01-05
申请号:US17900759
申请日:2022-08-31
Applicant: SNOWFLAKE INC.
Inventor: Justin Langseth , Matthew J. Glickman , Christian Kleinerman , Robert Muglia , Daniel Freundel , Thierry Cruanes , Allison Waingold Lee
IPC: G06F21/62 , G06F16/2455 , H04L9/32
Abstract: Systems, methods, and devices for generating a secure join of database data are disclosed. A method creates a secure view of datapoints of a consumer account and processes, using a secure user defined function (UDF), the datapoints of the consumer account and datapoints of a provider account to generate a secure join key. The secure UDF returns a count of matching data points between the consumer account and the provider account, and the method provides the count of matching data points to the consumer account.
-
公开(公告)号:US11544269B2
公开(公告)日:2023-01-03
申请号:US17657019
申请日:2022-03-29
Applicant: Snowflake Inc.
Inventor: Thierry Cruanes , Benoit Dageville , Ismail Oukid , Stefan Richter
IPC: G06F16/24 , G06F16/2455 , G06F16/9035 , G06F16/28 , G06F16/22 , G06F17/18
Abstract: A query directed at a source table organized into a set of batch units is received. The query includes a pattern matching predicate that specifies a search pattern. A set of N-grams are generated based on the search pattern. A pruning index associated with the source table is accessed. The pruning index comprises a set of filters that index distinct N-grams in each column of the source table. The pruning index is used to identify a subset of batch units to scan for matching data based on the set of N-grams generated for the search pattern. The query is processed by scanning the subset of batch units.
-
公开(公告)号:US20220414106A1
公开(公告)日:2022-12-29
申请号:US17930150
申请日:2022-09-07
Applicant: Snowflake Inc.
Inventor: Tyler Arthur Akidau , Thierry Cruanes , Istvan Cseri , Benoit Dageville , Tyler Jones , Dinesh Chandrakant Kulkarni
IPC: G06F16/2455 , G06F16/2453
Abstract: Hybrid tables can be used in different use-case scenarios. Hybrid tables provide a flexible mechanism to support files and data in different formats while providing access to the different types of data as part of one table. This flexibility can allow the use of hybrid tables in data lake or other similar environments.
-
公开(公告)号:US20220414094A1
公开(公告)日:2022-12-29
申请号:US17823132
申请日:2022-08-30
Applicant: Snowflake Inc.
Inventor: Srilakshmi Chintala , Istvan Cseri , Albert L. Hu , Isaac Kunen , Nitya Kumar Sharma , Igor Zinkovsky
IPC: G06F16/2458 , G06F16/22
Abstract: A query referencing an external table function provided by a remote software component is received. Requests to execute the external table function on input data are sent to a proxy service. A first request includes a batch of input rows from the input data. A first response to the first request received from the proxy service includes a first portion of result data and a pagination token. The pagination token indicates that at least a second portion of the result data corresponding to the first batch of input rows is to be obtained from the remote software component. Based on the pagination token, a second request is sent to obtain the second portion of the result data. One or more responses are received from the proxy service that comprise at least the second portion of the result data. The result data is processed according to the query.
-
公开(公告)号:US20220405285A1
公开(公告)日:2022-12-22
申请号:US17822264
申请日:2022-08-25
Applicant: Snowflake Inc.
Inventor: Bicheng Gao , Srinivas Vemuri , Jiaqi Yan
IPC: G06F16/2455 , G06F11/34 , G06F16/248 , G06F16/22 , G06F16/2453
Abstract: The subject technology receives, during a query compilation process, a query directed to a set of source tables. The subject technology performs, during the query compilation process, a modification of the query for adjusting at least one pruning operation. The subject technology determines, during a pruning process of a second query, the second query directed to a set of files in a database system and including a set of pruning operations on the set of files, whether to perform a pruning cutoff on the set of pruning operations, the pruning process performing a depth first search of a pruner tree structure, the set of files comprising a set of micro-partitions. The subject technology performs the pruning cutoff based on the determining, the pruning cutoff ceasing at least one pruning operation from the set of pruning operations.
-
公开(公告)号:US20220405266A1
公开(公告)日:2022-12-22
申请号:US17821581
申请日:2022-08-23
Applicant: Snowflake Inc.
Inventor: Chen Luo , Alexander Miller , William Waddington , Zhaohui Zhang
IPC: G06F16/23
Abstract: The subject technology inserts, by a first transaction, a new version of an object, the first transaction including a first statement to perform an update operation to a row in a first table, the object corresponding to data in the row to be updated, the first statement including information comprising an object key associated with the object. The subject technology performs, by a second transaction, a range read, the range read including information indicating the object key. The subject technology receives a set of conflicting transactions from the range read. The subject technology determines that a conflict occurred between the first transaction and a third transaction from the set of conflicting transactions. The subject technology performs a restart of the first transaction in response to determining that the conflict occurred.
-
公开(公告)号:US11531681B2
公开(公告)日:2022-12-20
申请号:US17704783
申请日:2022-03-25
Applicant: Snowflake Inc.
Inventor: Pui Kei Johnston Chu , Benoit Dageville , Matthew J. Glickman , Christian Kleinerman , Prasanna Krishnan , Justin Langseth
IPC: G06F16/2455 , G06F16/25 , G06F21/62
Abstract: A method for accessing listings in a data exchange includes creating a first listing in a data exchange, the first listing referencing a first database of a plurality of databases and specifying identity-based sharing of the first database, creating a second listing in the data exchange, the second listing referencing a second database of the plurality of databases and data of the first database shared according to the identity-based sharing of the first database, and receiving an instruction from a user of the data exchange, the instruction referencing the second listing and instructing the addition of the second listing to a set of consumed data shares accessible by the user.
-
公开(公告)号:US20220391408A1
公开(公告)日:2022-12-08
申请号:US17396576
申请日:2021-08-06
Applicant: Snowflake Inc.
Inventor: Subramanian Muralidhar , Polita Paulus , Sahaj Saini , Saurin Shah , Srinidhi Karthik Bisthavalli Srinivasa
IPC: G06F16/27 , G06F16/25 , G06F16/955
Abstract: The embodiments described herein provide means for replicating external stages between deployments of e.g., a cloud data lake using a modified storage integration. The modified storage integration may be defined with multiple storage locations that it can point to, as well as a designation of an active storage location. The storage integration may also be defined with base file paths for each storage location as well as a relative file path which together may serve to synchronize data loading operations between deployments when e.g., a fail-over occurs from one deployment to another. The storage integration may be replicated from a first deployment to a second deployment, and when database replication occurs, an external stage may be replicated to the second deployment and bound to the replicated storage integration. Thus, a fail-over to the second deployment may result in a seamless transition of data loading processes to the second deployment.
-
公开(公告)号:US11520920B1
公开(公告)日:2022-12-06
申请号:US17580341
申请日:2022-01-20
Applicant: Snowflake Inc.
Inventor: Damien Carru , Jeremy Yujui Chen , Pui Kei Johnston Chu , Benoit Dageville , Subramanian Muralidhar
IPC: G06F12/00 , G06F16/00 , G06F21/62 , G06F16/21 , G06F16/2455
Abstract: Embodiments of the present disclosure provide an enhanced method of discovering shared objects that utilizes share authorization in addition to role authorization when a role is attempting to discover shared objects. A consumer account may invoke an operation referencing shared objects within a provider account using an imported database as a current session database. In response, a call context of the operation may be updated to save the imported database as a current session database and the imported database may be mapped to a first share and to a shared database. A first authorization based on whether the role has access privileges to the shared objects may be performed. The shared database may be used to identify schemas and the schemas may be used to identify shares associated with the imported database. A secondary authorization may be performed based on permissions that the shares associated with the imported database have on the shared objects.
-
-
-
-
-
-
-
-
-