-
Publication No.: US20250139076A1
Publication date: 2025-05-01
Application No.: US18496734
Filing date: 2023-10-27
Applicant: Snowflake Inc.
Inventor: Pradeep Dorairaj, Garrett Frere, Carsten Stann, Boyu Wang, Mengjia Xia
IPC: G06F16/22, G06F16/242, G06F16/2453
Abstract: The subject technology receives first data. The subject technology transforms the first data to a wide format, the wide format comprising a second table. The subject technology splits a set of rows into a set of shards of rows. The subject technology, for each shard from the set of shards, sends a particular user defined table function (UDTF), including a particular shard of rows, to a different execution node to perform a set of operations for determining a rolling correlation over a window size. The subject technology provides a set of output values of each particular UDTF corresponding to each shard from the set of shards in a second UDTF. The subject technology sends the second UDTF to a particular execution node to perform an aggregate operation. The subject technology receives a value of the rolling correlation from the particular execution node based on the aggregate operation.
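The shard-then-aggregate flow in this abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the function names, the overlap of `window - 1` rows between shards, and the use of a plain sorted merge as the stand-in for the aggregate step are all assumptions made for illustration, and the two input lists play the role of two columns of a wide-format table.

```python
from math import sqrt

def pearson(xs, ys):
    """Pearson correlation of two equal-length, non-constant sequences."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    vx = sum((x - mx) ** 2 for x in xs)
    vy = sum((y - my) ** 2 for y in ys)
    return cov / sqrt(vx * vy)

def split_into_shards(n_rows, shard_size, window):
    """Yield (start, end) row ranges. Each shard is extended by window - 1
    rows (an assumption) so that no window straddles a shard boundary."""
    start = 0
    while start + window <= n_rows:
        yield start, min(start + shard_size + window - 1, n_rows)
        start += shard_size

def rolling_corr_shard(start, xs, ys, window):
    """Stand-in for the per-shard UDTF: emit (last_row_index, correlation)."""
    return [
        (start + i + window - 1, pearson(xs[i:i + window], ys[i:i + window]))
        for i in range(len(xs) - window + 1)
    ]

def rolling_corr(xs, ys, window, shard_size):
    """Stand-in for the second UDTF: gather every shard's output and merge it."""
    results = []
    for start, end in split_into_shards(len(xs), shard_size, window):
        # In a distributed system each call would run on a different node.
        results.extend(rolling_corr_shard(start, xs[start:end], ys[start:end], window))
    return sorted(results)
```

Because every window is computed from exactly the same rows regardless of how the table is sharded, the sharded result matches a single-shard run.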
-
Publication No.: US12287898B2
Publication date: 2025-04-29
Application No.: US18155293
Filing date: 2023-01-17
Applicant: SNOWFLAKE INC.
Inventor: Boxin Jiang, Qiming Jiang
IPC: G06F16/2455, G06F21/62
Abstract: Embodiments of the present disclosure describe systems, methods, and computer program products for redacting sensitive data within a database. An example method can include: receiving a data query referencing unredacted data of a database, wherein the received data query comprises a value identifying a type of sensitive data to be redacted from the unredacted data; responsive to the data query, executing, by a processing device, a redaction operation to identify sensitive data that matches the type within the unredacted data of the database; and returning, in response to the data query, a redacted data set in which the sensitive data that matches the type is replaced or removed.
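As a rough illustration of type-driven redaction, the sketch below takes a value naming the sensitive-data type and replaces matches in one column. The `PATTERNS` table, the `redact` function, and the regular expressions are hypothetical; the abstract does not say how sensitive data is detected.

```python
import re

# Hypothetical detectors keyed by sensitive-data type (assumed, not from the patent).
PATTERNS = {
    "EMAIL": re.compile(r"[\w.+-]+@[\w-]+(?:\.[\w-]+)+"),
    "SSN": re.compile(r"\b\d{3}-\d{2}-\d{4}\b"),
}

def redact(rows, column, data_type, replacement="[REDACTED]"):
    """Return copies of rows with matches of data_type in column replaced.

    The input rows are left untouched, mirroring the idea that the stored
    data stays unredacted and only the returned data set is redacted.
    """
    pattern = PATTERNS[data_type]
    return [{**row, column: pattern.sub(replacement, row[column])} for row in rows]
```

Passing an empty string as `replacement` removes the matched text instead of replacing it.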
-
Publication No.: US12277124B2
Publication date: 2025-04-15
Application No.: US18425312
Filing date: 2024-01-29
Applicant: Snowflake Inc.
Inventor: Rachel Frances Blum, Justin Langseth, Michael Earle Rainey
IPC: G06F16/2455, G06F16/242, G06F16/27, G06F21/62
Abstract: Disclosed herein are systems and methods for query processing with restrictions in a database clean room. In an embodiment, a system receives a query directed to a combination of a first source dataset from a first database account of a distributed database and a second source dataset from a second database account of the distributed database. The system generates an approved statements table that contains database statement language that can be executed against the combination of the first and second source datasets. Based on determining that the approved statements table includes the query, the system executes the query to produce results data, and stores the results data in the first database account.
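The approval check described here can be sketched in a few lines of Python. The `CleanRoom` class, the whitespace-only `normalize` step, and the in-memory result store are assumptions for illustration; the abstract does not specify how a query is matched against the approved statements table.

```python
def normalize(sql):
    """Collapse whitespace and drop a trailing semicolon (an assumed matching rule)."""
    return " ".join(sql.strip().rstrip(";").split())

class CleanRoom:
    """Toy model: run a query only if it appears in the approved statements table."""

    def __init__(self, approved_statements):
        self.approved = {normalize(s) for s in approved_statements}
        self.results_by_account = {}  # hypothetical per-account result storage

    def run(self, query, execute, account):
        if normalize(query) not in self.approved:
            raise PermissionError("query is not in the approved statements table")
        results = execute(query)  # execute is a caller-supplied callable
        # Store the results in the requesting (first) account.
        self.results_by_account.setdefault(account, []).append(results)
        return results
```

A query that differs from an approved statement by anything other than whitespace or a trailing semicolon is rejected under this rule.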
-
Publication No.: US12277101B2
Publication date: 2025-04-15
Application No.: US18393279
Filing date: 2023-12-21
Applicant: Snowflake Inc.
Inventor: Srilakshmi Chintala, Jianzhun Du, Naresh Kumar, Srinath Shankar, Leonhard Franz Spiegelberg, Eric Shawn Vandenberg, Andong Zhan, Yun Zou
IPC: G06F16/22, G06F16/11, G06F16/242, G06F16/2453
Abstract: A method provides deterministic row ordering for query result sets in a database system. A framework receives a query referencing a database table and generates a row index column in response. Generating the row index column involves determining an order associated with the source files underlying the referenced table and assigning each table row a unique index value based on the file order and the row's position within its file. The framework executes the received query to produce a result set ordered by the generated row index column. The framework determines row order by analyzing underlying data file metadata and returns the result set with consistent, reliable ordering, enabling stable position-based access semantics when querying distributed database tables.
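A minimal sketch of the row index idea follows. It assumes that file order is derived from a creation timestamp with the file name as a tie-breaker; the abstract only says the order comes from file metadata, so that choice and the `assign_row_index` name are illustrative.

```python
def assign_row_index(files):
    """Assign each row a unique index from file order and in-file position.

    files: list of (file_name, created_at, rows) tuples. Ordering files by
    (created_at, file_name) is an assumption made for this sketch.
    """
    ordered = sorted(files, key=lambda f: (f[1], f[0]))
    indexed = []
    next_index = 0
    for _name, _created_at, rows in ordered:
        for row in rows:  # position within the file is preserved
            indexed.append((next_index, row))
            next_index += 1
    return indexed
```

Because the index depends only on file metadata and row position, repeated calls return the same ordering no matter how the file list was arranged on input.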
-
Publication No.: US20250117387A1
Publication date: 2025-04-10
Application No.: US18987419
Filing date: 2024-12-19
Applicant: Snowflake Inc.
Inventor: Thierry Cruanes, Benoit Dageville, Igor Demura, Brian Charles Forney, Weili Gu, Yujie Li, Mohammad Saber Mirzaei, Jaeyun Noh, Jiaqi Yan
IPC: G06F16/2453
Abstract: Interleaved infrastructure can interleave fragment execution during query compilation. For example, compilation can be paused, and a plan fragment is executed. The results of the plan fragment can be fed back into the compiler, which can then restart the compilation and modify the compilation based on the results of the plan fragment execution. The interleaved workflow provides a flexible, robust infrastructure, which can enable and improve a variety of database features and query optimizations.
-
Publication No.: US20250086168A1
Publication date: 2025-03-13
Application No.: US18955789
Filing date: 2024-11-21
Applicant: Snowflake Inc.
Inventor: Tyler Arthur Akidau, Tyler Jones, Daniel Mills, Leon Papke, Prasanna Rajaperumal, Daniel E. Sotolongo
Abstract: A method for query execution includes configuring, by at least one hardware processor, a materialized table (MT) based on a query and a base table. The MT includes a lag duration indicating a maximum time period that a result of a prior refresh of the query on the base table can lag behind a current time instance. The method further includes generating a query plan for the query. The method further includes determining multiple sets of data processing operations included in the query plan. The method further includes generating a plurality of intermediate MTs. The intermediate MTs hold an intermediate processing state for the multiple sets of data processing operations. The method further includes configuring a refresh of the MT based on the intermediate processing state for the multiple sets of data processing operations.
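The lag duration can be read as a simple staleness bound. The sketch below shows only that check; the `needs_refresh` name and its arguments are hypothetical, and the intermediate MTs and refresh scheduling described in the abstract are not modeled.

```python
from datetime import datetime, timedelta

def needs_refresh(last_refresh_time, now, lag):
    """True when the last refresh result is older than the allowed lag.

    last_refresh_time: data timestamp of the prior refresh (assumed input)
    lag: maximum period the MT may trail the current time
    """
    return now - last_refresh_time > lag
```

For example, with a five-minute lag, a result refreshed at 12:00 is still acceptable at 12:04 and due for refresh at 12:06.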
-
Publication No.: US12248448B1
Publication date: 2025-03-11
Application No.: US18451522
Filing date: 2023-08-17
Applicant: Snowflake Inc.
IPC: G06F16/215, G06F11/07, G06F16/23, G06F16/2453
Abstract: Provided herein are systems and methods for configuring integrity constraints (including a check constraint) and row violation logging using error tables. An example method includes decoding a query received at a network-based database system. The query includes a command to perform an operation on a base table. An integrity constraint associated with the base table is retrieved. The integrity constraint specifies a desired configuration for the base table. A verification of the integrity constraint is performed to detect erroneous data of the base table that violates the desired configuration. The erroneous data is input into an error table that is configured as a nested object of the base table. A notification that the erroneous data is available in the error table is generated and output.
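One way to picture a check constraint with row violation logging is the sketch below. The `BaseTable` class, the predicate-style `check`, and the choice to divert violating rows rather than keep them in the base table are assumptions; the abstract does not say what happens to the violating rows beyond logging them.

```python
class BaseTable:
    """Toy base table with an error table held as a nested object."""

    def __init__(self, check, constraint_name):
        self.rows = []
        self.error_table = []  # nested error table for violating rows
        self.check = check     # predicate implementing the check constraint
        self.constraint_name = constraint_name

    def insert(self, new_rows):
        """Insert rows; return a notification string if any row violates the check."""
        violations = [r for r in new_rows if not self.check(r)]
        self.rows.extend(r for r in new_rows if self.check(r))
        self.error_table.extend(
            {"row": r, "constraint": self.constraint_name} for r in violations
        )
        if violations:
            return f"{len(violations)} row(s) violated {self.constraint_name}; see error table"
        return None
```

A caller can inspect `error_table` after a notification to see which rows failed and which constraint they violated.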
-
Publication No.: US12242550B1
Publication date: 2025-03-04
Application No.: US18238986
Filing date: 2023-08-28
Applicant: Snowflake Inc.
Inventor: Shuodong Dang, Orestis Kostakis
IPC: G06F16/9532, G06F9/445, G06F16/9538
Abstract: A data access event may be recognized, using a browser plug-in, wherein the data access event constitutes a reference to previously obtained data. As a result of recognizing the event, the plug-in may send, to a search engine of a data exchange, a set of extracted terms. The plug-in may receive a set of related data listings related to the set of extracted terms. Upon a selection of a data listing from the set of related data listings, the plug-in may install the data listing to an account.
-
Publication No.: US12242457B2
Publication date: 2025-03-04
Application No.: US18459256
Filing date: 2023-08-31
Applicant: Snowflake Inc.
Inventor: Istvan Cseri, Tyler Jones, Daniel Mills, Daniel E. Sotolongo
IPC: G06F16/23, G06F16/22, G06F16/2455, G06F16/27
Abstract: Provided herein are systems and methods for a stream object configuration, including query processing of stream objects using stream expansion. For example, a method includes decoding a query to obtain a first data processing operation and a first stream object. The first stream object is associated with a view on a base table. A first stream expansion on the first stream object is performed. The first stream expansion is based on generating a second stream object on the base table. A second stream expansion of the second stream object is performed. The second stream expansion is based on replacing the second stream object with at least a second data processing operation. The query is executed based on completing the first data processing operation and the at least a second data processing operation.
-
Publication No.: US20250068628A1
Publication date: 2025-02-27
Application No.: US18392385
Filing date: 2023-12-21
Applicant: Snowflake Inc.
Inventor: Hossein Ahmandi, Jayanta Das, Joshua Klahr, Boyung Lee, Wenye Li, Abdul Q. Munir, Yi Pan
IPC: G06F16/2453, G06F16/2455, G06F16/27
Abstract: A method includes generating, by at least one hardware processor, a query plan of a query. The query includes a JOIN operation between first time series data and second time series data. At least one node in the query plan corresponding to the JOIN operation is modified to generate a modified query plan. The modifying is based on replacing the at least one node with a new node including a UNION operation. The UNION operation is based on the first time series data and the second time series data. Execution of the query is scheduled based on the modified query plan.
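The plan modification can be pictured as a tree rewrite. The sketch below shows only the structural substitution of a JOIN node by a UNION node; the `PlanNode` type and the operator names are hypothetical, and whatever further operators are needed to preserve the join's semantics are not modeled.

```python
from dataclasses import dataclass, field

@dataclass
class PlanNode:
    op: str  # operator name; the names used here are illustrative only
    children: list = field(default_factory=list)

def rewrite_join_to_union(node):
    """Return a new plan in which each JOIN over two time series scans
    is replaced by a UNION node over the same inputs."""
    children = [rewrite_join_to_union(c) for c in node.children]
    is_ts_join = (
        node.op == "JOIN"
        and len(children) == 2
        and all(c.op == "SCAN_TIMESERIES" for c in children)
    )
    return PlanNode("UNION" if is_ts_join else node.op, children)
```

The rewrite builds a fresh tree, so the original plan remains available if the modified plan is not chosen for execution.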