DISTRIBUTED IN-DATABASE VECTORIZED OPERATIONS USING USER DEFINED TABLE FUNCTIONS

    公开(公告)号:US20250139076A1

    公开(公告)日:2025-05-01

    申请号:US18496734

    申请日:2023-10-27

    Applicant: Snowflake Inc.

    Abstract: The subject technology receives first data. The subject technology transforms first data to a wide format, the wide format comprising a second table. The subject technology splits a set of rows into a set of shards of rows. The subject technology, for each shard from the set of shards, sends a particular user defined table function (UDTF), including a particular shard of rows, to a different execution node to perform a set of operations for determining a rolling correlation over a window size. The subject technology provides a set of output values of each particular UDTF corresponding to each shard from the set of shards in a second UDTF. The subject technology sends the second UDTF to a particular execution node to perform an aggregate operation. The subject technology receives a value of the rolling correlation from the particular execution node based on the aggregate operation.

    Query-based database redaction
    302.
    发明授权

    公开(公告)号:US12287898B2

    公开(公告)日:2025-04-29

    申请号:US18155293

    申请日:2023-01-17

    Applicant: SNOWFLAKE INC.

    Abstract: Embodiments of the present disclosure describe systems, methods, and computer program products for redacting sensitive data within a database. An example method can include receiving a data query referencing unredacted data of a database, wherein the data query that is received comprises a value identifying a type of sensitive data to be redacted from the unredacted data, responsive to the data query, executing, by a processing device, a redaction operation to identify sensitive data that matches the type within the unredacted data of the database, and returning a redacted data set in which the sensitive data that matches the type is replaced or removed to the data query.

    Query processing with restrictions in a database clean room

    公开(公告)号:US12277124B2

    公开(公告)日:2025-04-15

    申请号:US18425312

    申请日:2024-01-29

    Applicant: Snowflake Inc.

    Abstract: Disclosed herein are systems and methods for query processing with restrictions in a database clean room. In an embodiment, a system receives a query directed to a combination of a first source dataset from a first database account of a distributed database and a second source dataset from a second database account of the distributed database. The system generates an approved statements table that contains database statement language that can be executed against the combination of the first and second source datasets. Based on determining that the approved statements table includes the query, the system executes the query to produce results data, and stores the results data in the first database account.

    QUERY EXECUTION USING INTERMEDIATE MATERIALIZED TABLES

    公开(公告)号:US20250086168A1

    公开(公告)日:2025-03-13

    申请号:US18955789

    申请日:2024-11-21

    Applicant: Snowflake Inc.

    Abstract: A method for query execution includes configuring, by at least one hardware processor, a materialized table (MT) based on the query and a base table. The MT includes a lag duration indicating a maximum time period that a result of a prior refresh of the query on the base table can lag behind a current time instance. The method further includes generating a query plan for the query. The method further includes determining multiple sets of data processing operations included in the query plan. The method further includes generating a plurality of intermediate MTs. The one or more intermediate MTs hold an intermediate processing state for the multiple sets of data processing operations. The method further includes configuring a refresh of the MT based on the intermediate processing state for the multiple sets of data processing operations.

    Configuring check constraint and row violation logging using error tables

    公开(公告)号:US12248448B1

    公开(公告)日:2025-03-11

    申请号:US18451522

    申请日:2023-08-17

    Applicant: Snowflake Inc.

    Abstract: Provided herein are systems and methods for configuring integrity constraints (including a check constraint) and row violation logging using error tables. An example method includes decoding a query received at a network-based database system. The query includes a command to perform an operation on a base table. An integrity constraint associated with the base table is retrieved. The integrity constraint specifies a desired configuration for the base table. A verification of the integrity constraint is performed to detect erroneous data of the base table that violates the desired configuration. The erroneous data is input into an error table that is configured as a nested object of the base table. A notification that the erroneous data is available in the error table is generated and output.

    Browser plug-in for marketplace recommendations

    公开(公告)号:US12242550B1

    公开(公告)日:2025-03-04

    申请号:US18238986

    申请日:2023-08-28

    Applicant: Snowflake Inc.

    Abstract: A data access event may be recognized, using a browser plug-in, wherein the data access event constitutes a reference to previously obtained data. As a result of recognizing the event, the plug-in may send, to a search engine of a data exchange, a set of extracted terms. The plug-in may receive a set of related data listings related to the set of extracted terms. Upon a selection of a data listing from the set of related data listings, the plug-in may install the data listing to an account.

    Query processing of stream objects using stream expansion

    公开(公告)号:US12242457B2

    公开(公告)日:2025-03-04

    申请号:US18459256

    申请日:2023-08-31

    Applicant: Snowflake Inc.

    Abstract: Provided herein are systems and methods for a stream object configuration, including query processing of stream objects using stream expansion. For example, a method includes decoding a query to obtain a first data processing operation and a first stream object. The first stream object is associated with a view on a base table. A first stream expansion on the first stream object is performed. The first stream expansion is based on generating a second stream object on the base table. A second stream expansion of the second stream object is performed. The second stream expansion is based on replacing the second stream object with at least a second data processing operation. The query is executed based on completing the first data processing operation and the at least a second data processing operation.

Patent Agency Ranking