-
Publication No.: US20240232224A1
Publication date: 2024-07-11
Application No.: US18610863
Application date: 2024-03-20
Applicant: Snowflake Inc.
Inventor: Tyler Arthur Akidau, Istvan Cseri, Tyler Jones, Dinesh Chandrakant Kulkarni, Daniel Mills, Daniel E. Sotolongo, Di Fei Zhang
IPC: G06F16/27
CPC classification number: G06F16/273
Abstract: Techniques for triggering pipeline execution based on data change (transaction commit) are described. The pipelines can be used for data ingestion or other specified tasks. These tasks can be operational across account, organization, cloud region, and cloud provider boundaries. The tasks can be triggered by commit post-processing. Gates in the tasks can be set up to reference change data capture information. If the gate is satisfied, tasks can be executed to set up data pipelines.
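A minimal sketch of the gate idea described in this abstract, assuming a hypothetical in-memory model: a commit post-processing hook receives a change-data-capture summary, evaluates each task's gate against it, and runs only the tasks whose gate is satisfied. The names `ChangeSet`, `Task`, and `CommitPostProcessor` are illustrative assumptions and do not come from the patent or from any Snowflake API.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class ChangeSet:
    """Hypothetical CDC summary for one committed transaction."""
    table: str
    rows_inserted: int = 0
    rows_deleted: int = 0


@dataclass
class Task:
    """A pipeline step guarded by a gate over the change data (assumed shape)."""
    name: str
    gate: Callable[[ChangeSet], bool]
    action: Callable[[ChangeSet], None]


class CommitPostProcessor:
    """Runs after a commit and fires every task whose gate is satisfied."""

    def __init__(self) -> None:
        self._tasks: List[Task] = []

    def register(self, task: Task) -> None:
        self._tasks.append(task)

    def on_commit(self, change: ChangeSet) -> List[str]:
        fired = []
        for task in self._tasks:
            if task.gate(change):      # gate references the change data
                task.action(change)    # gate satisfied: execute the task
                fired.append(task.name)
        return fired


# Usage: an ingestion task that only runs when rows land in "orders".
ingested = []
processor = CommitPostProcessor()
processor.register(Task(
    name="ingest_orders",
    gate=lambda c: c.table == "orders" and c.rows_inserted > 0,
    action=lambda c: ingested.append((c.table, c.rows_inserted)),
))
```

Calling `processor.on_commit(...)` with a change to another table leaves `ingested` untouched, which is the point of the gate: the pipeline is driven by the committed change rather than by a schedule.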
-
Publication No.: US20240232019A1
Publication date: 2024-07-11
Application No.: US18618273
Application date: 2024-03-27
Applicant: Snowflake Inc.
Inventor: Vlad Bunescu, Yan Huang, Jaeha Lee, Shiyu Qu, Jiaqi Yan
CPC classification number: G06F11/1433, G06F11/3414, G06F16/213, G06F2201/80
Abstract: A method includes updating, by at least one hardware processor, a table with a detected regression associated with database code of a database. A first stored procedure is performed to determine a root cause of the detected regression. A second stored procedure is performed to determine an impact of the detected regression based at least on the root cause. A determination is made on whether to perform mitigation of the detected regression based on the impact.
-
Publication No.: US12032685B2
Publication date: 2024-07-09
Application No.: US17936770
Application date: 2022-09-29
Applicant: Snowflake Inc.
Inventor: Thierry Cruanes, Ganeshan Ramachandran Iyer, Isaac Kunen
IPC: G06F21/54, G06F16/2455, G06F21/53, G06F21/60
CPC classification number: G06F21/54, G06F16/2455, G06F21/53, G06F21/602, G06F2221/033
Abstract: The logging techniques described herein can enable using logging tools without having to use different methods for sandbox implementations and push out the log data to storage without problems. The log data is treated as sensitive data and is protected according to the defined security policies. Further, the results may be compressed and encrypted.
-
Publication No.: US12032588B2
Publication date: 2024-07-09
Application No.: US18160802
Application date: 2023-01-27
Applicant: SNOWFLAKE INC.
Inventor: Pui Kei Johnston Chu, Benoit Dageville, Matthew J. Glickman, Christian Kleinerman, Prasanna Krishnan, Justin Langseth
IPC: G06F16/21, G06F16/2455, G06F16/25, G06F21/62
CPC classification number: G06F16/254, G06F16/21, G06F16/2456, G06F21/6218
Abstract: A method includes creating, by a first provider, a first listing referencing first shared data and comprising first access controls, wherein access to the first shared data by a second provider is filtered based on the first access controls, creating, by the second provider, a second listing referencing second shared data and the first shared data filtered based on the first access controls, and adding the second listing to a catalog in a data exchange, the catalog comprising metadata describing the second shared data.
-
Publication No.: US12026159B2
Publication date: 2024-07-02
Application No.: US18059125
Application date: 2022-11-28
Applicant: Snowflake Inc.
Inventor: Shrainik Jain, Allison Waingold Lee, Boyung Lee, Prasanna Rajaperumal, Jiaqi Yan, Yongsik Yoon
IPC: G06F16/2453
CPC classification number: G06F16/24542
Abstract: Queries executed against a materialized view can execute up to orders of magnitude faster than equivalent queries on a source (or base) table. However, although a query can reference a materialized view directly, a user (e.g., query author) may not know about a relevant materialized view. Moreover, if a source table has multiple materialized views generated, the user may not know which materialized view to reference in the query. Thus, embodiments of the present disclosure provide techniques for automatically rewriting queries directed to a source table to utilize existing materialized views.
-
Publication No.: US12026140B1
Publication date: 2024-07-02
Application No.: US18112198
Application date: 2023-02-21
Applicant: Snowflake Inc.
Inventor: Vlad Bunescu, Joshua Klahr, Louis Magarshack, Shiyu Qu, Zerui Wei, Jiaqi Yan
CPC classification number: G06F16/2228, G06F16/254
Abstract: Methods, systems, and computer programs are presented for providing performance metrics in an online performance analysis system employing customer production workloads. A plurality of metric source data is received from a cloud data platform. A workload is identified as a stable workload candidate based at least in part on the plurality of metric source data. The cloud data platform generates a performance index based on the workload being identified as a stable workload candidate. The performance index is tracked over a period of time to identify changes in workload.
-
Publication No.: US12026129B2
Publication date: 2024-07-02
Application No.: US17657229
Application date: 2022-03-30
Applicant: Snowflake Inc.
Inventor: Selcuk Aya, Ju-yi Kuo, Jonathan Lee Leang, Nitya Kumar Sharma, Igor Zinkovsky
IPC: G06F15/16, G06F16/17, G06F16/172, G06F16/176
CPC classification number: G06F16/1774, G06F16/172, G06F16/1734
Abstract: Systems and methods for caching file system collections atomically. The systems and methods perform operations comprising: receiving a request to access a collection of data comprising a plurality of files stored on a storage device; in response to receiving the request, transferring the collection of data from the storage device to a first subfolder in a cache associated with the storage device; generating a lock file comprising a reference to the first subfolder in the cache; and atomically controlling access to the collection of data in the first subfolder and removal of the collection of data in the first subfolder from the cache via the lock file.
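A minimal sketch of the lock-file pattern described in this abstract, under stated assumptions: the collection is copied into a fresh cache subfolder, a lock file holding that subfolder's path is published with an atomic rename, readers resolve the collection only through the lock file, and eviction removes the lock file before deleting the data. The function names and the `.lock` naming are hypothetical, and the patent's actual locking protocol may differ.

```python
import os
import shutil
import tempfile
from pathlib import Path
from typing import Optional


def cache_collection(source_dir: Path, cache_root: Path, name: str) -> Path:
    """Copy every file of a collection into a new cache subfolder, then publish it."""
    subfolder = Path(tempfile.mkdtemp(prefix=name + "-", dir=cache_root))
    for f in source_dir.iterdir():
        if f.is_file():
            shutil.copy2(f, subfolder / f.name)
    # Write the lock file under a temporary name, then rename it into place.
    # os.replace is atomic, so readers see either no lock file or a complete one.
    tmp_lock = cache_root / (name + ".lock.tmp")
    tmp_lock.write_text(str(subfolder))
    os.replace(tmp_lock, cache_root / (name + ".lock"))
    return subfolder


def read_collection(cache_root: Path, name: str) -> Optional[Path]:
    """Return the cached subfolder, or None if the collection is not published."""
    lock = cache_root / (name + ".lock")
    try:
        return Path(lock.read_text())
    except FileNotFoundError:
        return None


def evict_collection(cache_root: Path, name: str) -> bool:
    """Unpublish the collection by removing the lock file, then delete its data."""
    lock = cache_root / (name + ".lock")
    try:
        subfolder = Path(lock.read_text())
        lock.unlink()
    except FileNotFoundError:
        return False
    shutil.rmtree(subfolder, ignore_errors=True)
    return True
```

Because the whole collection becomes visible or invisible through a single file, a reader never observes a half-copied set of files. This sketch does not guard against a reader that is still using a subfolder while it is evicted; a fuller design would need reference counting or a similar mechanism.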
-
Publication No.: US20240211491A1
Publication date: 2024-06-27
Application No.: US18599647
Application date: 2024-03-08
Applicant: Snowflake Inc.
Inventor: Vasile Paraschiv, Saurin Shah, Marianne Shaw, Nileema Shingte
IPC: G06F16/27, G06F9/30, G06F16/11, G06F16/13, G06F16/182, G06F16/22, G06F16/2455, G06F16/25, G06F16/28
CPC classification number: G06F16/278, G06F9/3009, G06F16/137, G06F16/182, G06F16/2282, G06F16/24554, G06F16/283, G06F16/116, G06F16/254
Abstract: A database export system exports data using a plurality of nodes that process the data to generate structured result files that are partitioned by an export parameter in an export request. The database export system distributes the data and merges the files to avoid small file creation and increase processing speed via parallelism. The database export system generates the result files of a specified maximum size in a final format, where the files are processed and merged in a temporary file format. The parallel processing is optimized and constrained per the number of processing nodes, available memory, requested final file sizes, and operation-based ordering to complete data exports in a scalable multi-stage approach.
-
Publication No.: US12008001B2
Publication date: 2024-06-11
Application No.: US17804434
Application date: 2022-05-27
Applicant: Snowflake Inc.
Inventor: Matthew J. Glickman, Orestis Kostakis, Justin Langseth
IPC: G06F16/245, G06F16/24, G06F16/242, G06F16/2455
CPC classification number: G06F16/24568, G06F16/244, G06F16/2456, G06F16/24564
Abstract: Systems, methods, and machine-readable storage devices provide for identifying a user dataset on a distributed database. The system includes generating a similarity score dataset that indicates a similarity between the user dataset and a plurality of datasets of other users of the distributed database. The system generates a plurality of overlap queries that are configured to output overlap datasets between the user dataset and one or more of the plurality of datasets. The system further generates a results dataset by applying one or more of the plurality of overlap queries to a joined dataset comprising data from the user dataset and one of the plurality of datasets of other users on the distributed database.
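As a rough illustration of the similarity and overlap steps in this abstract, the sketch below scores a user's dataset against other users' datasets and then computes the overlap with each of them. The abstract does not name a similarity metric; Jaccard similarity over key sets is an assumption made here for illustration, and the function names and sample data are hypothetical.

```python
from typing import Dict, Hashable, Iterable, Set


def jaccard(a: Iterable[Hashable], b: Iterable[Hashable]) -> float:
    """Jaccard similarity of two key collections (assumed metric, not from the patent)."""
    set_a, set_b = set(a), set(b)
    union = set_a | set_b
    return len(set_a & set_b) / len(union) if union else 0.0


def similarity_scores(user: Set[str], others: Dict[str, Set[str]]) -> Dict[str, float]:
    """Score the user's dataset against every other dataset."""
    return {name: jaccard(user, keys) for name, keys in others.items()}


def overlap(user: Set[str], other: Set[str]) -> Set[str]:
    """Keys present in both datasets, standing in for the result of an overlap query."""
    return user & other


# Usage with made-up customer keys.
user_keys = {"a@x.com", "b@x.com", "c@x.com"}
other_datasets = {
    "retailer": {"b@x.com", "c@x.com", "d@x.com"},
    "airline": {"z@x.com"},
}
scores = similarity_scores(user_keys, other_datasets)
best = max(scores, key=scores.get)
shared = overlap(user_keys, other_datasets[best])
```

In a distributed database the overlap would be expressed as a join between the two datasets rather than an in-memory set intersection; the set form is used here only to keep the example self-contained.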
-
Publication No.: US12007990B1
Publication date: 2024-06-11
Application No.: US18447934
Application date: 2023-08-10
Applicant: Snowflake Inc.
Inventor: Nikolaos Romanos Katsipoulakis, Dimitrios Tsirogiannis, William Waddington, Zhaohui Zhang
IPC: G06F16/00, G06F16/2453, G06F16/27
CPC classification number: G06F16/24537, G06F16/24542, G06F16/27
Abstract: The subject technology receives a query, the query including a set of statements, the set of statements including a first statement comprising a first data manipulation language operation. The subject technology determines that the set of statements includes at least one statement defining a first constraint, and a second statement indicating that the first constraint is statement deferrable for performing validation of the first constraint. The subject technology generates a key value constraint check operator as a node in a query plan. The subject technology generates a key value DML operator as a second node in the query plan, the key value DML operator corresponding to the first DML operation. The subject technology performs, using the key value constraint check operator, a validation process of the first constraint based at least in part on a set of violating keys of each DML operation from the set of statements.