-
公开(公告)号:US20220292213A1
公开(公告)日:2022-09-15
申请号:US17644732
申请日:2021-12-16
Applicant: Snowflake Inc.
Inventor: Monica J. Holboke , Justin Langseth , Stuart Ozer , William L. Stratton, JR.
Abstract: A secure machine learning system of a database system can be implemented to use secure shared data to train a machine learning model. To manage the model, a first user of the database can share data in an encrypted view with a second user of the database, and further share one or more functions of an application that accesses the data while the data is encrypted. The second user can access functions of the application and can call the functions to generate a trained machine learning model and further generate machine learning outputs (e.g., predictions) from the trained model.
-
公开(公告)号:US20220292111A1
公开(公告)日:2022-09-15
申请号:US17832361
申请日:2022-06-03
Applicant: Snowflake, Inc.
Inventor: Ashish Motivala , Benoit Dageville
IPC: G06F16/27 , G06F9/50 , G06F16/14 , G06F16/21 , G06F16/22 , G06F16/951 , G06F16/182 , G06F16/23 , G06F16/2455 , G06F16/2458 , G06F16/9535 , G06F16/2453 , H04L67/568 , A61F5/56 , G06F9/48 , H04L67/1095 , H04L67/1097
Abstract: Example systems and methods for cloning catalog objects are described. In one implementation, a method identifies an original catalog object associated with data and creates a duplicate copy of the original catalog object without copying the data itself. The method allows access to the data using the duplicate catalog object and supports modifying the data associated with the original catalog object independently of the duplicate catalog object. The duplicate catalog object can be deleted upon completion of modifying the data associated with the original catalog object.
-
公开(公告)号:US20220292109A1
公开(公告)日:2022-09-15
申请号:US17827310
申请日:2022-05-27
Applicant: SNOWFLAKE INC.
Inventor: Ashish Motivala , Benoit Dageville
IPC: G06F16/27 , G06F9/50 , G06F16/14 , G06F16/21 , G06F16/22 , G06F16/951 , G06F16/182 , G06F16/23 , G06F16/2455 , G06F16/2458 , G06F16/9535 , G06F16/2453 , H04L67/568 , G06F9/48 , H04L67/1095 , H04L67/1097
Abstract: Example systems and methods for cloning catalog objects are described. In one implementation, a method includes copying first metadata of a first catalog object to create second metadata of a second catalog object such that a second portion of the second metadata and a first portion of the first metadata are associated with same data of a same data file, and generating a response to a data storage or data retrieval request directed to the first catalog object by accessing the data file using the second metadata of the second catalog object.
-
公开(公告)号:US20220292098A1
公开(公告)日:2022-09-15
申请号:US17804630
申请日:2022-05-31
Applicant: Snowflake Inc.
Inventor: Max Heimel , Ismail Oukid , Linnea Passing , Stefan Richter , Juliane K. Waack
IPC: G06F16/2455 , G06F16/9035 , G06F16/28 , G06F17/18 , G06F16/22
Abstract: A query directed at a table organized into a set of batch units is received. The query comprises a predicate for which values are unknown prior to runtime. A set of values for the predicate are determined based on the query. An index access plan is created based on the set of values. Based on the index access plan, the set of batch units are pruned using a pruning index associated with the table. The pruning index comprises a set of filters that index distinct values in each column of the table. The pruning of the set of batch units comprises identifying a subset of batch units to scan for data that satisfies the query. The subset of batch units of the table are scanned to identify data that satisfies the query.
-
公开(公告)号:US20220292071A1
公开(公告)日:2022-09-15
申请号:US17804674
申请日:2022-05-31
Applicant: Snowflake Inc.
Inventor: Andrey Fedorov , Maria Polyakova , Omer Singer
IPC: G06F16/23 , G06F16/2455 , G06F9/54
Abstract: Disclosed herein are systems and methods for providing percentile-based alerts from historical event-occurrence data. An embodiment of a method includes identifying a baseline result of an aggregation operation performed with respect to a detection set of attributes of event records. Each baseline result corresponds to a baseline set of one or more event records from a baseline time period within a baseline time window. The method includes determining a subject result of the aggregation operation performed with respect to the detection set of attributes of a subject set of the event records corresponding to a subject time period. The method includes determining, based on the baseline result and the subject result, whether the subject result indicates an alert condition for the detection set for the subject time period, and if so then outputting one or more alerts.
-
公开(公告)号:US11436253B2
公开(公告)日:2022-09-06
申请号:US17517935
申请日:2021-11-03
Applicant: Snowflake Inc.
Inventor: Sebastian Breß , Moritz Eyssen , Max Heimel
IPC: G06F16/00 , G06F16/27 , G06F16/2455
Abstract: A global and local row count limit associated with a limit query are received by a stop operator of a first execution node among a set of execution nodes that are assigned to process the limit query. Local distributed row count data is generated based on a local row count corresponding to a number of rows output by the first execution node in processing the query. Based on determining the local row count satisfies the local limit, the first execution node buffers rows produced in processing the query. The local distributed row count data is updated based on remote distributed row count data received from a second execution node. A stopping condition is detected based on determining the global limit is satisfied based on updated local distributed row count data and query processing by the first execution node based on detecting the stopping condition.
-
公开(公告)号:US20220269676A1
公开(公告)日:2022-08-25
申请号:US17663248
申请日:2022-05-13
Applicant: Snowflake Inc.
Inventor: Jeffrey Rosen , Abdul Munir , Jiaqi Yan , William Waddington , Prasanna Rajaperumal , Thierry Cruanes
IPC: G06F16/2458 , G06F16/2453 , G06F9/50 , G06F16/2455
Abstract: Resource provisioning systems and methods are described. In an embodiment, a system includes a plurality of shared storage devices collectively storing database data, an execution platform, and a compute service manager. The compute service manager is configured to determine a task to be executed in response to a trigger event and determine a query plan for executing the task, wherein the query plan comprises a plurality of discrete subtasks. The compute service manager is further configured to assign the plurality of discrete subtasks to one or more nodes of a plurality of nodes of the execution platform, determine whether execution of the task is complete, and in response to determining the execution of the task is complete, store a record in the plurality of shared storage devices indicating the task was completed.
-
公开(公告)号:US11423081B1
公开(公告)日:2022-08-23
申请号:US17390969
申请日:2021-08-01
Applicant: Snowflake Inc.
Inventor: Elliott Brossard , Sukruth Komarla Sukumar , Isaac Kunen , Ju-Yi Kuo , Jonathan Leang , Edward Ma , Schuyler James Manchester , Polita Paulus , Saurin Shah , Igor Zinkovsky
IPC: G06F16/00 , G06F16/901 , G06F16/955 , G06F16/2455 , G06F16/22 , G06F16/908
Abstract: A file access system for user defined functions (UDFs) can be implemented on a distributed database system. The system can store UDF signatures and interfaces (e.g., classes, sub-classes) that can be called by other users. Upon a UDF being called, one or more interface objects (e.g., InputStream) can be created and requests transferred to a execution node via a network channel. The execution node can implement multiple threads that are authorized and download file data from a staging location (e.g., internal stage, external stage) concurrently.
-
公开(公告)号:US20220237231A1
公开(公告)日:2022-07-28
申请号:US17657548
申请日:2022-03-31
Applicant: Snowflake Inc.
Inventor: Elliott Brossard , Sukruth Komarla Sukumar , Isaac Kunen , Ju-yi Kuo , Jonathan Lee Leang , Edward Ma , Schuyler James Manchester , Polita Paulus , Saurin Shah , Igor Zinkovsky
IPC: G06F16/901 , G06F16/955 , G06F16/2455 , G06F16/22 , G06F16/908
Abstract: A file access system for user defined functions (UDFs) can be implemented on a distributed database system. The system can store UDF interfaces and file reference objects that can be called by other users. Upon a UDF being called, files on a stage, one or more interface objects (e.g., InputStream), and file reference objects can be implemented by execution nodes of the distributed database system. The execution nodes can implement multiple threads that are authenticated and can download file data from a staging location concurrently.
-
公开(公告)号:US20220237192A1
公开(公告)日:2022-07-28
申请号:US17157233
申请日:2021-01-25
Applicant: Snowflake Inc.
Inventor: Qiming Jiang , Orestis Kostakis
IPC: G06F16/2453 , G06N20/00 , G06F16/2455
Abstract: The subject technology receives a query directed to a set of source tables, each source table organized into a set of micro-partitions. The subject technology determines a set of metadata, the set of metadata comprising table metadata, query metadata, and historical data related to the query. The subject technology predicts, using a machine learning model, an indicator of an amount of computing resources for executing the query based at least in part on the set of metadata. The subject technology generates a query plan for executing the query based at least in part on the predicted indicator of the amount of computing resources. The subject technology executes the query based at least in part on the query plan.
-
-
-
-
-
-
-
-
-