-
Publication No.: US11599512B1
Publication date: 2023-03-07
Application No.: US17938401
Filing date: 2022-10-06
Applicant: Snowflake Inc.
Inventor: Yucan Liu
Abstract: Systems and methods for inferring a schema for a text file are provided. The systems and methods perform operations including: accessing a file comprising a plurality of textual records, each textual record of the plurality of textual records being associated with one or more columns of data; sampling a set of textual records from the plurality of textual records; obtaining a hierarchy comprising a plurality of levels of schema types; determining whether an individual column of the one or more columns of data corresponding to the set of textual records is successfully associated with a first level of the plurality of levels of the schema types and, in response, associating a schema type represented by the first level with the individual column of the one or more columns of data corresponding to the plurality of textual records.
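The sample-then-match flow in this abstract can be illustrated with a minimal sketch. Everything named here is an assumption made for illustration, not taken from the patent: the four-level hierarchy (INTEGER, FLOAT, DATE, STRING), the helper names, and the choice to sample the first N records.

```python
from datetime import date

def _parses(parser):
    """Wrap a parser so it reports success or failure instead of raising."""
    def check(text):
        try:
            parser(text)
            return True
        except ValueError:
            return False
    return check

# Hypothetical hierarchy, most specific level first; STRING accepts anything.
HIERARCHY = [
    ("INTEGER", _parses(int)),
    ("FLOAT", _parses(float)),
    ("DATE", _parses(date.fromisoformat)),
    ("STRING", lambda text: True),
]

def infer_column_type(sample_values):
    """Return the first level whose parser accepts every sampled value."""
    for type_name, accepts in HIERARCHY:
        if all(accepts(value) for value in sample_values):
            return type_name
    return "STRING"

def infer_schema(records, sample_size=100):
    """Infer one type per column from a sample (assumed: the first N records)."""
    sample = records[:sample_size]
    return [infer_column_type(column) for column in zip(*sample)]
```

For example, `infer_schema([["1", "2.5", "2023-03-07", "a"], ["2", "3", "2023-02-28", "b"]])` walks each column down the hierarchy until one level accepts all sampled values.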
-
Publication No.: US11593521B1
Publication date: 2023-02-28
Application No.: US17650032
Filing date: 2022-02-04
Applicant: Snowflake Inc.
Inventor: Raja Suresh Krishna Balakrishnan, Khalid Zaman Bijon, Subramanian Muralidhar, David Schultz
IPC: G06F21/62, G06F16/245, G06F16/28
Abstract: Various embodiments provide for tag-based application of a masking policy, which can be used in connection with a data platform. In particular, various embodiments enable enforcement of one or more masking policies against an entity (e.g., object) of a data platform, such as a database, a table, a row, or a column, based on one or more tags associated with the entity.
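As a rough illustration of tag-based policy application, the sketch below masks a column value whenever one of the column's tags has a policy attached. The tag name `pii:email`, the `mask_email` function, and the dictionary-based layout are hypothetical and are not the data platform's actual interface.

```python
def mask_email(value):
    """Hypothetical masking function: hide the local part of an address."""
    return "***@" + value.split("@")[-1]

def mask_row(row, column_tags, policy_by_tag):
    """Apply every policy whose tag is attached to a column of the row."""
    masked = {}
    for column, value in row.items():
        # Sorted so that multiple tags are applied in a deterministic order.
        for tag in sorted(column_tags.get(column, ())):
            policy = policy_by_tag.get(tag)
            if policy is not None:
                value = policy(value)
        masked[column] = value
    return masked
```

The policy is attached to the tag rather than to the column, so tagging another column with `pii:email` would mask it with no further configuration.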
-
Publication No.: US11593379B2
Publication date: 2023-02-28
Application No.: US17804630
Filing date: 2022-05-31
Applicant: Snowflake Inc.
Inventor: Max Heimel, Ismail Oukid, Linnea Passing, Stefan Richter, Juliane K. Waack
IPC: G06F16/24, G06F16/2455, G06F16/9035, G06F16/28, G06F17/18, G06F16/22
Abstract: A query directed at a table organized into a set of batch units is received. The query comprises a predicate for which values are unknown prior to runtime. A set of values for the predicate are determined based on the query. An index access plan is created based on the set of values. Based on the index access plan, the set of batch units are pruned using a pruning index associated with the table. The pruning index comprises a set of filters that index distinct values in each column of the table. The pruning of the set of batch units comprises identifying a subset of batch units to scan for data that satisfies the query. The subset of batch units of the table are scanned to identify data that satisfies the query.
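The prune-then-scan sequence described here can be sketched as follows. This is a simplified stand-in: it uses exact Python sets where the abstract speaks only of "filters", and the function names and row layout (a list of dicts per batch unit) are assumptions for illustration.

```python
def build_pruning_index(batch_units, column):
    """One filter per batch unit, holding the distinct values of a column."""
    return [{row[column] for row in unit} for unit in batch_units]

def prune(index, values):
    """Return positions of batch units that may contain any of the values."""
    return [i for i, unit_filter in enumerate(index)
            if not unit_filter.isdisjoint(values)]

def scan(batch_units, index, column, runtime_values):
    """Scan only the surviving batch units for rows matching the predicate."""
    values = set(runtime_values)  # predicate values known only at runtime
    matches = []
    for i in prune(index, values):
        matches.extend(row for row in batch_units[i] if row[column] in values)
    return matches
```

With exact sets, pruning never keeps a batch unit that has no match. A probabilistic filter would trade that exactness for a smaller index, and the scan step would then discard any false positives.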
-
Publication No.: US11593345B1
Publication date: 2023-02-28
Application No.: US17648621
Filing date: 2022-01-21
Applicant: Snowflake Inc.
Inventor: Tyler Arthur Akidau, Istvan Cseri, Fabian Hueske, Tyler Jones, Yevgeniy Kogan, Dzmitry Pauliukevich, Daniel E. Sotolongo
IPC: G06F16/23, G06F16/2455, G06F16/22
Abstract: Techniques described herein can accelerate change data capture determinations such as stream reads, which show changes made to a table between two points in time. Three distinct row bitsets that mark deleted, updated, and inserted rows in micro-partitions can be added as metadata for the table. These bitsets can be generated during DML operations and then stored as metadata of the new partition generated by the DML operations. The bitsets can then be used to generate streams showing the changes in the table between two points in time (changes interval).
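A minimal sketch of reading changes from per-partition bitsets is shown below. The `Partition` class, the integer version numbers standing in for points in time, and the use of Python integers as bitsets are all assumptions; the abstract does not specify how deleted rows are represented in a new partition, so this sketch simply assumes the partition retains them.

```python
from dataclasses import dataclass

@dataclass
class Partition:
    """Hypothetical micro-partition with three row bitsets as metadata."""
    version: int        # table version at which the DML created it
    rows: list
    inserted: int = 0   # bit i set means rows[i] was inserted by that DML
    updated: int = 0
    deleted: int = 0

def rows_for(bits, rows):
    """Select the rows whose bit is set in an integer bitset."""
    return [row for i, row in enumerate(rows) if (bits >> i) & 1]

def read_stream(partitions, start, end):
    """Collect changes from partitions created in the interval (start, end]."""
    changes = {"inserted": [], "updated": [], "deleted": []}
    for part in partitions:
        if start < part.version <= end:
            changes["inserted"] += rows_for(part.inserted, part.rows)
            changes["updated"] += rows_for(part.updated, part.rows)
            changes["deleted"] += rows_for(part.deleted, part.rows)
    return changes
```

Because the bitsets are written once, during the DML operation, a stream read only has to test bits rather than compare partition contents.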
-
Publication No.: US20230059980A1
Publication date: 2023-02-23
Application No.: US17656581
Filing date: 2022-03-25
Applicant: Snowflake Inc.
Abstract: Techniques described herein can monitor various data metrics. The auto-insight techniques can further detect and rank data segments that contributed to, or counteracted, shifts in data and detect when such shifts occurred. Thus, the techniques described herein can detect and identify root causes in shifts in different metrics. The techniques include pruning and ranking causes to identify the root causes and identify non-relevant factors, as well.
-
Publication No.: US11580251B1
Publication date: 2023-02-14
Application No.: US17519729
Filing date: 2021-11-05
Applicant: SNOWFLAKE INC.
Inventor: Boxin Jiang, Qiming Jiang
IPC: G06F16/2455, G06F21/62
Abstract: Embodiments of the present disclosure describe systems, methods, and computer program products for redacting sensitive data within a database. An example method can include receiving a data query referencing unredacted data of a database; responsive to the data query, executing, by a processing device, a redaction operation to identify sensitive data within the unredacted data of the database; and returning, in response to the data query, a redacted data set in which the sensitive data is replaced or removed.
-
Publication No.: US11574072B2
Publication date: 2023-02-07
Application No.: US17334315
Filing date: 2021-05-28
Applicant: Snowflake Inc.
Inventor: Artin Avanes, Khalid Zaman Bijon, Damien Carru, Thierry Cruanes, Vikas Jain, Zheng Mi, Subramanian Muralidhar
IPC: G06F21/62, G06F16/25, G06F16/248, G06F16/22, G06F16/27
Abstract: A shared database platform implements dynamic masking on data shared between users where specific data is masked, transformed, or otherwise modified based on preconfigured functions that are associated with user roles. The shared database platform can implement the masking at runtime dynamically in response to users requesting access to a database object that is associated with one or more masking policies.
-
Publication No.: US11573978B2
Publication date: 2023-02-07
Application No.: US17832361
Filing date: 2022-06-03
Applicant: Snowflake Inc.
Inventor: Ashish Motivala, Benoit Dageville
IPC: G06F16/00, G06F16/27, G06F9/50, G06F16/14, G06F16/21, G06F16/22, G06F16/951, G06F16/182, G06F16/23, G06F16/2455, G06F16/2458, G06F16/9535, G06F16/2453, H04L67/568, G06F9/48, H04L67/1095, H04L67/1097
Abstract: Example systems and methods for cloning catalog objects are described. In one implementation, a method identifies an original catalog object associated with data and creates a duplicate copy of the original catalog object without copying the data itself. The method allows access to the data using the duplicate catalog object and supports modifying the data associated with the original catalog object independently of the duplicate catalog object. The duplicate catalog object can be deleted upon completion of modifying the data associated with the original catalog object.
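Cloning a catalog object without copying its data can be pictured with the sketch below. The `CatalogObject` class and the choice of immutable tuples as data files are hypothetical; the point illustrated is that the clone shares file references with the original, and a later modification swaps a reference on one object without touching the other.

```python
class CatalogObject:
    """Hypothetical catalog entry: a name plus references to immutable files."""

    def __init__(self, name, files):
        self.name = name
        self.files = list(files)  # new list of references, same file objects

    def clone(self, name):
        """Duplicate the catalog entry only; no file contents are copied."""
        return CatalogObject(name, self.files)

    def rewrite_file(self, index, new_file):
        """Modify data by pointing at a new file instead of editing in place."""
        self.files[index] = new_file


original = CatalogObject("orders", [("r1", "r2"), ("r3",)])
duplicate = original.clone("orders_clone")
original.rewrite_file(0, ("r1", "r2-changed"))
```

After the rewrite, `duplicate.files[0]` still refers to the untouched `("r1", "r2")` file, while `original.files[0]` refers to the new one.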
-
Publication No.: US11570259B2
Publication date: 2023-01-31
Application No.: US17661096
Filing date: 2022-04-28
Applicant: Snowflake Inc.
Inventor: Khalid Zaman Bijon, Damien Carru, Christopher Peter Child, Eric Karlson, Zheng Mi
IPC: H04L67/306, G06F9/54, H04L9/40, G06F21/31, H04L67/02, H04L41/50, H04L41/5041, H04L67/10, H04L67/1097, H04L67/59, H04L67/60
Abstract: Embodiments of the present disclosure may provide a streamlined process for performing operations, such as data sharing and data replication, using multiple accounts. A global identity (also referred to as an organization user) may be employed, where the global identity may have access to multiple accounts across the same or different deployments. The global identity may switch between accounts from its login session and perform various tasks in the context of different accounts without undergoing further authentication.
-
Publication No.: US11550845B2
Publication date: 2023-01-10
Application No.: US17657548
Filing date: 2022-03-31
Applicant: Snowflake Inc.
Inventor: Elliott Brossard, Sukruth Komarla Sukumar, Isaac Kunen, Ju-yi Kuo, Jonathan Lee Leang, Edward Ma, Schuyler James Manchester, Polita Paulus, Saurin Shah, Igor Zinkovsky
IPC: G06F16/00, G06F16/901, G06F16/955, G06F16/2455, G06F16/22, G06F16/908
Abstract: A file access system for user defined functions (UDFs) can be implemented on a distributed database system. The system can store UDF interfaces and file reference objects that can be called by other users. Upon a UDF being called, files on a stage, one or more interface objects (e.g., InputStream), and file reference objects can be implemented by execution nodes of the distributed database system. The execution nodes can implement multiple threads that are authenticated and can download file data from a staging location concurrently.