Schema inference for files
    481.
    发明授权

    公开(公告)号:US11599512B1

    公开(公告)日:2023-03-07

    申请号:US17938401

    申请日:2022-10-06

    Applicant: Snowflake Inc.

    Inventor: Yucan Liu

    Abstract: Systems and methods for inferring a schema for a text file are provided. The systems and methods perform operations including: accessing a file comprising a plurality of textual records, each textual record of the plurality of textual records being associated with one or more columns of data; sampling a set of textual records from the plurality of textural records; obtaining a hierarchy comprising a plurality of levels of schema types; determining whether an individual column of the one or more columns of data corresponding to the set of textual records is successfully associated with a first level of the plurality of levels of the schema types and, in response, associating a schema type represented by the first level with the individual column of the one or more columns of data corresponding to the plurality of textual records.

    Join query processing using pruning index

    公开(公告)号:US11593379B2

    公开(公告)日:2023-02-28

    申请号:US17804630

    申请日:2022-05-31

    Applicant: Snowflake Inc.

    Abstract: A query directed at a table organized into a set of batch units is received. The query comprises a predicate for which values are unknown prior to runtime. A set of values for the predicate are determined based on the query. An index access plan is created based on the set of values. Based on the index access plan, the set of batch units are pruned using a pruning index associated with the table. The pruning index comprises a set of filters that index distinct values in each column of the table. The pruning of the set of batch units comprises identifying a subset of batch units to scan for data that satisfies the query. The subset of batch units of the table are scanned to identify data that satisfies the query.

    AUTO INSIGHTS INTO DATA CHANGES
    485.
    发明申请

    公开(公告)号:US20230059980A1

    公开(公告)日:2023-02-23

    申请号:US17656581

    申请日:2022-03-25

    Applicant: Snowflake Inc.

    Abstract: Techniques described herein can monitor various data metrics. The auto-insight techniques can further detect and rank data segments that contributed to, or counteracted, shifts in data and detect when such shifts occurred. Thus, the techniques described herein can detect and identify root causes in shifts in different metrics. The techniques include pruning and ranking causes to identify the root causes and identify non-relevant factors, as well.

    Query-based database redaction
    486.
    发明授权

    公开(公告)号:US11580251B1

    公开(公告)日:2023-02-14

    申请号:US17519729

    申请日:2021-11-05

    Applicant: SNOWFLAKE INC.

    Abstract: Embodiments of the present disclosure describe systems, methods, and computer program products for redacting sensitive data within a database. An example method can include receiving a data query referencing unredacted data of a database, responsive to the data query, executing, by a processing device, a redaction operation to identify sensitive data within the unredacted data of the database, and returning a redacted data set in which the sensitive data is replaced or removed to the data query.

Patent Agency Ranking