Index generation using lazy reassembling of semi-structured data

    Publication No.: US20230139194A1

    Publication Date: 2023-05-04

    Application No.: US18146912

    Application Date: 2022-12-27

    Applicant: Snowflake Inc.

    Abstract: A pruning index is generated for a source table organized into a set of batch units. The source table comprises a column of semi-structured data. The pruning index comprises a set of filters that index distinct values in each column of the source table. Rather than reassembling an entire tree structure of the semi-structured data prior to indexing, the generating of the pruning index comprises traversing a reassembly hook object that represents a first portion of the semi-structured data that is subcolumnarized and traversing a residual object that represents a second portion of the semi-structured data that is not subcolumnarized. The reassembly hook object is traversed to identify values corresponding to the first portion of the semi-structured data and the residual object is traversed to identify values corresponding to the second portion. The pruning index is stored with an association with the source table.
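
    The indexing idea in this abstract can be illustrated with a minimal sketch. It is not the patented implementation: the batch-unit layout (`subcolumns`, `residuals`), the function names, and the use of a plain Python set in place of a real filter structure are all assumptions made for illustration. The point it shows is that values are collected by walking the subcolumnarized part and the residual part separately, without first rebuilding the full document tree.

```python
from typing import Any, Dict, Iterator, List, Set


def iter_subcolumn_values(subcolumns: Dict[str, List[Any]]) -> Iterator[Any]:
    """Yield values from the subcolumnarized part (hypothetical path -> values map)."""
    for values in subcolumns.values():
        for value in values:
            if value is not None:
                yield value


def iter_residual_values(node: Any) -> Iterator[Any]:
    """Walk the residual (non-subcolumnarized) part and yield its leaf values."""
    if isinstance(node, dict):
        for child in node.values():
            yield from iter_residual_values(child)
    elif isinstance(node, list):
        for child in node:
            yield from iter_residual_values(child)
    elif node is not None:
        yield node


def build_pruning_index(batch_units: List[Dict[str, Any]]) -> List[Set[Any]]:
    """Build one filter per batch unit; a set stands in for the real filter (assumption)."""
    index: List[Set[Any]] = []
    for unit in batch_units:
        seen: Set[Any] = set()
        seen.update(iter_subcolumn_values(unit["subcolumns"]))
        for residual in unit["residuals"]:
            seen.update(iter_residual_values(residual))
        index.append(seen)
    return index


def units_to_scan(index: List[Set[Any]], value: Any) -> List[int]:
    """Return the batch units whose filter may contain `value`; the rest are pruned."""
    return [i for i, unit_filter in enumerate(index) if value in unit_filter]
```

    A query with an equality predicate would call `units_to_scan` first and read only the batch units it returns.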

    Certificate revocation check proxy service

    Publication No.: US20230138608A1

    Publication Date: 2023-05-04

    Application No.: US17823922

    Application Date: 2022-08-31

    Applicant: Snowflake Inc.

    Abstract: A proxy revocation service provides a reliable service for performing revocation checks. The proxy revocation service queries public certificate authorities for the revocation status of a set of digital certificates and maintains a database of the revocation statuses. The proxy revocation service provides a singular endpoint that is Application Protocol Interface (API) accessible to web clients. Web clients communicate with the proxy revocation service through API messages to perform revocation checks, rather than communicating with the public certificate authorities using an online certificate status protocol (OCSP). Use of the proxy revocation service both provides a reliable service for performing revocation checks and shifts the complexity away from the web clients.

    Shared object discovery techniques
    Type: Patent application

    Publication No.: US20230135712A1

    Publication Date: 2023-05-04

    Application No.: US17980427

    Application Date: 2022-11-03

    Applicant: Snowflake Inc.

    Abstract: A consumer account may invoke an operation referencing a set of shared objects stored within a database of a provider account using an imported database that makes the set of shared objects available within the consumer account. A call context of the operation may be updated to cache the imported database, which references a share created from the provider account database, the share having grants to the set of shared objects. One or more database level objects may be discovered in a context of the share and each role granted to the share may be obtained based on the one or more database level objects. Whether any role granted to the share has access to any of the set of shared objects may be determined and the operation may be executed for each of the set of shared objects to which any role granted to the share has access.

    Automated query retry execution in a database system

    Publication No.: US11640347B2

    Publication Date: 2023-05-02

    Application No.: US17809780

    Application Date: 2022-06-29

    Applicant: Snowflake Inc.

    Abstract: Techniques for automated query retry in a database platform include decoding, by at least one hardware processor, a query directed to database data from a client account of a database platform. The method further includes decoding, by the at least one hardware processor, an indication that execution of the query on at least one computing node of the database platform results in a failed execution. The method further includes configuring a processing loop with continuous retry executions of the query on the at least one computing node based on the indication. The method further includes exiting the processing loop based on detecting a retry execution of the continuous retry executions results in at least one successful execution of the query. The method includes logging each attempt to execute the query during the continuous retry executions in a query status log until the at least one successful execution of the query.
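
    The retry loop described in this abstract can be sketched as follows. This is an illustration under stated assumptions, not the claimed method: `QueryStatusLog`, `run_with_retry`, and the `max_attempts` bound are hypothetical names and parameters introduced here. The abstract itself only describes retrying until a successful execution and logging each attempt.

```python
from dataclasses import dataclass, field
from typing import Callable, List, Optional


@dataclass
class QueryStatusLog:
    """Hypothetical in-memory stand-in for the query status log."""
    entries: List[str] = field(default_factory=list)

    def record(self, attempt: int, outcome: str) -> None:
        self.entries.append(f"attempt={attempt} outcome={outcome}")


def run_with_retry(execute: Callable[[], object],
                   log: QueryStatusLog,
                   max_attempts: int = 5) -> object:
    """Re-run `execute` until it succeeds, logging every attempt.

    max_attempts is a safety bound added for this sketch (assumption);
    the abstract describes exiting the loop on the first success.
    """
    last_error: Optional[Exception] = None
    for attempt in range(1, max_attempts + 1):
        try:
            result = execute()
        except Exception as exc:  # a failed execution triggers another retry
            log.record(attempt, f"failed: {exc}")
            last_error = exc
            continue
        log.record(attempt, "succeeded")
        return result  # exit the loop on the first successful execution
    raise RuntimeError(f"query failed after {max_attempts} attempts") from last_error
```

    Passing the query as a callable keeps the loop independent of how a compute node actually runs it.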

    Multi-level data for database systems

    Publication No.: US11636114B2

    Publication Date: 2023-04-25

    Application No.: US17123551

    Application Date: 2020-12-16

    Applicant: Snowflake Inc.

    Abstract: The subject technology receives first metadata corresponding to a set of micro-partitions. The subject technology stores a first data structure and a second data structure in storage as a first file and a second file, the first data structure including the first metadata and the second data structure including second metadata, the second metadata being for a grouping of the first metadata, and the second data structure including information associating the second metadata with the first metadata. The subject technology stores third metadata for a table, the third metadata comprising information about data stored in a micro-partition of the table.

    Data replication and data failover in database systems

    Publication No.: US11630845B2

    Publication Date: 2023-04-18

    Application No.: US17703804

    Application Date: 2022-03-24

    Applicant: Snowflake Inc.

    Abstract: Replication and failover of database data is disclosed. A method includes copying a first dataset of a first account stored in a primary deployment to a secondary deployment. The method includes determining that the first dataset stored in the primary deployment is unavailable. The method includes executing a first transaction on the first dataset at the secondary deployment in response to determining that the first dataset stored in the primary deployment is unavailable. The method includes executing a second transaction on a second dataset of a second account at the primary deployment while the first dataset of the first account stored in the primary deployment is unavailable.

    Client side certificate revocation service

    Publication No.: US11621859B1

    Publication Date: 2023-04-04

    Application No.: US17823928

    Application Date: 2022-08-31

    Applicant: Snowflake Inc.

    Abstract: A proxy revocation service provides a reliable service for performing revocation checks. The proxy revocation service queries public certificate authorities for the revocation status of a set of digital certificates and maintains a database of the revocation statuses. The proxy revocation service provides a singular endpoint that is Application Protocol Interface (API) accessible to web clients. Web clients communicate with the proxy revocation service through API messages to perform revocation checks, rather than communicating with the public certificate authorities using an online certificate status protocol (OCSP). Use of the proxy revocation service both provides a reliable service for performing revocation checks and shifts the complexity away from the web clients.

    Secure data comparison using data clean rooms

    Publication No.: US11620409B2

    Publication Date: 2023-04-04

    Application No.: US17932610

    Application Date: 2022-09-15

    Applicant: Snowflake Inc.

    Abstract: Disclosed herein are methods and systems for secure data comparison using data clean rooms. In an embodiment, a computer system generates a replica database based on a provider database, which stores a cross reference table that cross references a client dataset of a client database and a provider dataset of the provider database. The system receives, at the replica database, a table that is generated by the client database using the cross-reference table. The system transmits, from the replica database, the table to the provider database. The system receives, at the replica database, a results dataset that is generated by the provider database by applying a database statement to the provider database using the table generated by the client database. The system shares, from the replica database, the results dataset with the client database.

    Multi-cluster warehouse
    Type: Granted patent

    Publication No.: US11620313B2

    Publication Date: 2023-04-04

    Application No.: US17118201

    Application Date: 2020-12-10

    Applicant: Snowflake Inc.

    Abstract: A method for a multi-cluster warehouse includes allocating a plurality of compute clusters as part of a virtual warehouse. The compute clusters are used to access and perform queries against one or more databases in one or more cloud storage resources. The method includes providing queries for the virtual warehouse to each of the plurality of compute clusters. Each of the plurality of compute clusters of the virtual warehouse receives a plurality of queries so that the computing load is spread across the different clusters. The method also includes dynamically adding compute clusters to and removing compute clusters from the virtual warehouse as needed based on a workload of the plurality of compute clusters.
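
    The load spreading and dynamic scaling described in this abstract can be sketched with a toy model. Everything below is assumed for illustration: the `VirtualWarehouse` class, the least-loaded routing rule, and the per-cluster load limit are not taken from the patent, which only states that queries are spread across clusters and that clusters are added and removed based on workload.

```python
from typing import List


class VirtualWarehouse:
    """Toy model: each list entry is the number of queries running on one cluster."""

    def __init__(self, min_clusters: int = 1, max_clusters: int = 4,
                 max_load_per_cluster: int = 2) -> None:
        self.min_clusters = min_clusters
        self.max_clusters = max_clusters
        self.max_load = max_load_per_cluster  # hypothetical scale-up threshold
        self.loads: List[int] = [0] * min_clusters

    def submit(self) -> int:
        """Route one query to the least-loaded cluster, adding a cluster if all are full."""
        if min(self.loads) >= self.max_load and len(self.loads) < self.max_clusters:
            self.loads.append(0)
        target = self.loads.index(min(self.loads))
        self.loads[target] += 1
        return target

    def finish(self, cluster: int) -> None:
        """Mark one query on `cluster` as done and remove idle trailing clusters."""
        self.loads[cluster] -= 1
        while len(self.loads) > self.min_clusters and self.loads[-1] == 0:
            self.loads.pop()
```

    Only trailing idle clusters are removed in this sketch, so the cluster indices returned by `submit` stay valid while queries are still running.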
