DATA PRUNING BASED ON METADATA
    Patent application

    Publication No.: US20200089655A1

    Publication date: 2020-03-19

    Application No.: US16688888

    Filing date: 2019-11-19

    Applicant: Snowflake Inc.

    Abstract: A system and method for pruning data based on metadata. The method may include receiving a query comprising a plurality of predicates and identifying one or more applicable files comprising database data satisfying at least one of the plurality of predicates. Identifying the one or more applicable files includes reading metadata stored in a metadata store that is separate from the database data. The method further includes pruning inapplicable files comprising database data that does not satisfy at least one of the plurality of predicates to create a reduced set of files, and reading the reduced set of files to execute the query.
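
The pruning step in this abstract can be illustrated with a minimal sketch. It is not taken from the patent. It assumes the metadata store keeps per-file minimum and maximum values for each column, and that the query's predicates are combined with AND, so a file can be skipped as soon as its metadata proves one predicate cannot match any row. The names `FileMeta`, `Predicate`, `may_match`, and `prune` are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class FileMeta:
    """Hypothetical metadata entry, kept apart from the data file itself."""
    name: str
    col_min: dict  # column name -> smallest value in the file
    col_max: dict  # column name -> largest value in the file


@dataclass
class Predicate:
    column: str
    op: str        # one of "<", "<=", ">", ">=", "=="
    value: float


def may_match(meta: FileMeta, pred: Predicate) -> bool:
    """Return False only when the min/max range proves no row can match."""
    lo = meta.col_min.get(pred.column)
    hi = meta.col_max.get(pred.column)
    if lo is None or hi is None:
        return True  # no statistics for this column: the file cannot be pruned safely
    if pred.op == "<":
        return lo < pred.value
    if pred.op == "<=":
        return lo <= pred.value
    if pred.op == ">":
        return hi > pred.value
    if pred.op == ">=":
        return hi >= pred.value
    if pred.op == "==":
        return lo <= pred.value <= hi
    raise ValueError(f"unsupported operator: {pred.op}")


def prune(metadata_store: list, predicates: list) -> list:
    """Return the names of files that still have to be read (assumes AND semantics)."""
    return [
        meta.name
        for meta in metadata_store
        if all(may_match(meta, pred) for pred in predicates)
    ]


store = [
    FileMeta("f1", {"x": 0}, {"x": 10}),
    FileMeta("f2", {"x": 20}, {"x": 30}),
    FileMeta("f3", {"y": 1}, {"y": 2}),  # carries no statistics for column x
]
reduced = prune(store, [Predicate("x", ">", 15)])
```

Only the metadata entries are consulted in `prune`; the data files are opened afterwards, and only those in the reduced set.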

    Multi-cluster warehouse
    Granted patent

    Publication No.: US12267390B2

    Publication date: 2025-04-01

    Application No.: US18617083

    Filing date: 2024-03-26

    Applicant: Snowflake Inc.

    Abstract: A method implementing a fault-tolerant data warehouse including allocating a plurality of processing units to a data warehouse, the processing units located in different availability zones, an availability zone comprising one or more data centers. The method further includes, as a result of monitoring a number of queries running at an input degree of parallelism on the plurality of processing units of the data warehouse, determining that the number of queries is serviceable by one fewer processing units. The method further includes routing a query from a first processing unit to a second processing unit within the data warehouse, the query having a common session identifier with another query previously provided to the second processing unit, the second processing unit determined to be caching a data segment associated with a cloud storage resource, usable by the query, and removing the first processing unit from the data warehouse.

    Active invalidation of metadata cache entries

    Publication No.: US12210527B2

    Publication date: 2025-01-28

    Application No.: US18128152

    Filing date: 2023-03-29

    Applicant: Snowflake Inc.

    Abstract: Techniques for active invalidation of cache entries are described. A first timestamp and a second timestamp are stored by a compute instance, the first timestamp corresponding to an initial time the computing instance read from a version update log in a database and the second timestamp corresponding to a last time the computing instance read from the version update log in the database. The compute instance can read records from the version update log that are associated with version timestamps greater than the second timestamp. The compute instance can invalidate a first set of cache entries with older versions stored in a cache based on reading the records, retaining a second set of cache entries, and then execute a query using the second set of cache entries.
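
The invalidation flow described in this abstract can be sketched roughly as follows. This is an illustration under stated assumptions, not the patented implementation: timestamps are plain integers, the version update log is an in-memory list, and each cache entry stores the version timestamp it was read at. `LogRecord`, `ComputeInstance`, and `refresh` are hypothetical names.

```python
from dataclasses import dataclass, field
from typing import Optional


@dataclass
class LogRecord:
    """Hypothetical row of the version update log."""
    key: str
    version_ts: int


@dataclass
class ComputeInstance:
    cache: dict = field(default_factory=dict)  # key -> (version_ts, value)
    first_read_ts: Optional[int] = None        # initial time the log was read
    last_read_ts: int = 0                      # most recent time the log was read

    def refresh(self, log: list, now: int) -> list:
        """Read log records newer than the last read and drop stale cache entries."""
        new_records = [r for r in log if r.version_ts > self.last_read_ts]
        invalidated = []
        for record in new_records:
            entry = self.cache.get(record.key)
            if entry is not None and entry[0] < record.version_ts:
                del self.cache[record.key]     # cached version is older than the log's
                invalidated.append(record.key)
        if self.first_read_ts is None:
            self.first_read_ts = now
        self.last_read_ts = now
        return invalidated


inst = ComputeInstance(cache={"a": (1, "A1"), "b": (5, "B5")})
log = [LogRecord("a", 3), LogRecord("b", 4), LogRecord("c", 6)]
first_pass = inst.refresh(log, now=10)
```

After `refresh`, the entries left in `inst.cache` play the role of the retained second set, which a query could then use without rereading the underlying metadata.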

    Serialization of data in a concurrent transaction processing distributed database

    Publication No.: US12189614B2

    Publication date: 2025-01-07

    Application No.: US18477834

    Filing date: 2023-09-29

    Applicant: Snowflake Inc.

    Abstract: The subject technology serializes, by at least one hardware processor, non-primary key data of column-organized data into compressed serialized value data that is in a row-organized sequence, the compressed serialized value data compressed using at least one bitmap, the non-primary key data comprising a schema identifier, the column-organized data being stored in a columnar database system, the column-organized data comprising primary key data and the non-primary key data. The subject technology stores the compressed serialized value data in a key-value data store of a key-value database system, the key-value database system processing key-value data in a key-value format. The subject technology receives a query by the columnar database system. The subject technology deserializes a portion of the compressed serialized value data that corresponds to the query. The subject technology processes the query using the columnar database system.
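
As a rough illustration of the serialization idea, the sketch below turns column-organized data into one row-organized value per primary key. The abstract does not say how the bitmap is used; this sketch assumes it is a presence bitmap that lets null values be omitted from the payload. It also assumes 64-bit integer column values, at most 16 non-key columns, and a plain dict standing in for the key-value store. The function names and the byte layout are hypothetical.

```python
import struct

HEADER = "<IH"  # little-endian: 4-byte schema identifier, 2-byte presence bitmap


def serialize_rows(schema_id: int, primary_keys: list, columns: dict):
    """Convert column-organized data into one row-organized blob per primary key."""
    names = sorted(columns)  # fixed column order shared by writer and reader
    kv_store = {}
    for i, key in enumerate(primary_keys):
        bitmap = 0
        payload = b""
        for bit, name in enumerate(names):
            value = columns[name][i]
            if value is not None:
                bitmap |= 1 << bit               # mark the column as present
                payload += struct.pack("<q", value)
        kv_store[key] = struct.pack(HEADER, schema_id, bitmap) + payload
    return names, kv_store


def deserialize_row(names: list, blob: bytes):
    """Rebuild one row, reading only the values the bitmap marks as present."""
    schema_id, bitmap = struct.unpack_from(HEADER, blob, 0)
    offset = struct.calcsize(HEADER)
    row = {}
    for bit, name in enumerate(names):
        if bitmap & (1 << bit):
            (row[name],) = struct.unpack_from("<q", blob, offset)
            offset += 8
        else:
            row[name] = None
    return schema_id, row


names, kv = serialize_rows(7, ["k1", "k2"], {"a": [1, None], "b": [None, 2]})
row_k1 = deserialize_row(names, kv["k1"])
```

A query that touches only `k1` deserializes just that value, which mirrors the abstract's step of deserializing only the portion that corresponds to the query.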

    FRAMEWORK TO REQUEST AND GRANT ACCESS TO PROTECTED RESOURCES

    Publication No.: US20240419817A1

    Publication date: 2024-12-19

    Application No.: US18478539

    Filing date: 2023-09-29

    Applicant: Snowflake Inc.

    Abstract: A data platform grants privileges to applications accessing resources of the data platform in a secure and efficient way. The data platform determines whether a privilege has been granted to an application and, if not, generates a validation of the request to grant the privilege using a manifest of the application. The data platform generates a grant privilege request user interface, presents the grant privilege user interface to a consumer of the data platform, receives a privilege grant authorization from the consumer, and grants the privilege to the application. The granted privilege is then used by the application to access the resource.
