OPTIMIZATIONS FOR LONG-LIVED STATEMENTS IN A DATABASE SYSTEM

    公开(公告)号:US20230244655A1

    公开(公告)日:2023-08-03

    申请号:US17649737

    申请日:2022-02-02

    Applicant: Snowflake Inc.

    CPC classification number: G06F16/2379 G06F16/2365 G06F9/524

    Abstract: The subject technology performs a search for a key in a regular space to locate a first visible version of the key. The subject technology determines that the first visible version of the key is not one of a N number of newest versions of the key. The subject technology performs a search of an undo space to locate a second visible version of the key. The subject technology determines whether the first visible version or the second visible version of the key is newer. The subject technology provides a newer version of the key between the first visible version and the second visible version of the key.

    Caching query plans in database systems

    公开(公告)号:US11645281B1

    公开(公告)日:2023-05-09

    申请号:US17823310

    申请日:2022-08-30

    Applicant: Snowflake Inc.

    CPC classification number: G06F16/24542 G06F16/24539

    Abstract: The subject technology receives a query, the query including a set of statements for performing the query. The subject technology populates a compilation context based at least in part the query. The subject technology invokes a compiler to perform a compilation process based on the compilation context. The subject technology performs a lookup operation on a stored plan cache for an exact match based on information from the compilation context. The subject technology, in response to determining an exact match, determines whether the particular query plan requires re-compilation based on a data dependent optimization. The subject technology determines whether a plan cache entry corresponding to the particular query plan includes a data property constraint. The subject technology determines whether the data property constraint still holds based on a set of data properties.

    MANAGING CONCURRENT TRANSACTIONS IN DATABASE SYSTEMS

    公开(公告)号:US20230011046A1

    公开(公告)日:2023-01-12

    申请号:US17821670

    申请日:2022-08-23

    Applicant: Snowflake Inc.

    Abstract: The subject technology receives a first transaction, the first transaction to be executed on linearizable storage. The subject technology assigns, prior to performing a read operation, a first read version to the first transaction, the first read version indicating a first version of the linearizable storage, the first read version corresponding to a particular timestamp of a current time. The subject technology performs the read operation from the first transaction on a table in a database. The subject technology determines a first commit version identifier corresponding to first data resulting from the read operation. The subject technology proceeds to perform a particular write operation from the first transaction. The subject technology determines that the first commit version identifier fails to match a second commit version identifier corresponding to a second transaction. The subject technology aborts the particular write operation from the first transaction.

    Selecting partitions for reclustering based on distribution of overlapping partitions

    公开(公告)号:US11544244B2

    公开(公告)日:2023-01-03

    申请号:US17654296

    申请日:2022-03-10

    Applicant: Snowflake Inc.

    Abstract: Disclosed herein are embodiments of systems and methods for selecting partitions for reclustering based on distribution of overlapping partitions. In an example, a database platform makes a determination to at least partially recluster a database table that includes data stored across a plurality of partitions. The database platform responsively selects a subset of the partitions. The selecting of the subset includes identifying a point on a domain of a clustering key that corresponds to a local maximum of overlapping partitions, and also includes selecting the subset from among a group of overlapping partitions. The group includes at least one partition that overlaps the identified point on the domain of the clustering key. Each partition in the selected subset is above a reduction goal of overlapping partitions. The database platform at least partially reclusters the selected subset based on the clustering key.

    Query plan caching for networked database systems

    公开(公告)号:US11461327B1

    公开(公告)日:2022-10-04

    申请号:US17658535

    申请日:2022-04-08

    Applicant: Snowflake Inc.

    Abstract: The subject technology receives a query, the query including a set of statements for performing the query. The subject technology populates a compilation context based at least in part the query. The subject technology provides the compilation context to a compiler. The subject technology invokes the compiler to perform a compilation process based on the compilation context, the compilation process comprising performing a lookup operation on a stored plan cache for an exact match based on information from the compilation context, the stored plan cache including a set of stored query plans, and determining whether the exact match of a particular query plan is found in the stored plan cache to avoid compiling the query using the compilation context.

    Internal resource provisioning in database systems

    公开(公告)号:US11379492B2

    公开(公告)日:2022-07-05

    申请号:US17477663

    申请日:2021-09-17

    Applicant: Snowflake Inc.

    Abstract: Resource provisioning systems and methods are described. In an embodiment, a system includes a plurality of shared storage devices collectively storing database data, an execution platform, and a compute service manager. The compute service manager is configured to determine a task to be executed in response to a trigger event and determine a query plan for executing the task, wherein the query plan comprises a plurality of discrete subtasks. The compute service manager is further configured to assign the plurality of discrete subtasks to one or more nodes of a plurality of nodes of the execution platform, determine whether execution of the task is complete, and in response to determining the execution of the task is complete, store a record in the plurality of shared storage devices indicating the task was completed.

    DATABASE METADATA IN IMMUTABLE STORAGE
    79.
    发明申请

    公开(公告)号:US20200233856A1

    公开(公告)日:2020-07-23

    申请号:US16838301

    申请日:2020-04-02

    Applicant: Snowflake Inc.

    Abstract: A method for a database system includes storing table data for a database, the table data including information in rows and columns of one or more database tables. The method includes storing metadata on immutable storage, the metadata including information about the table data for the database. In one embodiment, mutable metadata may be periodically consolidated in the background to create new versions of metadata files and which allows for deletions of old metadata files and old data files.

Patent Agency Ranking