Batch data ingestion in database systems

    Publication No.: US10896172B2

    Publication Date: 2021-01-19

    Application No.: US16720418

    Filing Date: 2019-12-19

    Applicant: Snowflake Inc.

    Abstract: Systems, methods, and devices for batch ingestion of data into a table of a database. A method includes determining a notification indicating a presence of a user file received from a client account to be ingested into a database. The method includes identifying data in the user file and identifying a target table of the database to receive the data in the user file. The method includes generating an ingest task indicating the data and the target table. The method includes assigning the ingest task to an execution node of an execution platform, wherein the execution platform comprises a plurality of execution nodes operating independent of a plurality of shared storage devices collectively storing database data. The method includes registering metadata concerning the target table in a metadata store after the data has been fully committed to the target table by the execution node.
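
    Illustrative sketch (not part of the patent text): the flow in the abstract above, from notification to ingest task to execution node to metadata registration, might be modeled roughly as follows. The names `IngestTask`, `ExecutionNode`, `MetadataStore`, and `ingest`, the in-memory tables, and the round-robin assignment are all assumptions made for illustration; the abstract does not specify them.

```python
from dataclasses import dataclass, field
from itertools import cycle


@dataclass
class IngestTask:
    file_name: str      # user file announced by the notification
    target_table: str   # table that should receive the file's rows
    rows: list


@dataclass
class MetadataStore:
    committed_files: dict = field(default_factory=dict)  # table -> file names

    def register(self, table, file_name):
        self.committed_files.setdefault(table, []).append(file_name)


class ExecutionNode:
    def __init__(self, name):
        self.name = name

    def run(self, task, tables):
        # Commit every row of the task to the target table.
        tables.setdefault(task.target_table, []).extend(task.rows)
        return True  # signals a full commit


def ingest(notifications, nodes, tables, metadata):
    """Turn each notification into a task, assign it, then register metadata."""
    node_cycle = cycle(nodes)  # round-robin assignment is an assumption
    assignments = []
    for note in notifications:
        task = IngestTask(note["file"], note["table"], note["rows"])
        node = next(node_cycle)
        if node.run(task, tables):
            # Metadata is registered only after the commit has succeeded.
            metadata.register(task.target_table, task.file_name)
        assignments.append((task.file_name, node.name))
    return assignments


tables, metadata = {}, MetadataStore()
nodes = [ExecutionNode("node-a"), ExecutionNode("node-b")]
notifications = [
    {"file": "f1.csv", "table": "orders", "rows": [(1, "x"), (2, "y")]},
    {"file": "f2.csv", "table": "orders", "rows": [(3, "z")]},
]
assignments = ingest(notifications, nodes, tables, metadata)
```

    The point of the sketch is the ordering: the metadata store is touched only after the node reports a full commit, which mirrors the last step of the abstract.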

    Database metadata in immutable storage

    Publication No.: US10860573B2

    Publication Date: 2020-12-08

    Application No.: US16838301

    Filing Date: 2020-04-02

    Applicant: Snowflake Inc.

    Abstract: A method for a database system includes storing table data for a database, the table data including information in rows and columns of one or more database tables. The method includes storing metadata on immutable storage, the metadata including information about the table data for the database. In one embodiment, mutable metadata may be periodically consolidated in the background to create new versions of metadata files, which allows for deletion of old metadata files and old data files.

    SHARING MATERIALIZED VIEWS IN MULTIPLE TENANT DATABASE SYSTEMS

    Publication No.: US20200379995A1

    Publication Date: 2020-12-03

    Application No.: US16831000

    Filing Date: 2020-03-26

    Applicant: Snowflake Inc.

    Abstract: Systems, methods, and devices for sharing materialized views in multiple tenant database systems. A method includes defining a materialized view over a source table that is associated with a first account of a multiple tenant database. The method includes defining cross-account access rights to the materialized view to a second account such that the second account can read the materialized view without copying the materialized view. The method includes modifying the source table for the materialized view. The method includes identifying whether the materialized view is stale with respect to the source table by merging the materialized view and the source table.

    Sharing Materialized Views In Multiple Tenant Database Systems

    Publication No.: US20200379994A1

    Publication Date: 2020-12-03

    Application No.: US16428395

    Filing Date: 2019-05-31

    Applicant: Snowflake Inc.

    Abstract: Systems, methods, and devices for sharing materialized views in multiple tenant database systems. A method includes defining a materialized view over a source table that is associated with a first account of a multiple tenant database. The method includes defining cross-account access rights to the materialized view to a second account such that the second account can read the materialized view without copying the materialized view. The method includes modifying the source table for the materialized view. The method includes identifying whether the materialized view is stale with respect to the source table by merging the materialized view and the source table.

    Multi-level metadata in database systems

    Publication No.: US10810232B2

    Publication Date: 2020-10-20

    Application No.: US16775092

    Filing Date: 2020-01-28

    Applicant: Snowflake Inc.

    Abstract: Systems, methods, and devices for multiple level metadata organization in database systems. A system includes a plurality of shared storage devices collectively storing database data across a plurality of tables that each comprise one or more micro-partitions including immutable storage devices. The system includes a resource manager configured to manage the storage of database data stored in a table across one or more of the plurality of shared storage devices. The resource manager is configured to manage the storage of metadata for the table. The metadata includes a column expression property with information about data stored in a column, a micro-partition expression property with information about data stored in a micro-partition, a grouping expression property with information about data stored in two or more micro-partitions, and cumulative table metadata with global information about all micro-partitions of the table.
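
    Illustrative sketch (not part of the patent text): one way to picture the metadata levels named in the abstract above is as summaries that roll up from micro-partitions to groupings to the whole table, so a lookup can be pruned top-down. Using min/max ranges as the stored "information about data", and the names `Summary`, `merge`, and `candidate_partitions`, are assumptions made for illustration only.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Summary:
    lo: int
    hi: int

    def may_contain(self, value):
        return self.lo <= value <= self.hi


def merge(summaries):
    """Combine lower-level summaries into one higher-level summary."""
    return Summary(min(s.lo for s in summaries), max(s.hi for s in summaries))


# Hypothetical per-micro-partition summaries for one column.
partitions = {
    "mp1": Summary(0, 9),
    "mp2": Summary(10, 19),
    "mp3": Summary(100, 109),
    "mp4": Summary(110, 119),
}
# Grouping level: each group summarizes two or more micro-partitions.
groups = {"g1": ["mp1", "mp2"], "g2": ["mp3", "mp4"]}
group_summaries = {g: merge([partitions[p] for p in ps]) for g, ps in groups.items()}
# Cumulative table level: one summary over every micro-partition.
table_summary = merge(list(partitions.values()))


def candidate_partitions(value):
    """Return micro-partitions that could hold `value`, checking top-down."""
    if not table_summary.may_contain(value):
        return []
    hits = []
    for g, ps in groups.items():
        if group_summaries[g].may_contain(value):
            hits.extend(p for p in ps if partitions[p].may_contain(value))
    return hits
```

    In this toy layout a lookup for a value outside every group's range never inspects an individual micro-partition summary, which is the benefit a layered organization is meant to give.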

    Transferring Connections in a Multiple Deployment Database

    Publication No.: US20200301942A1

    Publication Date: 2020-09-24

    Application No.: US16357822

    Filing Date: 2019-03-19

    Applicant: Snowflake Inc.

    Abstract: Systems, methods, and devices for transitioning a client connection for a database from a first deployment of the database to a second deployment of the database. A method includes maintaining the client connection between a client and the first deployment such that database traffic occurs at the first deployment. The method includes generating a unique identification that references a first connection object of the first deployment and a second connection object of a second deployment. The method includes receiving a notification that the first deployment is unavailable and providing an external connection group unique identification to the second deployment for the second deployment to determine whether the client should be connected to the second deployment. The method includes receiving a uniform resource locator from the second deployment if the client should be connected to the second deployment.

    Transactional Streaming Of Change Tracking Data

    Publication No.: US20200183908A1

    Publication Date: 2020-06-11

    Application No.: US16213554

    Filing Date: 2018-12-07

    Applicant: Snowflake Inc.

    Abstract: Systems, methods, and devices for generating a transactional change tracking summary for a database. A method includes executing a transaction on a table of a database, wherein the table includes a micro-partition and the transaction is executed on the micro-partition. The method includes, in response to the transaction being fully executed, generating a change tracking entry comprising an indication of one or more modifications made to the table by the transaction. The method includes storing the change tracking entry in a change tracking stream, wherein the change tracking stream includes one or more sequential change tracking entries that each correspond to a different transaction that is fully executed on the table.
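
    Illustrative sketch (not part of the patent text): the abstract above describes one change tracking entry per fully executed transaction, appended in order to a stream. A minimal in-memory model of that idea might look like the following. The `Table` and `ChangeEntry` classes, the `("insert" | "delete", row)` operation format, and the staging approach are assumptions made for illustration.

```python
from dataclasses import dataclass, field


@dataclass
class ChangeEntry:
    txn_id: int
    changes: list  # (operation, row) pairs made by one transaction


@dataclass
class Table:
    rows: list = field(default_factory=list)
    stream: list = field(default_factory=list)  # sequential change tracking entries
    _next_txn: int = 1

    def execute(self, operations):
        """Apply all operations, then append exactly one entry to the stream."""
        staged = list(self.rows)
        for op, row in operations:
            if op == "insert":
                staged.append(row)
            elif op == "delete":
                staged.remove(row)  # raises ValueError if the row is absent
            else:
                raise ValueError(f"unknown operation: {op}")
        # Reached only if every operation succeeded.
        self.rows = staged
        entry = ChangeEntry(self._next_txn, list(operations))
        self.stream.append(entry)
        self._next_txn += 1
        return entry


t = Table()
t.execute([("insert", "a"), ("insert", "b")])
t.execute([("delete", "a")])
try:
    t.execute([("delete", "missing")])  # fails, so no entry is recorded
except ValueError:
    pass
```

    Because the entry is written only after every operation has been applied to the staged copy, a failed transaction leaves both the rows and the stream untouched.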
