Data replication with cross replication group references

    Publication No.: US11704199B1

    Publication date: 2023-07-18

    Application No.: US17823690

    Filing date: 2022-08-31

    Applicant: Snowflake Inc.

    CPC classification number: G06F11/1435 G06F16/256 G06F16/27 G06F2201/84

    Abstract: This disclosure provides methods and techniques for data replication involving cross replication group (RG) references. The disclosed techniques prevent automatic replication from failing when an entity in an RG refers to another entity external to the RG. The entity to be replicated within the RG is referred to as the “referring entity,” and the external entity, which would otherwise be left as a dangling reference, is referred to as the “referred entity.” Although the referring and referred entities are not replicated together in a single replication operation, the referred entity may already have been replicated to the target account in another replication operation on a different replication group. In such cases, the data replication procedure may, according to aspects of the present disclosure, check whether the referred entity has already been replicated and, if so, replicate the referring entity without failure and link the referring and referred entities so that the referring entity functions normally.
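
The check-then-link flow described in this abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: `Entity`, `TargetAccount`, and `replicate_group` are hypothetical names, and the choice to defer (rather than fail) an entity whose referred entity is still missing is an assumption made only for illustration.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional


@dataclass
class Entity:
    name: str
    refers_to: Optional[str] = None  # name of the referred entity, if any


@dataclass
class TargetAccount:
    # Hypothetical model of the target account's state.
    replicated: Dict[str, Entity] = field(default_factory=dict)
    links: Dict[str, str] = field(default_factory=dict)  # referring -> referred


def replicate_group(group: List[Entity], target: TargetAccount) -> List[str]:
    """Replicate one replication group; return names that had to be deferred."""
    group_names = {e.name for e in group}
    deferred = []
    for entity in group:
        ref = entity.refers_to
        external = ref is not None and ref not in group_names
        if external and ref not in target.replicated:
            # Assumption: a still-missing referred entity defers this entity.
            deferred.append(entity.name)
            continue
        target.replicated[entity.name] = entity
        if ref is not None:
            # Link referring and referred entities on the target side.
            target.links[entity.name] = ref
    return deferred


# Usage: group B refers to an entity that lives in group A.
rg_a = [Entity("masking_policy")]
rg_b = [Entity("orders_table", refers_to="masking_policy")]
target = TargetAccount()
first_try = replicate_group(rg_b, target)   # referred entity not there yet
replicate_group(rg_a, target)               # a separate replication operation
second_try = replicate_group(rg_b, target)  # now succeeds and links
```

In this sketch the second attempt on group B succeeds only because the separate operation on group A has already placed the referred entity in the target account.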

    SYMMETRIC QUERY PROCESSING IN A DATABASE CLEAN ROOM

    Publication No.: US20230222127A1

    Publication date: 2023-07-13

    Application No.: US18162708

    Filing date: 2023-01-31

    Applicant: Snowflake Inc.

    CPC classification number: G06F16/24565 G06F16/27 G06F16/2443

    Abstract: First and second database accounts respectively share first and second source datasets with one another in a data clean room. The first database account stores a first approved-statements table that contains database statements that are permitted by the first database account to be executed against a combination of the shared first source dataset and the shared second source dataset. The second database account stores a second approved-statements table that contains database statements that are permitted by the second database account to be executed against the combination of the shared first source dataset and the shared second source dataset. One or more queries received from the first database account that are included in the second approved-statements table are processed, as are one or more queries received from the second database account that are included in the first approved-statements table.
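
The symmetric check in this abstract can be sketched as follows, assuming (for illustration only) that a statement is "included" in an approved-statements table when its whitespace-normalized text matches exactly. `CleanRoom`, `approve`, and `is_permitted` are hypothetical names, not part of any real API.

```python
from typing import Dict, Set


def normalize(statement: str) -> str:
    # Collapse whitespace and drop a trailing semicolon (a simplification).
    return " ".join(statement.split()).rstrip("; ")


class CleanRoom:
    def __init__(self) -> None:
        # account name -> statements that account permits on the combined data
        self.approved: Dict[str, Set[str]] = {}

    def approve(self, account: str, statement: str) -> None:
        self.approved.setdefault(account, set()).add(normalize(statement))

    def is_permitted(self, requester: str, counterparty: str, statement: str) -> bool:
        # A query from `requester` is checked against the COUNTERPARTY's table.
        # `requester` is kept only to make the call site read naturally.
        return normalize(statement) in self.approved.get(counterparty, set())


room = CleanRoom()
room.approve("account_2", "SELECT COUNT(*) FROM overlap;")
room.approve("account_1", "SELECT region, COUNT(*) FROM overlap GROUP BY region")

ok = room.is_permitted("account_1", "account_2", "SELECT  COUNT(*)  FROM overlap")
not_ok = room.is_permitted("account_2", "account_1", "SELECT * FROM overlap")
```

Each account consults the other account's table, which is what makes the arrangement symmetric.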

    TRACKING INTERMEDIATE CHANGES IN DATABASE DATA
    Type: Invention publication (published application)

    Publication No.: US20230205766A1

    Publication date: 2023-06-29

    Application No.: US18179762

    Filing date: 2023-03-07

    Applicant: Snowflake Inc.

    CPC classification number: G06F16/2379 G06F16/2282 G06F16/2456 G06F16/24554

    Abstract: Systems, methods, and devices for tracking a series of changes to database data are disclosed. A method includes executing a transaction to modify data in a micro-partition of a table of a database by generating a new micro-partition that embodies the transaction. The method includes associating transaction data with the new micro-partition, wherein the transaction data comprises a timestamp when the transaction was fully executed, and further includes associating modification data with the new micro-partition that comprises an indication of one or more rows of the table that were modified by the transaction. The method includes joining the transaction data with the modification data to generate joined data and querying the joined data to determine a listing of intermediate modifications made to the table between a first timestamp and a second timestamp.
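
The join-then-query step in this abstract can be illustrated with Python's built-in `sqlite3` module. The table and column names (`transaction_data`, `modification_data`, `partition_id`, `committed_at`) and the sample rows are hypothetical, and treating the first timestamp as exclusive and the second as inclusive is an assumption; the abstract does not specify either.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE transaction_data (partition_id TEXT PRIMARY KEY, committed_at INTEGER);
CREATE TABLE modification_data (partition_id TEXT, row_id INTEGER, change TEXT);
INSERT INTO transaction_data VALUES ('p1', 100), ('p2', 200), ('p3', 300);
INSERT INTO modification_data VALUES
    ('p1', 7, 'insert'), ('p2', 7, 'update'), ('p2', 9, 'insert'), ('p3', 7, 'delete');
""")


def intermediate_changes(conn, first_ts, second_ts):
    """List every modification committed after first_ts and up to second_ts."""
    query = """
        SELECT t.committed_at, m.row_id, m.change
        FROM transaction_data AS t
        JOIN modification_data AS m ON m.partition_id = t.partition_id
        WHERE t.committed_at > ? AND t.committed_at <= ?
        ORDER BY t.committed_at, m.row_id
    """
    return conn.execute(query, (first_ts, second_ts)).fetchall()


changes = intermediate_changes(conn, 100, 300)
# changes == [(200, 7, 'update'), (200, 9, 'insert'), (300, 7, 'delete')]
```

Row 7 appears twice in the result: the listing keeps each intermediate modification rather than only the net difference between the two timestamps.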

    FEATURE SETS USING SEMI-STRUCTURED DATA STORAGE

    Publication No.: US20230177063A1

    Publication date: 2023-06-08

    Application No.: US18162522

    Filing date: 2023-01-31

    Applicant: Snowflake Inc.

    CPC classification number: G06F16/25 G06F16/24558 G06F16/86 G06F16/2282

    Abstract: The subject technology receives, by a database system, raw input data from a source table provided by an external environment, the source table comprising multiple rows and multiple columns, the raw input data comprising values in a first format, the values comprising input features corresponding to datasets included in the raw input data for machine learning models, the external environment comprising a system that is external to the database system and is accessed by different users. The subject technology generates cell data for a second table based on the values from the source table. The subject technology performs a database operation to generate the second table including table metadata, column metadata, and the generated cell data.

    Background service to generate data dictionary metadata

    Publication No.: US11669630B1

    Publication date: 2023-06-06

    Application No.: US18051401

    Filing date: 2022-10-31

    Applicant: Snowflake Inc.

    CPC classification number: G06F21/6218 G06F21/604 G06F2221/2141

    Abstract: A data dictionary generation system utilizes a background service that is programmed to automatically populate and update a data dictionary for listings offering shared data. A data dictionary includes metadata describing the shared data overall as well as the individual objects included in the listing, such as the individual tables, schemas, views, and functions. To generate the data dictionary, the data dictionary generation system analyzes the shared data to identify objects, identifies a set of data fields associated with each identified object and populates the set of data fields associated with each identified object based on the shared data offered by the listing. To ensure that a data dictionary for each listing remains up to date, the data dictionary generation system periodically scans the listings to identify any changes to share access granted to the listings.
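
A rough Python sketch of the two steps in this abstract, building the dictionary and re-scanning for changes, is given below. The listing layout (a dict with `title` and `objects`), the chosen data fields, and the function names are all hypothetical; comparing the set of shared object names is only one assumed way to detect a change in share access.

```python
def build_data_dictionary(listing):
    """Identify the objects in a listing and populate a few fields for each."""
    entries = []
    for obj in listing["objects"]:
        entries.append({
            "name": obj["name"],
            "kind": obj["kind"],  # e.g. table, schema, view, function
            "columns": list(obj.get("columns", [])),
        })
    return {
        "listing": listing["title"],
        "object_count": len(entries),
        "objects": entries,
    }


def refresh_if_changed(listing, cache):
    """Background-scan step: rebuild only when the shared objects changed."""
    granted = frozenset(obj["name"] for obj in listing["objects"])
    if cache.get("granted") == granted:
        return False
    cache["granted"] = granted
    cache["dictionary"] = build_data_dictionary(listing)
    return True


listing = {
    "title": "weather_share",
    "objects": [
        {"name": "daily_obs", "kind": "table", "columns": ["station", "temp_c"]},
        {"name": "to_fahrenheit", "kind": "function"},
    ],
}
cache = {}
first_scan = refresh_if_changed(listing, cache)   # builds the dictionary
second_scan = refresh_if_changed(listing, cache)  # nothing changed
listing["objects"].append({"name": "hot_days", "kind": "view", "columns": ["day"]})
third_scan = refresh_if_changed(listing, cache)   # new object triggers a rebuild
```

A real background service would run the scan on a schedule; here it is called by hand to keep the example self-contained.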
