Overlap queries on a distributed database

    Publication number: US12008001B2

    Publication date: 2024-06-11

    Application number: US17804434

    Application date: 2022-05-27

    Applicant: Snowflake Inc.

    CPC classification number: G06F16/24568 G06F16/244 G06F16/2456 G06F16/24564

    Abstract: Systems, methods, and machine-readable storage devices provide for identifying a user dataset on a distributed database. The system includes generating a similarity score dataset that indicates a similarity between the user dataset and a plurality of datasets of other users of the distributed database. The system generates a plurality of overlap queries that are configured to output overlap datasets between the user dataset and one or more of the plurality of datasets. The system further generates a results dataset by applying one or more of the plurality of overlap queries to a joined dataset comprising data from the user dataset and one of the plurality of datasets of other users on the distributed database.
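
    The flow in this abstract (score similarity against other users' datasets, then run an overlap query over a joined dataset) can be illustrated with a minimal in-memory sketch. The abstract does not say how similarity is computed or what form the overlap queries take; the Jaccard score, the `email` join key, and all function and dataset names below are assumptions made for illustration, not the patented method.

```python
# Minimal sketch, assuming Jaccard similarity on a shared key column.
# All names (similarity_scores, overlap_query, "email") are hypothetical.

def jaccard(a, b):
    """Jaccard similarity of two collections of keys (0.0 when both are empty)."""
    a, b = set(a), set(b)
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def similarity_scores(user_rows, other_datasets, key):
    """Build a 'similarity score dataset': one score per other user's dataset."""
    user_keys = [row[key] for row in user_rows]
    return {
        name: jaccard(user_keys, [row[key] for row in rows])
        for name, rows in other_datasets.items()
    }


def overlap_query(user_rows, other_rows, key):
    """Join the two datasets on `key` and return the overlapping row pairs."""
    index = {}
    for row in other_rows:
        index.setdefault(row[key], []).append(row)
    return [
        (user_row, other_row)
        for user_row in user_rows
        for other_row in index.get(user_row[key], [])
    ]


user_rows = [
    {"email": "a@x.com", "plan": "pro"},
    {"email": "b@x.com", "plan": "free"},
]
other_datasets = {
    "retailer": [
        {"email": "b@x.com", "city": "Oslo"},
        {"email": "c@x.com", "city": "Rome"},
    ],
    "airline": [{"email": "z@x.com", "miles": 10}],
}

scores = similarity_scores(user_rows, other_datasets, key="email")
# Pick the most similar dataset and run the overlap query against it.
best = max(scores, key=scores.get)
results = overlap_query(user_rows, other_datasets[best], key="email")
```

    In a real distributed database the join and scoring would run as SQL over shared tables rather than over Python lists; the sketch only shows the order of the steps.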

    OVERLAP QUERIES ON A DISTRIBUTED DATABASE
    Published application

    Publication number: US20230385284A1

    Publication date: 2023-11-30

    Application number: US17804434

    Application date: 2022-05-27

    Applicant: Snowflake Inc.

    CPC classification number: G06F16/24568 G06F16/2456 G06F16/244 G06F16/24564

    Abstract: Systems, methods, and machine-readable storage devices provide for identifying a user dataset on a distributed database. The system includes generating a similarity score dataset that indicates a similarity between the user dataset and a plurality of datasets of other users of the distributed database. The system generates a plurality of overlap queries that are configured to output overlap datasets between the user dataset and one or more of the plurality of datasets. The system further generates a results dataset by applying one or more of the plurality of overlap queries to a joined dataset comprising data from the user dataset and one of the plurality of datasets of other users on the distributed database.

    Accessing listings in a data exchange

    Publication number: US11531681B2

    Publication date: 2022-12-20

    Application number: US17704783

    Application date: 2022-03-25

    Applicant: Snowflake Inc.

    Abstract: A method for accessing listings in a data exchange includes creating a first listing in a data exchange, the first listing referencing a first database of a plurality of databases and specifying identity-based sharing of the first database, creating a second listing in the data exchange, the second listing referencing a second database of the plurality of databases and data of the first database shared according to the identity-based sharing of the first database, and receiving an instruction from a user of the data exchange, the instruction referencing the second listing and instructing the addition of the second listing to a set of consumed data shares accessible by the user.
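
    The relationship between the two listings and the consumer's set of data shares can be pictured with a small data-structure sketch. Every class, field, and account name here is hypothetical; the abstract does not describe an API, and the permission check is only one plausible reading of "identity-based sharing".

```python
# Minimal sketch, assuming identity-based sharing means an allow-list of
# account identities on a listing. All names are hypothetical.
from dataclasses import dataclass, field


@dataclass
class Listing:
    name: str
    database: str
    provider: str
    shared_with: set = field(default_factory=set)  # identities allowed to use the data
    includes: list = field(default_factory=list)   # other listings whose data is reused


@dataclass
class DataExchange:
    listings: dict = field(default_factory=dict)
    consumed: dict = field(default_factory=dict)   # user -> set of listing names

    def create_listing(self, listing):
        # A listing may reuse another listing's data only if its provider
        # is among the identities that data was shared with.
        for upstream_name in listing.includes:
            upstream = self.listings[upstream_name]
            if listing.provider not in upstream.shared_with:
                raise PermissionError(
                    f"{listing.provider} has no access to {upstream_name}"
                )
        self.listings[listing.name] = listing

    def add_to_consumed(self, user, listing_name):
        """Handle a user's instruction to add a listing to their consumed shares."""
        if listing_name not in self.listings:
            raise KeyError(listing_name)
        self.consumed.setdefault(user, set()).add(listing_name)


exchange = DataExchange()
exchange.create_listing(
    Listing("first", database="db1", provider="acct_a", shared_with={"acct_b"})
)
exchange.create_listing(
    Listing("second", database="db2", provider="acct_b", includes=["first"])
)
exchange.add_to_consumed("user_1", "second")
```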

    DATA OVERLAP COUNT ADJUSTMENT IN A MULTIPLE TENANT DATABASE SYSTEM

    Publication number: US20220327232A1

    Publication date: 2022-10-13

    Application number: US17847681

    Application date: 2022-06-23

    Applicant: SNOWFLAKE INC.

    Abstract: Systems, methods, and devices for generating a secure join of database data are disclosed. A method creates a secure view of datapoints of a consumer account and processes, using a secure user defined function (UDF), the datapoints of the consumer account and datapoints of a provider account to generate a secure join key. The datapoints of the consumer account are provided to the secure UDF using the secure view. The method further performs, by a processor, an analysis of the datapoints of the consumer account and the datapoints of the provider account of the secure join key. The analysis returns a count value of overlapping datapoints between the consumer account and the provider account. The method further adjusts the count value of overlapping datapoints based on a number of distinct rows associated with the provider account, and provides the adjusted count value of overlapping datapoints to the consumer account.
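
    The steps in this abstract (derive a join key, count overlapping datapoints, adjust the count using the provider's distinct row count) can be sketched as follows. The abstract does not disclose how the join key is built or how the adjustment is calculated; the salted SHA-256 key, the `min_distinct` threshold, and the `bucket` rounding below are assumptions chosen purely for illustration.

```python
# Minimal sketch, assuming a salted hash as the join key and a
# threshold-plus-rounding rule as the adjustment. Both are hypothetical.
import hashlib


def join_key(value, salt):
    """Derive a join key from a datapoint without exposing the raw value."""
    normalized = value.strip().lower()
    return hashlib.sha256((salt + normalized).encode("utf-8")).hexdigest()


def overlap_count(consumer_values, provider_values, salt):
    """Return (raw overlap count, number of distinct provider keys)."""
    consumer_keys = {join_key(v, salt) for v in consumer_values}
    provider_keys = {join_key(v, salt) for v in provider_values}
    return len(consumer_keys & provider_keys), len(provider_keys)


def adjust_count(raw_count, provider_distinct, min_distinct=5, bucket=5):
    """Hypothetical adjustment: suppress counts for small provider tables,
    otherwise round the count down to a multiple of `bucket`."""
    if provider_distinct < min_distinct:
        return 0
    return (raw_count // bucket) * bucket


consumer = ["a@x.com", "B@x.com ", "c@x.com"]
provider = ["b@x.com", "c@x.com", "d@x.com", "e@x.com", "f@x.com", "g@x.com"]

raw, distinct = overlap_count(consumer, provider, salt="demo-salt")
adjusted = adjust_count(raw, distinct, min_distinct=5, bucket=2)
```

    Only `adjusted` would be returned to the consumer account in this sketch; the raw count and the provider's keys stay on the provider side.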
