Overlap results data generation on a cloud data platform

    公开(公告)号:US11836138B1

    公开(公告)日:2023-12-05

    申请号:US18162688

    申请日:2023-01-31

    Applicant: Snowflake Inc.

    CPC classification number: G06F16/24568 G06F16/244 G06F16/2456 G06F16/24564

    Abstract: A system for generating similarity data for different datasets in a cloud data platform. A first dataset of a plurality of datasets on the cloud data platform is identified, where the first dataset is associated with a first user of the cloud data platform. A semantic type for each feature the first dataset is identified, and each semantic type for the first dataset is compared with existing data of the first user. Semantic types for each feature of each dataset are identified, and each semantic type for the first dataset is compared to each semantic type of each dataset. Overlap requests are generated to output overlap datasets between the first dataset and each of the plurality of datasets. A results dataset is generated by applying the overlap requests to a joined dataset comprising data from the first dataset and data from each of the plurality of datasets.

    MACHINE LEARNING USING SECURED SHARED DATA
    13.
    发明公开

    公开(公告)号:US20230186160A1

    公开(公告)日:2023-06-15

    申请号:US18055248

    申请日:2022-11-14

    Applicant: Snowflake Inc.

    Abstract: Disclosed are systems, methods, and non-transitory computer-readable media for sharing, on a distributed database, a database application to a first user of the distributed database, the database application generated by a second user of the distributed database. The training dataset includes a first database training dataset from the first user of the distributed database and a second database training dataset from the second user of the distributed database, the first database training dataset and the second database training dataset including non-overlapping dataset features. The database application further identifies a query from the second user to train the machine learning model on the training dataset and generates a trained machine learning model by training the machine learning model on a joined dataset according to the query. The database application generates outputs from the trained machine learning model by applying the trained machine learning model on new data.

    Secure data comparison using data clean rooms

    公开(公告)号:US11620409B2

    公开(公告)日:2023-04-04

    申请号:US17932610

    申请日:2022-09-15

    Applicant: Snowflake Inc.

    Abstract: Disclosed herein are methods and systems for secure data comparison using data clean rooms. In an embodiment, a computer system generates a replica database based on a provider database, which stores a cross reference table that cross references a client dataset of a client database and a provider dataset of the provider database. The system receives, at the replica database, a table that is generated by the client database using the cross-reference table. The system transmits, from the replica database, the table to the provider database. The system receives, at the replica database, a results dataset that is generated by the provider database by applying a database statement to the provider database using the table generated by the client database. The system shares, from the replica database, the results dataset with the client database.

    ENABLING APPLICATION ACCESS TO CLOUD DATA

    公开(公告)号:US20220147647A1

    公开(公告)日:2022-05-12

    申请号:US17646191

    申请日:2021-12-28

    Abstract: Security credentials associated with a first account maintained by a database system are authenticated. Based on authenticating the security credentials, a user interface element is provided to enable sharing of customer data associated with the first account with an application. A request to share customer data with the application is received. Based on the request, third-party data that is accessible by the first account is identified. The third-party data corresponds to a second account maintained by the database system. The second account corresponds to a third-party data provider. The application is enabled to access cloud data associated with the first account based on the request. The cloud data comprises the customer data and the third-party data.

Patent Agency Ranking