-
Publication number: US20230401200A1
Publication date: 2023-12-14
Application number: US17816420
Filing date: 2022-07-31
Applicant: Snowflake Inc.
Inventor: Rachel Frances Blum , Justin Langseth , Michael Earle Rainey
IPC: G06F16/242 , G06F16/2455 , G06F16/27
CPC classification number: G06F16/242 , G06F16/2456 , G06F16/27
Abstract: Embodiments of the present disclosure may provide a data sharing system implemented as a local application of a distributed database. A query from a query template can be validated and executed against a shared dataset that comprises portions of data from the database dataset and additional portions of data from another database of the distributed database.
-
Publication number: US11822554B2
Publication date: 2023-11-21
Application number: US18162708
Filing date: 2023-01-31
Applicant: Snowflake Inc.
Inventor: Rachel Frances Blum , Justin Langseth , Michael Earle Rainey
IPC: G06F16/2455 , G06F16/27 , G06F16/242
CPC classification number: G06F16/24565 , G06F16/2443 , G06F16/27
Abstract: First and second database accounts respectively share first and second source datasets with one another in a data clean room. The first database account stores a first approved-statements table that contains database statements that are permitted by the first database account to be executed against a combination of the shared first source dataset and the shared second source dataset. The second database account stores a second approved-statements table that contains database statements that are permitted by the second database account to be executed against the combination of the shared first source dataset and the shared second source dataset. One or more queries received from the first database account that are included in the second approved-statements table are processed, as are one or more queries received from the second database account that are included in the first approved-statements table.
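Illustrative sketch (not taken from the patent text): the mutual-approval check described in this abstract could look roughly like the Python below. The Account class, the normalize helper, the table name "combined", and the sample statements are all hypothetical, and the comparison is a simple normalized string match chosen for brevity.

```python
from dataclasses import dataclass, field


def normalize(sql: str) -> str:
    """Lowercase and collapse whitespace (a simplification: literals are lowercased too)."""
    return " ".join(sql.lower().split())


@dataclass
class Account:
    """Hypothetical stand-in for a database account in the clean room."""
    name: str
    approved_statements: set = field(default_factory=set)

    def approve(self, sql: str) -> None:
        # Record a statement this account permits against the combined data.
        self.approved_statements.add(normalize(sql))


def may_run(query: str, counterparty: Account) -> bool:
    """A query is processed only if the OTHER account's table contains it."""
    return normalize(query) in counterparty.approved_statements


first = Account("first")
second = Account("second")
second.approve("SELECT COUNT(*) FROM combined WHERE region = 'EU'")
first.approve("SELECT AVG(spend) FROM combined")

# A query from the first account is checked against the second account's table.
from_first_ok = may_run("select count(*)  from combined where region = 'EU'", second)
# A query from the second account is checked against the first account's table.
from_second_ok = may_run("SELECT AVG(spend) FROM combined", first)
```

Each account only ever consults the counterparty's table, so neither side can approve its own queries in this sketch.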
-
Publication number: US20230222127A1
Publication date: 2023-07-13
Application number: US18162708
Filing date: 2023-01-31
Applicant: Snowflake Inc.
Inventor: Rachel Frances Blum , Justin Langseth , Michael Earle Rainey
IPC: G06F16/2455 , G06F16/27 , G06F16/242
CPC classification number: G06F16/24565 , G06F16/27 , G06F16/2443
Abstract: First and second database accounts respectively share first and second source datasets with one another in a data clean room. The first database account stores a first approved-statements table that contains database statements that are permitted by the first database account to be executed against a combination of the shared first source dataset and the shared second source dataset. The second database account stores a second approved-statements table that contains database statements that are permitted by the second database account to be executed against the combination of the shared first source dataset and the shared second source dataset. One or more queries received from the first database account that are included in the second approved-statements table are processed, as are one or more queries received from the second database account that are included in the first approved-statements table.
-
Publication number: US11347886B1
Publication date: 2022-05-31
Application number: US17390938
Filing date: 2021-07-31
Applicant: Snowflake Inc.
Inventor: Justin Langseth , Michael Earle Rainey , William L. Stratton, Jr.
IPC: G06F21/62 , G06F16/9535 , G06F16/27 , H04L9/40 , H04L9/06
Abstract: Embodiments of the present disclosure may provide a data clean room allowing encryption based data analysis across multiple accounts, including different provider database user accounts that provide user data and a network service and a requesting user that generates one or more clean room requests. The data clean room may also restrict which data may be used in the analysis and may restrict the output.
-
Publication number: US11138340B1
Publication date: 2021-10-05
Application number: US17334297
Filing date: 2021-05-28
Applicant: Snowflake Inc.
Inventor: Rachel Frances Blum , Justin Langseth , Michael Earle Rainey
IPC: G06F21/00 , G06F21/62 , G06F16/245 , G06F16/27
Abstract: Embodiments of the present disclosure may provide a data clean room allowing secure data analysis across multiple accounts and across different regions and cloud providers. The data clean room may also restrict which data may be used in the analysis and may restrict the output. The overlap data may be anonymized to prevent sensitive information from being revealed. A version of a provider account can be created that is similar to a client account, such as in the same cloud type or the same region as the client database account. The client account can share data that is replicated to the provider account to complete client requests using anonymized data and the cross reference table.
-
Publication number: US12135721B2
Publication date: 2024-11-05
Application number: US18480028
Filing date: 2023-10-03
Applicant: Snowflake Inc.
Inventor: Rachel Frances Blum , Justin Langseth , Michael Earle Rainey
IPC: G06F16/2455 , G06F16/242 , G06F16/27 , G06F21/62
Abstract: Disclosed herein are systems and methods for query processing with restrictions in a database clean room. In an embodiment, a system receives a query directed to a combination of a first source dataset from a first database account of a distributed database and a second source dataset from a second database account of the distributed database. The system generates an approved statements table that contains database statement language that can be executed against the combination of the first and second source datasets. Based on determining that the approved statements table includes the query, the system executes the query to produce results data, and stores the results data in the first database account.
-
Publication number: US11989173B2
Publication date: 2024-05-21
Application number: US18162710
Filing date: 2023-01-31
Applicant: Snowflake Inc.
Inventor: Rachel Frances Blum , Justin Langseth , Michael Earle Rainey
IPC: G06F16/30 , G06F16/242 , G06F16/2455 , G06F16/27
CPC classification number: G06F16/242 , G06F16/2456 , G06F16/27
Abstract: A method includes generating a defined access clean room in a provider database account, as well as installing, in a consumer account, an application instance that implements the defined access clean room. The method also includes sharing, by the provider database account, source provider data with the defined access clean room, where the sharing makes the source provider data accessible to the consumer database account via the application instance. The method further includes sharing, by the provider database account, a query template with the consumer database account via the application instance. Additionally, the method includes receiving a query generated by the consumer database account based on the query template, validating that the query is consistent with the query template, responsively generating query results by executing the query, as well as storing the query results in the consumer database account.
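Illustrative sketch (not taken from the patent text): one simple way to validate that a consumer query is consistent with a shared query template is to check that it equals the template rendered with some permitted parameter combination. The template string, the "dimension" placeholder, the table name "shared_data", and the allowed values below are hypothetical.

```python
from itertools import product


def squash(sql: str) -> str:
    """Collapse runs of whitespace so formatting differences do not matter."""
    return " ".join(sql.split())


def is_consistent(query: str, template: str, allowed: dict) -> bool:
    """Return True if the query equals the template filled with allowed values."""
    names = sorted(allowed)
    # Try every combination of permitted values for the placeholders.
    for values in product(*(sorted(allowed[name]) for name in names)):
        rendered = template.format(**dict(zip(names, values)))
        if squash(query) == squash(rendered):
            return True
    return False


# Hypothetical template shared by the provider via the application instance.
TEMPLATE = "SELECT {dimension}, COUNT(*) FROM shared_data GROUP BY {dimension}"
ALLOWED = {"dimension": {"region", "age_band"}}

good = is_consistent(
    "SELECT region, COUNT(*)  FROM shared_data GROUP BY region", TEMPLATE, ALLOWED
)
bad = is_consistent(
    "SELECT email, COUNT(*) FROM shared_data GROUP BY email", TEMPLATE, ALLOWED
)
```

Enumerating the combinations keeps the check exact for small parameter spaces; a production validator would more likely parse the statement rather than compare strings.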
-
Publication number: US20240062098A1
Publication date: 2024-02-22
Application number: US17821587
Filing date: 2022-08-23
Applicant: Snowflake Inc.
Inventor: Rachel Frances Blum , Nancy Dou , Matthew J. Glickman , Boxin Jiang , Orestis Kostakis , Justin Langseth , Michael Earle Rainey , Haoran Yu
IPC: G06N20/00
CPC classification number: G06N20/00
Abstract: The subject technology receives first party training data provided by an end-user of a baseline machine learning model. The subject technology determines a first set of common features based on the first party training data. The subject technology receives, from at least one data source, a set of datasets. The subject technology determines a second set of common features based on the set of datasets. The subject technology trains, using the first set of common features and the second set of common features, a second machine learning model, the second machine learning model incorporating additional training data from the external data supplier during training compared to the baseline machine learning model. The subject technology generates a boosted machine learning model based at least in part on the training, the boosted machine learning model comprising the trained second machine learning model.
-
Publication number: US20240028597A1
Publication date: 2024-01-25
Application number: US18480028
Filing date: 2023-10-03
Applicant: Snowflake Inc.
Inventor: Rachel Frances Blum , Justin Langseth , Michael Earle Rainey
IPC: G06F16/2455 , G06F16/242 , G06F16/27 , G06F21/62
CPC classification number: G06F16/24565 , G06F16/2443 , G06F16/27 , G06F21/6227
Abstract: Disclosed herein are systems and methods for query processing with restrictions in a database clean room. In an embodiment, a system receives a query directed to a combination of a first source dataset from a first database account of a distributed database and a second source dataset from a second database account of the distributed database. The system generates an approved statements table that contains database statement language that can be executed against the combination of the first and second source datasets. Based on determining that the approved statements table includes the query, the system executes the query to produce results data, and stores the results data in the first database account.
-
Publication number: US11816248B2
Publication date: 2023-11-14
Application number: US18162701
Filing date: 2023-01-31
Applicant: Snowflake Inc.
Inventor: Rachel Frances Blum , Justin Langseth , Michael Earle Rainey
IPC: G06F21/00 , G06F21/62 , G06F16/245 , G06F16/27
CPC classification number: G06F21/6254 , G06F16/245 , G06F16/27
Abstract: A distributed database generates a cross reference table that cross references a first dataset from a first database account and a second dataset from a second database account. The distributed database receives a query directed to a combination of the first and second datasets, and generates an interim table in the first database account by applying the query to the cross reference table and the first dataset. The distributed database generates results data in the second database account by applying the query to the interim table and the second dataset, and stores the results data in the first database account.
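Illustrative sketch (not taken from the patent text): the two-step flow in this abstract can be pictured with plain Python dictionaries standing in for tables. The record identifiers, the segment and spend columns, and the "total spend per segment" query are hypothetical.

```python
from collections import defaultdict

# Hypothetical first-account data: record id -> customer segment.
first_dataset = {"a1": "gold", "a2": "silver", "a3": "gold", "a4": "silver"}
# Hypothetical second-account data: record id -> spend.
second_dataset = {"b7": 120.0, "b8": 40.0, "b9": 15.0, "b10": 99.0}
# Cross reference table: pairs of ids that refer to the same entity.
cross_reference = [("a1", "b7"), ("a2", "b8"), ("a3", "b9")]


def build_interim(xref, first):
    """Step 1, in the first account: apply the query to xref and the first dataset."""
    return {second_id: first[first_id] for first_id, second_id in xref if first_id in first}


def run_in_second(interim, second):
    """Step 2, in the second account: apply the query to the interim table and second dataset."""
    totals = defaultdict(float)
    for second_id, segment in interim.items():
        if second_id in second:
            totals[segment] += second[second_id]
    return dict(totals)


interim_table = build_interim(cross_reference, first_dataset)
# The results are produced in the second account, then stored in the first account.
first_account_results = run_in_second(interim_table, second_dataset)
```

Only rows that appear in the cross reference table contribute; the unmatched records a4 and b10 never reach the results.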