-
公开(公告)号:US12021877B2
公开(公告)日:2024-06-25
申请号:US18369404
申请日:2023-09-18
Applicant: SNOWFLAKE INC.
Inventor: Pui Kei Johnston Chu , Benoit Dageville , Matthew J. Glickman , Christian Kleinerman , Prasanna Krishnan , Justin Langseth
IPC: H04L29/06 , G06F16/2455 , G06F21/60 , G06F21/62 , H04L9/40 , H04L41/22 , H04L51/212 , H04L67/1097
CPC classification number: H04L63/102 , G06F16/2455 , G06F21/606 , G06F21/6218 , H04L41/22 , H04L51/212 , H04L63/101 , H04L67/1097
Abstract: Systems and methods for managing membership in a private data exchange are provided herein. In one embodiment, the method includes receiving a request for access to a first listing of a data exchange, each listing of the data exchange comprising version metadata. The method further includes, in response to the request, accessing a first version of a data set referenced by the first listing, wherein the first version of the data set comprises a first structure defined by first version metadata, a second listing of the data exchange references a second version of the data set, the second version of the data set comprising a second structure defined by second version metadata, and the second structure is incompatible with the first structure.
-
公开(公告)号:US12020128B2
公开(公告)日:2024-06-25
申请号:US18162695
申请日:2023-01-31
Applicant: Snowflake Inc.
Inventor: Orestis Kostakis , Justin Langseth
IPC: G06N20/00
CPC classification number: G06N20/00
Abstract: A method includes installing, in a consumer database account, a shared-instance database that includes a shared instance of a provider-account database that resides in a provider database account. The shared-instance database includes a first schema that includes provider-account training data, provider-account scoring data, a training function, and a scoring function. The method also includes invoking the training function from the consumer database account, which results in creation in the consumer database account of a second schema that includes a machine-learning-model instance of a machine learning model, and which also results in training the machine-learning model instance with at least the provider-account training data. Additionally, the method includes generating consumer-account scoring data by inputting, into the trained machine-learning-model instance, consumer-account input data that is stored in the consumer database account. The method also includes storing the consumer-account scoring data in the consumer database account.
-
公开(公告)号:US11928115B2
公开(公告)日:2024-03-12
申请号:US18161541
申请日:2023-01-30
Applicant: Snowflake Inc.
Inventor: Rachel Frances Blum , Justin Langseth , Michael Earle Rainey
IPC: G06F16/2455 , G06F16/242 , G06F16/27 , G06F21/62
CPC classification number: G06F16/24565 , G06F16/2443 , G06F16/27 , G06F21/6227
Abstract: Disclosed herein are systems and methods for query processing with restrictions in a database clean room. In an embodiment, a system receives a query directed to a combination of a first source dataset from a first database account of a distributed database and a second source dataset from a second database account of the distributed database. The system generates an approved statements table that contains database statement language that can be executed against the combination of the first and second source datasets. Based on determining that the approved statements table includes the query, the system executes the query to produce results data, and stores the results data in the first database account.
-
公开(公告)号:US11893462B2
公开(公告)日:2024-02-06
申请号:US18055248
申请日:2022-11-14
Applicant: Snowflake Inc.
Inventor: Monica J. Holboke , Justin Langseth , Stuart Ozer , William L. Stratton, Jr.
CPC classification number: G06N20/00 , G06F16/256 , G06F16/283 , G06F18/214 , G06F21/6227
Abstract: Disclosed are systems, methods, and non-transitory computer-readable media for sharing, on a distributed database, a database application to a first user of the distributed database, the database application generated by a second user of the distributed database. The training dataset includes a first database training dataset from the first user of the distributed database and a second database training dataset from the second user of the distributed database, the first database training dataset and the second database training dataset including non-overlapping dataset features. The database application further identifies a query from the second user to train the machine learning model on the training dataset and generates a trained machine learning model by training the machine learning model on a joined dataset according to the query. The database application generates outputs from the trained machine learning model by applying the trained machine learning model on new data.
-
公开(公告)号:US20230409968A1
公开(公告)日:2023-12-21
申请号:US18162695
申请日:2023-01-31
Applicant: Snowflake Inc.
Inventor: Orestis Kostakis , Justin Langseth
IPC: G06N20/00
CPC classification number: G06N20/00
Abstract: A method includes installing, in a consumer database account, a shared-instance database that includes a shared instance of a provider-account database that resides in a provider database account. The shared-instance database includes a first schema that includes provider-account training data, provider-account scoring data, a training function, and a scoring function. The method also includes invoking the training function from the consumer database account, which results in creation in the consumer database account of a second schema that includes a machine-learning-model instance of a machine learning model, and which also results in training the machine-learning model instance with at least the provider-account training data. Additionally, the method includes generating consumer-account scoring data by inputting, into the trained machine-learning-model instance, consumer-account input data that is stored in the consumer database account. The method also includes storing the consumer-account scoring data in the consumer database account.
-
公开(公告)号:US20230385286A1
公开(公告)日:2023-11-30
申请号:US18162688
申请日:2023-01-31
Applicant: Snowflake Inc.
Inventor: Matthew J. Glickman , Orestis Kostakis , Justin Langseth
IPC: G06F16/2455 , G06F16/242
CPC classification number: G06F16/24568 , G06F16/24564 , G06F16/244 , G06F16/2456
Abstract: A system for generating similarity data for different datasets in a cloud data platform. A first dataset of a plurality of datasets on the cloud data platform is identified, where the first dataset is associated with a first user of the cloud data platform. A semantic type for each feature the first dataset is identified, and each semantic type for the first dataset is compared with existing data of the first user. Semantic types for each feature of each dataset are identified, and each semantic type for the first dataset is compared to each semantic type of each dataset. Overlap requests are generated to output overlap datasets between the first dataset and each of the plurality of datasets. A results dataset is generated by applying the overlap requests to a joined dataset comprising data from the first dataset and data from each of the plurality of datasets.
-
公开(公告)号:US11809600B2
公开(公告)日:2023-11-07
申请号:US18104281
申请日:2023-01-31
Applicant: Snowflake Inc.
Inventor: Rachel Frances Blum , Joshua James Chacona , Christian Kleinerman , Justin Langseth , William L. Stratton, Jr.
IPC: G06F21/62 , G06F16/22 , G06F16/2455
CPC classification number: G06F21/6254 , G06F16/2282 , G06F16/2455 , G06F21/6263
Abstract: Embodiments of the present disclosure may provide a data clean room allowing secure data analysis across multiple accounts, without the use of third parties. Each account may be associated with a different company or party. The data clean room may provide security functions to safeguard sensitive information. For example, the data clean room may restrict access to data in other accounts. The data clean room may also restrict which data may be used in the analysis and may restrict the output. The overlap data may be anonymized to prevent sensitive information from being revealed.
-
公开(公告)号:US20230188530A1
公开(公告)日:2023-06-15
申请号:US18106901
申请日:2023-02-07
Applicant: Snowflake Inc.
Inventor: Benoit Dageville , Matthew Glickman , Christian Kleinerman , Prasanna Krishnan , Justin Langseth
IPC: H04L9/40 , G06F16/2455 , H04L41/22 , H04L67/1097 , H04L51/212
CPC classification number: H04L63/102 , H04L63/101 , G06F16/2455 , H04L41/22 , H04L67/1097 , H04L51/212
Abstract: Providing access to consumer information on a private data exchange is described. In an example, privileges are assigned to a share object, granting access to consumer information obtained from a query against data referenced by a data listing of a data exchange. The data exchange comprises one or more data listings provided by one or more data providers. In response to the share object being added to a list of consumed share objects of a data provider that provided the data listing, a database is generated that references the consumer information based on the share object. Access to the database is granted to the data provider that provided the data listing.
-
公开(公告)号:US20230169213A1
公开(公告)日:2023-06-01
申请号:US18162701
申请日:2023-01-31
Applicant: Snowflake Inc.
Inventor: Rachel Frances Blum , Justin Langseth , Michael Earle Rainey
IPC: G06F21/62 , G06F16/245 , G06F16/27
CPC classification number: G06F21/6254 , G06F16/245 , G06F16/27
Abstract: A distributed database generates a cross reference table that cross references a first dataset from a first database account and a second dataset from a second account. The distributed database receives a query directed to a combination of the first and second datasets, and generates an interim table in the first database account by applying the query to the cross reference table and the first dataset. The distributed database generates results data in the second database account by applying the query to the interim table and the second dataset, and stores the results data in the first database account.
-
公开(公告)号:US20230169202A1
公开(公告)日:2023-06-01
申请号:US18162692
申请日:2023-01-31
Applicant: Snowflake Inc.
Inventor: Christopher Peter Child , Matthew J. Glickman , Justin Langseth
IPC: G06F21/62 , G06F21/31 , G06F16/248 , G06F16/2457 , G06F16/28 , G06F3/06 , G06F16/25
CPC classification number: G06F21/6227 , G06F3/0637 , G06F16/248 , G06F16/256 , G06F16/283 , G06F16/287 , G06F16/24573 , G06F21/31
Abstract: A request to share cloud data associated with a first account in a database system with an application is received. Based on the request, second-party application data and third-party data that is accessible by the first account is identified. The second-party application data comprises application data corresponding to a second application provided by a second-party application provider. The third-party data corresponds to a second account maintained by the database system. The second account corresponds to a third-party data provider. The application is enabled to access the cloud data associated with the first account based on the request. The cloud data comprises customer data associated with the first account, second-party application data, and the third-party data.
-
-
-
-
-
-
-
-
-