-
Publication No.: US12135745B2
Publication Date: 2024-11-05
Application No.: US18461897
Application Date: 2023-09-06
Applicant: Snowflake Inc.
Inventor: Polita Paulus , Aravind Ramarathinam , Saurin Shah , Sukruth Komarla Sukumar
IPC: G06F16/00 , G06F16/22 , G06F16/2455 , G06F16/901 , G06F16/908 , G06F16/955
Abstract: Disclosed herein are embodiments of systems and methods for synchronizing a file-catalog table with a file stage. In an embodiment, a data platform receives a notification of a modification to one or more files in a file stage. The file stage includes data storage having a storage location. The data platform updates, based on receiving the notification of the modification, a first file-catalog table for the file stage based on the modification. The first file-catalog table includes a row for each file in the file stage, as well as a column for each of one or more metadata properties of the one or more files in the file stage.
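As an illustration only, the notification-driven update described in this abstract might be sketched as follows. The `FileEvent` type, the event kinds, the `size` and `last_modified` metadata columns, and the use of a dictionary as the catalog table are all assumptions made for this sketch; the abstract does not specify them.

```python
from dataclasses import dataclass


@dataclass
class FileEvent:
    """Hypothetical notification of a change to one file in a file stage."""
    kind: str            # assumed values: "added", "updated", "deleted"
    path: str            # file path within the stage
    size: int = 0        # assumed metadata property
    last_modified: str = ""  # assumed metadata property


def apply_event(catalog: dict, event: FileEvent) -> None:
    """Keep one catalog row per file, keyed by path, in step with the stage."""
    if event.kind == "deleted":
        # Remove the row for a file that no longer exists in the stage.
        catalog.pop(event.path, None)
    elif event.kind in ("added", "updated"):
        # Insert or overwrite the row holding the file's metadata columns.
        catalog[event.path] = {
            "size": event.size,
            "last_modified": event.last_modified,
        }
    else:
        raise ValueError(f"unknown event kind: {event.kind}")
```

Each call applies a single notification, so the catalog holds exactly one row per file currently in the stage.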
-
Publication No.: US20240338521A1
Publication Date: 2024-10-10
Application No.: US18629693
Application Date: 2024-04-08
Applicant: Snowflake Inc.
Inventor: Michal Gdak , Ganeshan Ramachandran Iyer , Tomasz Malisz , Mikolaj Niedbala , Pawel Pollak , Saurin Shah , Jan Tomasz Topinski
IPC: G06F40/226
CPC classification number: G06F40/226
Abstract: Systems and methods for: processing a current electronic document, using a set of machine-learning (ML) models, to extract a set of values for a set of data points based on a schema, where the schema describes the set of data points to be extracted from electronic documents; determining whether to select the current electronic document for human validation based on the schema; and adding the current electronic document to a human validation queue in response to determining to select the current electronic document for human validation based on the schema.
-
Publication No.: US20240111762A1
Publication Date: 2024-04-04
Application No.: US18526666
Application Date: 2023-12-01
Applicant: Snowflake Inc.
Inventor: Subramanian Muralidhar , Benoit Dageville , Thierry Cruanes , Nileema Shingte , Saurin Shah , Torsten Grabs , Istvan Cseri
IPC: G06F16/242 , G06F3/06 , G06F9/54 , G06F16/16 , G06F16/22 , G06F16/23 , G06F16/2455 , G06F16/25
CPC classification number: G06F16/2423 , G06F3/0605 , G06F3/0644 , G06F3/0653 , G06F3/067 , G06F9/542 , G06F16/164 , G06F16/2282 , G06F16/2358 , G06F16/2393 , G06F16/24557 , G06F16/256
Abstract: Disclosed herein are systems and methods for efficiently querying external tables. In an embodiment, a database platform receives a query that is directed at least in part to external data in an external table stored on a data storage platform that is external to the database platform. The external table includes a plurality of partitions. The database platform identifies, from external-table metadata, a subset of the plurality of partitions of the external table as including data that potentially satisfies the query. The external-table metadata is stored by the database platform. The database platform identifies data that satisfies the query by scanning the identified subset of the partitions, and responds to the query at least in part with the identified data that satisfies the query.
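A minimal sketch of the two-step lookup in this abstract: first narrow the partitions using metadata held by the database platform, then scan only the survivors. It assumes the metadata is a per-partition minimum and maximum of one integer column and that the query is a range filter; the abstract names neither, and `PartitionMeta`, `prune`, and `scan` are hypothetical names.

```python
from dataclasses import dataclass


@dataclass
class PartitionMeta:
    """Hypothetical per-partition metadata kept by the database platform."""
    name: str
    min_value: int
    max_value: int


def prune(partitions, lo, hi):
    """Return partitions whose [min, max] range overlaps the query range."""
    return [p for p in partitions if p.max_value >= lo and p.min_value <= hi]


def scan(rows_by_partition, candidates, lo, hi):
    """Scan only the candidate partitions and keep rows that satisfy the query."""
    return [
        value
        for p in candidates
        for value in rows_by_partition[p.name]
        if lo <= value <= hi
    ]
```

Pruning only identifies partitions that potentially hold matching data, so the scan step still applies the filter to every row it reads.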
-
Publication No.: US11734451B1
Publication Date: 2023-08-22
Application No.: US17931424
Application Date: 2022-09-12
Applicant: Snowflake Inc.
Inventor: Abhiram Arikapudi , Brian Jack Lee , Saurin Shah , Cameron Tekiyeh , Sujay C. Yantrapragada
IPC: G06F21/00 , G06F21/62 , G06F16/242
CPC classification number: G06F21/6254 , G06F16/2445
Abstract: An exchange template framework for a data platform. A data provider of the data platform instructs the data platform to generate an exchange template based on compliance rules. In response to determining by the data platform to initiate a compliance report, the data platform maps evidence items for the compliance report from one or more source tables using a data field mapper of the exchange template. The data platform generates compliance program specific views based on the evidence items mapped from the source tables and generates an anonymized view based on the compliance program specific views using an anonymization layer of the exchange template. The data platform provides the anonymized view to an assessor using one or more validation scripts of the exchange template.
-
Publication No.: US20230214405A1
Publication Date: 2023-07-06
Application No.: US18119775
Application Date: 2023-03-09
Applicant: Snowflake Inc.
Inventor: Subramanian Muralidhar , Polita Paulus , Sahaj Saini , Saurin Shah , Srinidhi Karthik Bisthavalli Srinivasa
IPC: G06F16/27 , G06F16/955 , G06F16/25
CPC classification number: G06F16/27 , G06F16/9566 , G06F16/254
Abstract: The embodiments described herein provide means for replicating external stages between deployments of e.g., a cloud data lake using a modified storage integration. The modified storage integration may be defined with a set of storage locations, wherein the storage integration comprises a base URL for each of the set of storage locations and wherein each storage location identifies a remote deployment where a cloud platform is hosted and a geographic region of the remote deployment. An external stage object may be bound to the storage integration, wherein the external stage object facilitates a data loading operation that is currently in progress on the first storage location. In response to detecting an outage at the first storage location, the data loading operation that was in progress on the first storage location may be continued at the second storage location using the storage integration.
-
Publication No.: US11620307B2
Publication Date: 2023-04-04
Application No.: US17396576
Application Date: 2021-08-06
Applicant: Snowflake Inc.
Inventor: Subramanian Muralidhar , Polita Paulus , Sahaj Saini , Saurin Shah , Srinidhi Karthik Bisthavalli Srinivasa
IPC: G06F16/20 , G06F16/27 , G06F16/955 , G06F16/25
Abstract: Described herein are techniques for replicating external stages between deployments of e.g., a cloud data lake using a modified storage integration. The modified storage integration may be defined with multiple storage locations that it can point to, as well as a designation of an active storage location. The storage integration may also be defined with base file paths for each storage location as well as a relative file path which together may serve to synchronize data loading operations between deployments when e.g., a fail-over occurs from one deployment to another. The storage integration may be replicated from a first deployment to a second deployment, and when database replication occurs, an external stage may be replicated to the second deployment and bound to the replicated storage integration. Thus, a fail-over to the second deployment may result in a seamless transition of data loading processes to the second deployment.
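The combination of per-location base file paths, a shared relative file path, and an active-location designation can be illustrated with a small sketch. The `StorageIntegration` class, its method names, and the bucket URLs in the usage note are hypothetical and are not taken from the patent.

```python
from dataclasses import dataclass


@dataclass
class StorageIntegration:
    """Hypothetical storage integration with several storage locations."""
    base_paths: dict  # location name -> base file path
    active: str       # name of the currently active storage location

    def resolve(self, relative_path: str) -> str:
        """Join the active location's base path with the relative path."""
        base = self.base_paths[self.active].rstrip("/")
        return base + "/" + relative_path.lstrip("/")

    def fail_over(self, location: str) -> None:
        """Designate another configured location as the active one."""
        if location not in self.base_paths:
            raise KeyError(f"unknown storage location: {location}")
        self.active = location
```

With `base_paths={"primary": "s3://bucket-a/data", "secondary": "s3://bucket-b/data"}` (made-up URLs), the same relative path resolves under `bucket-a` before `fail_over("secondary")` and under `bucket-b` afterwards, which is how a load could resume at the same relative position.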
-
Publication No.: US11522860B2
Publication Date: 2022-12-06
Application No.: US17587829
Application Date: 2022-01-28
Applicant: Snowflake Inc.
Inventor: Polita Paulus , Peter Povinec , Saurin Shah , Srinidhi Karthik Bisthavalli Srinivasa
Abstract: A command to load or unload data at a storage location is received. In response to the command, a storage integration object associated with the storage location is identified. The storage integration object identifies a cloud identity object that corresponds to a cloud identity that is associated with a proxy identity object corresponding to a proxy identity granted permission to access the storage location. The data is loaded or unloaded at the storage location by assuming the proxy identity.
-
Publication No.: US11507571B2
Publication Date: 2022-11-22
Application No.: US16385720
Application Date: 2019-04-16
Applicant: Snowflake Inc.
Inventor: Subramanian Muralidhar , Benoit Dageville , Thierry Cruanes , Nileema Shingte , Saurin Shah , Torsten Grabs , Istvan Cseri
IPC: G06F16/23 , G06F16/248 , G06F16/2458 , G06F3/06 , G06F16/25
Abstract: Systems, methods, and devices for generating a materialized view over an external table. A method includes connecting a database platform to an external table such that the database platform has read access for the external table and does not have write access for the external table. The method includes generating, by the database platform, a materialized view over the external table. The method includes receiving a notification that a modification has been made to the external table, the modification comprising one or more of an addition of a file, a deletion of a file, or an update to a file in a source location for the external table. The method includes, in response to the external table being modified, refreshing the materialized view such that the materialized view comprises an accurate representation of the external table.
-
Publication No.: US11494438B2
Publication Date: 2022-11-08
Application No.: US17645415
Application Date: 2021-12-21
Applicant: Snowflake Inc.
Inventor: Polita Paulus , Aravind Ramarathinam , Saurin Shah , Sukruth Komarla Sukumar
IPC: G06F16/00 , G06F16/901 , G06F16/955 , G06F16/2455 , G06F16/22 , G06F16/908
Abstract: Disclosed herein are systems and methods for population of a file-catalog table for a file stage in a user account on a data platform. In an embodiment, a data platform receives, from a client associated with a user account, a request to populate a file-catalog table of the user account based on a plurality of files stored in a file stage of the user account. The data platform responsively executes a list-files table function with respect to the file stage to generate a database-table object having a row for each file stored in the file stage. The data platform populates the file-catalog table of the user account based on the database-table object generated by the list-files table function.
-
Publication No.: US20220350795A1
Publication Date: 2022-11-03
Application No.: US17812878
Application Date: 2022-07-15
Applicant: Snowflake, Inc.
Inventor: Subramanian Muralidhar , Benoit Dageville , Thierry Cruanes , Nileema Shingte , Saurin Shah , Torsten Grabs , Istvan Cseri
IPC: G06F16/242 , G06F3/06 , G06F16/23 , G06F16/16 , G06F16/25 , G06F9/54 , G06F16/22 , G06F16/2455
Abstract: Disclosed herein are systems and methods for efficiently querying external tables. In an embodiment, a database platform receives a query that is directed at least in part to external data in an external table stored on a data storage platform that is external to the database platform. The external table includes a plurality of partitions. The database platform identifies, from external-table metadata, a subset of the plurality of partitions of the external table as including data that potentially satisfies the query. The external-table metadata is stored by the database platform. The database platform identifies data that satisfies the query by scanning the identified subset of the partitions, and responds to the query at least in part with the identified data that satisfies the query.