-
Publication No.: US20200334240A1
Publication Date: 2020-10-22
Application No.: US16385837
Application Date: 2019-04-16
Applicant: Snowflake Inc.
Inventor: Subramanian Muralidhar , Benoit Dageville , Thierry Cruanes , Nileema Shingte , Saurin Shah , Torsten Grabs , Istvan Cseri
Abstract: Systems, methods, and devices for automated maintenance of external tables in database systems are disclosed. A method includes receiving, by a database platform, read access to content in an external data storage platform that is separate from the database platform. The method includes defining an external table based on the content in the external data storage platform. The method includes connecting the database platform to the external table such that the database platform has read access for the external table and does not have write access for the external table. The method includes generating metadata for the external table, the metadata comprising information about data stored in the external table. The method includes receiving a notification that a modification has been made to the content in the external data storage platform, the modification comprising one or more of an addition of a file, a deletion of a file, or an update to a file in a source location for the external table. The method includes refreshing the metadata for the external table in response to the modification being made to the content in the external data storage platform.
-
Publication No.: US12235835B2
Publication Date: 2025-02-25
Application No.: US18526666
Application Date: 2023-12-01
Applicant: Snowflake Inc.
Inventor: Subramanian Muralidhar , Benoit Dageville , Thierry Cruanes , Nileema Shingte , Saurin Shah , Torsten Grabs , Istvan Cseri
IPC: G06F16/20 , G06F3/06 , G06F9/54 , G06F16/16 , G06F16/22 , G06F16/23 , G06F16/242 , G06F16/2455 , G06F16/25
Abstract: Systems and methods for efficiently querying external tables are described herein. In an embodiment, a database platform receives a query that is directed at least in part to external data in an external table stored on a data storage platform that is external to the database platform. The external table includes a plurality of partitions. The database platform identifies, from external-table metadata, a subset of the plurality of partitions of the external table as including data that potentially satisfies the query. The external-table metadata is stored by the database platform. The database platform identifies data that satisfies the query by scanning the identified subset of the partitions, and responds to the query at least in part with the identified data that satisfies the query.
-
Publication No.: US20240419685A1
Publication Date: 2024-12-19
Application No.: US18819108
Application Date: 2024-08-29
Applicant: Snowflake Inc.
Inventor: Subramanian Muralidhar , Polita Paulus , Sahaj Saini , Saurin Shah , Srinidhi Karthik Bisthavalli Srinivasa
IPC: G06F16/27 , G06F16/25 , G06F16/955
Abstract: The embodiments described herein provide means for replicating external stages between deployments of, e.g., a cloud data lake using a modified storage integration. The modified storage integration may be defined with a set of storage locations, wherein the storage integration comprises a base URL for each of the set of storage locations and wherein each storage location identifies a remote deployment where a cloud platform is hosted and a geographic region of the remote deployment. An external stage object may be bound to the storage integration, wherein the external stage object facilitates a data loading operation that is currently in progress on the first storage location. In response to detecting an outage at the first storage location, the data loading operation that was in progress on the first storage location may be continued at the second storage location using the storage integration.
-
Publication No.: US11922328B1
Publication Date: 2024-03-05
Application No.: US18472883
Application Date: 2023-09-22
Applicant: Snowflake Inc.
Inventor: Michal Gdak , Ganeshan Ramachandran Iyer , Tomasz Malisz , Mikolaj Niedbala , Pawel Pollak , Saurin Shah , Jan Tomasz Topinski , Daria Wieteska
IPC: G06N5/022
CPC classification number: G06N5/022
Abstract: Systems and methods for generating a machine-learning (ML) model for extracting information from one or more electronic documents, where the ML model can be used as a data object, which can be part of a database command or as part of a document information extraction process that is continuously running (e.g., document information extraction pipeline).
-
Publication No.: US20230409636A1
Publication Date: 2023-12-21
Application No.: US18461897
Application Date: 2023-09-06
Applicant: Snowflake Inc.
Inventor: Polita Paulus , Aravind Ramarathinam , Saurin Shah , Sukruth Komarla Sukumar
IPC: G06F16/901 , G06F16/955 , G06F16/2455 , G06F16/22 , G06F16/908
CPC classification number: G06F16/9017 , G06F16/955 , G06F16/24568 , G06F16/2282 , G06F16/908
Abstract: Disclosed herein are embodiments of systems and methods for synchronizing a file-catalog table with a file stage. In an embodiment, a data platform receives a notification of a modification to one or more files in a file stage. The file stage includes data storage having a storage location. The data platform updates, based on receiving the notification of the modification, a first file-catalog table for the file stage based on the modification. The first file-catalog table includes a row for each file in the file stage, as well as a column for each of one or more metadata properties of the one or more files in the file stage.
-
Publication No.: US11841849B2
Publication Date: 2023-12-12
Application No.: US17812878
Application Date: 2022-07-15
Applicant: Snowflake Inc.
Inventor: Subramanian Muralidhar , Benoit Dageville , Thierry Cruanes , Nileema Shingte , Saurin Shah , Torsten Grabs , Istvan Cseri
IPC: G06F16/20 , G06F16/242 , G06F3/06 , G06F16/23 , G06F16/16 , G06F16/25 , G06F9/54 , G06F16/22 , G06F16/2455
CPC classification number: G06F16/2423 , G06F3/0605 , G06F3/067 , G06F3/0644 , G06F3/0653 , G06F9/542 , G06F16/164 , G06F16/2282 , G06F16/2358 , G06F16/2393 , G06F16/24557 , G06F16/256
Abstract: In an embodiment, a database platform receives a query that is directed at least in part to external data in an external table stored on a data storage platform that is external to the database platform. The external table includes a plurality of partitions. The database platform identifies, from external-table metadata, a subset of the plurality of partitions of the external table as including data that potentially satisfies the query. The external-table metadata is stored by the database platform. The database platform identifies data that satisfies the query by scanning the identified subset of the partitions, and responds to the query at least in part with the identified data that satisfies the query.
-
Publication No.: US11797608B2
Publication Date: 2023-10-24
Application No.: US18049695
Application Date: 2022-10-26
Applicant: Snowflake Inc.
Inventor: Polita Paulus , Aravind Ramarathinam , Saurin Shah , Sukruth Komarla Sukumar
IPC: G06F16/00 , G06F16/901 , G06F16/955 , G06F16/2455 , G06F16/22 , G06F16/908
CPC classification number: G06F16/9017 , G06F16/2282 , G06F16/24568 , G06F16/908 , G06F16/955
Abstract: Disclosed herein are embodiments of systems and methods for synchronizing a file-catalog table with a file stage. In an embodiment, a data platform receives a notification of a modification to one or more files in a file stage. The file stage includes data storage having a storage location. The data platform updates, based on receiving the notification of the modification, a first file-catalog table for the file stage based on the modification. The first file-catalog table includes a row for each file in the file stage, as well as a column for each of one or more metadata properties of the one or more files in the file stage.
-
Publication No.: US20230229802A1
Publication Date: 2023-07-20
Application No.: US18189558
Application Date: 2023-03-24
Applicant: Snowflake Inc.
Inventor: Abhiram Arikapudi , Saurin Shah
CPC classification number: G06F21/6227 , G06F21/31 , G06F21/44 , G06F21/46 , G06F21/78 , G06F21/6245
Abstract: Techniques for secure sharing of stage data include generating a listing in a first data exchange of a data provider. The listing includes stage data stored in a stage of the data provider and metadata associated with the stage data. A secure view of the listing is configured in a second data exchange based on posting the listing from the first data exchange to the second data exchange. The stage data is retrieved in response to a request from a client device to view the stage data received in the second data exchange. A security function is applied to the stage data to generate modified stage data. The modified stage data is stored at a second location in the stage. The metadata is updated to reference the second location in the stage.
-
Publication No.: US11645243B2
Publication Date: 2023-05-09
Application No.: US17522276
Application Date: 2021-11-09
Applicant: Snowflake Inc.
Inventor: Elliott Brossard , Sukruth Komarla Sukumar , Isaac Kunen , Ju-yi Kuo , Jonathan Lee Leang , Edward Ma , Schuyler James Manchester , Polita Paulus , Saurin Shah , Igor Zinkovsky
IPC: G06F12/00 , G06F16/00 , G06F16/182 , G06F9/54 , G06F16/14 , G06F16/176 , G06F16/13
CPC classification number: G06F16/183 , G06F9/547 , G06F16/13 , G06F16/148 , G06F16/176
Abstract: A file access system for user defined functions (UDFs) can be implemented on a distributed database system. The system can store UDF signatures and interfaces (e.g., classes, sub-classes) that can be called by other users. Upon a UDF being called, one or more interface objects (e.g., InputStream) can be created and requests transferred to an execution node via a network channel. The execution node can implement multiple threads that are authorized and download file data from a staging location (e.g., internal stage, external stage) concurrently.
-
Publication No.: US20230070935A1
Publication Date: 2023-03-09
Application No.: US18054621
Application Date: 2022-11-11
Applicant: Snowflake Inc.
Inventor: Polita Paulus , Peter Povinec , Saurin Shah , Srinidhi Karthik Bisthavalli Srinivasa
Abstract: A command to load or unload data at a storage location is received. In response to the command, a storage integration object associated with the storage location is identified. The storage integration object identifies a cloud identity object that corresponds to a cloud identity that is associated with a proxy identity object corresponding to a proxy identity granted permission to access the storage location. The data is loaded or unloaded at the storage location by assuming the proxy identity.
-