-
公开(公告)号:US20220374547A1
公开(公告)日:2022-11-24
申请号:US17815031
申请日:2022-07-26
Applicant: Snowflake Inc.
Inventor: Abhiram Arikapudi , Saurin Shah
IPC: G06F21/62
Abstract: Techniques for document sharing include configuring a secure view of shared data as a listing in a data exchange of a data provider. The listing includes metadata with a first uniform resource locator (URL) of the shared data. An invitation to view the listing for is encoded transmission to a client device. A user defined function (UDF) client is invoked to access a data object of the shared data, based on detecting an acceptance of the invitation by the client device. A security function is applied to the data object to generate a modified data object. The metadata is updated to include a second URL, which is associated with a storage location storing the modified data object. The client device is granted access to the modified data object using the second URL, based on detecting a request by the client device to view the listing in the data exchange.
-
公开(公告)号:US11436363B1
公开(公告)日:2022-09-06
申请号:US17546546
申请日:2021-12-09
Applicant: Snowflake Inc.
Inventor: Abhiram Arikapudi , Saurin Shah
Abstract: Provided herein are systems and methods for secure document sharing in a database system. For example, a system includes at least one hardware processor and a memory. The memory stores instructions that cause the at least one hardware processor to perform operations including receiving a query for a data set from a client device. The data set is shared in a data exchange by a data provider. The operations further include retrieving a data file responsive to the query for the data set. A security function is applied to the retrieved data file to generate a modified data file. A scoped uniform resource locator (URL) associated with the modified data file is encoded for transmission to the client device. The scoped URL includes an encrypted hash with a storage location of the modified data file.
-
公开(公告)号:US20220121673A1
公开(公告)日:2022-04-21
申请号:US17086221
申请日:2020-10-30
Applicant: Snowflake Inc.
Inventor: Vasile Paraschiv , Saurin Shah , Marianne Shaw , Nileema Shingte
IPC: G06F16/25 , G06F16/13 , G06F16/182 , G06F16/11
Abstract: A database export system exports data using a plurality of nodes that process the data to generate structured result files that are partitioned by an export parameter in an export request. The database export system distributes the data and merges the files to avoid small file creation and increase processing speed via parallelism. The database export system generates the result files of a specified maximum size in a final format, where the files are processed merged in a temporary file format. The parallel processing is optimized and constrained per the amount of processing nodes, available memory, requested final file sizes, and operation based ordering to complete data exports in a scalable multi-stage approach.
-
公开(公告)号:US20220114217A1
公开(公告)日:2022-04-14
申请号:US17645415
申请日:2021-12-21
Applicant: Snowflake Inc.
Inventor: Polita Paulus , Aravind Ramarathinam , Saurin Shah , Sukruth Komarla Sukumar
IPC: G06F16/901 , G06F16/955 , G06F16/2455 , G06F16/22 , G06F16/908
Abstract: Disclosed herein are systems and methods for population of a file-catalog table for a file stage in a user account on a data platform. In an embodiment, a data platform receives, from a client associated with a user account, a request to populate a file-catalog table of the user account based on a plurality of files stored in a file stage of the user account. The data platform responsively executes a list-files table function with respect to the file stage to generate a database-table object having a row for each file stored in the file stage. The data platform populates the file-catalog table of the user account based on the database-table object generated by the list-files table function.
-
公开(公告)号:US20220075776A1
公开(公告)日:2022-03-10
申请号:US17455798
申请日:2021-11-19
Applicant: Snowflake Inc.
Inventor: Subramanian Muralidhar , Benoit Dageville , Thierry Cruanes , Nileema Shingte , Saurin Shah , Torsten Grabs , Istvan Cseri
Abstract: Disclosed herein are systems and methods for pruning external data. In an embodiment, a database platform receives a query directed at least in part to external data in an external table on an external data storage platform. The external table includes partitions that correspond to storage locations in a source directory of the external data storage platform. The storage locations contain files that contain the external data. The database platform identifies, from external-table metadata that is stored by the database platform and that maps the partitions of the external table to the storage locations in the source directory, a subset of the partitions as including data that potentially satisfies the query. The database platform identifies data that satisfies the query by scanning the identified subset of the partitions, and responds to the query at least in part with the identified data that satisfies the query.
-
公开(公告)号:US11269868B2
公开(公告)日:2022-03-08
申请号:US17219854
申请日:2021-03-31
Applicant: Snowflake Inc.
Inventor: Subramanian Muralidhar , Benoit Dageville , Thierry Cruanes , Nileema Shingte , Saurin Shah , Torsten Grabs , Istvan Cseri
Abstract: Systems, methods, and devices for automated maintenance of external tables in database systems are disclosed. A method includes receiving, by a database platform, read access to content in an external data storage platform that is separate from the database platform. The method includes defining an external table based on the content in the external data storage platform. The method includes connecting the database platform to the external table such that the database platform has read access for the external table and does not have write access for the external table. The method includes generating metadata for the external table, the metadata comprising information about data stored in the external table. The method includes receiving a notification that a modification has been made to the content in the external data storage platform, the modification comprising one or more of an addition of a file, a deletion of a file, or an update to a file in a source location for the external table. The method includes refreshing the metadata for the external table in response to the modification being made to the content in the external data storage platform.
-
公开(公告)号:US20220021673A1
公开(公告)日:2022-01-20
申请号:US17491373
申请日:2021-09-30
Applicant: Snowflake Inc.
Inventor: Polita Paulus , Peter Povinec , Saurin Shah , Srinidhi Karthik Bisthavalli Srinivasa
Abstract: A command to load or unload data at a storage location is received. In response to the command, a storage integration object associated with the storage location is identified. The storage integration object identifies a cloud identity object that corresponds to a cloud identity that is associated with a proxy identity object corresponding to a proxy identity granted permission to access the storage location. The data is loaded or unloaded at the storage location by assuming the proxy identity.
-
公开(公告)号:US11165775B2
公开(公告)日:2021-11-02
申请号:US17249971
申请日:2021-03-19
Applicant: Snowflake Inc.
Inventor: Polita Paulus , Peter Povinec , Saurin Shah , Srinidhi Karthik Bisthavalli Srinivasa
Abstract: A command to load or unload data at a storage location is received. In response to the command, a storage integration object associated with the storage location is identified. The storage integration object identifies a cloud identity object that corresponds to a cloud identity that is associated with a proxy identity object corresponding to a proxy identity granted permission to access the storage location. The data is loaded or unloaded at the storage location by assuming the proxy identity.
-
公开(公告)号:US11138232B1
公开(公告)日:2021-10-05
申请号:US17086215
申请日:2020-10-30
Applicant: Snowflake Inc.
Inventor: Vasile Paraschiv , Saurin Shah , Marianne Shaw , Nileema Shingte
IPC: G06F16/27 , G06F16/28 , G06F9/30 , G06F16/22 , G06F16/2455
Abstract: A database export system exports data using a plurality of nodes that process the data to generate structured result files that are partitioned by an export parameter in an export request. The database export system distributes the data and merges the files to avoid small file creation and increase processing speed via parallelism. The database export system generates the result files of a specified maximum size in a final format, where the files are processed merged in a temporary file format. The parallel processing is optimized and constrained per the amount of processing nodes, available memory, requested final file sizes, and operation based ordering to complete data exports in a scalable multi-stage approach.
-
公开(公告)号:US11030191B2
公开(公告)日:2021-06-08
申请号:US16841831
申请日:2020-04-07
Applicant: Snowflake Inc.
Inventor: Subramanian Muralidhar , Benoit Dageville , Thierry Cruanes , Nileema Shingte , Saurin Shah , Torsten Grabs , Istvan Cseri
IPC: G06F16/242 , G06F3/06 , G06F16/2453 , G06F16/25 , G06F16/23
Abstract: Systems, methods, and devices for querying over an external table are disclosed. A method includes connecting a database platform to an external table such that the database platform has read access for the external table and does not have write access for the external table. The method includes receiving a query comprising a predicate, the query directed at least to data in the external table. The method includes determining, based on metadata, one or more partitions in the external table comprising data satisfying the predicate. The method includes pruning, based on the metadata, all partitions in the external table that do not comprise any data satisfying the predicate. The method includes generating a query plan comprising a plurality of discrete subtasks. The method includes assigning, based on the metadata, the plurality of discrete subtasks to one or more nodes in an execution platform.
-
-
-
-
-
-
-
-
-