-
公开(公告)号:US11928110B2
公开(公告)日:2024-03-12
申请号:US18051483
申请日:2022-10-31
Applicant: Snowflake Inc.
Inventor: Srilakshmi Chintala , Chong Han , Albert L. Hu , Nitya Kumar Sharma , Igor Zinkovsky
IPC: G06F16/00 , G06F16/242 , G06F16/248 , G06F16/27 , G06F8/41
CPC classification number: G06F16/2448 , G06F16/248 , G06F16/27 , G06F8/433
Abstract: A database dependency resolver system can identify different dependencies of a user application and integrate the identified dependencies in different execution environments of a distributed database system. The different execution environments can manage different versions of a given programming language, or other types of computational architectures (e.g., different CPU types). A database user can provide a database statement (e.g., query) that activates the different dependencies in the different environments to generate results data.
-
公开(公告)号:US12174817B2
公开(公告)日:2024-12-24
申请号:US18338938
申请日:2023-06-21
Applicant: Snowflake Inc.
Inventor: Elliott Brossard , Chong Han , Igor Zinkovsky
IPC: G06F16/22 , G06F16/248 , G06F16/25 , G06F16/27
Abstract: A method includes detecting, by at least one hardware processor, an upload of a user application within a database system. The user application includes user-defined function (UDF) code. A plurality of dependencies of the user application is determined by the at least one hardware processor. A plurality of execution environments corresponding to the plurality of dependencies is generated by the at least one hardware processor. The plurality of execution environments is associated with a corresponding plurality of data types of the UDF. A database query is decoded. The database query specifies database data of a data type of the plurality of data types of the UDF. The database query is processed using at least one of the plurality of execution environments to generate results data. The at least one of the plurality of execution environments corresponds to the data type of the plurality of data types of the UDF.
-
公开(公告)号:US11983165B1
公开(公告)日:2024-05-14
申请号:US18128212
申请日:2023-03-29
Applicant: Snowflake Inc.
Inventor: Abdullah Al Mahmood , Chong Han , Ganeshan Ramachandran Iyer , Jiaxing Liang , Nithin Mahesh , Yanrui Zhang
IPC: G06F16/23 , G06F16/174 , G06F16/27
CPC classification number: G06F16/2365 , G06F16/1748 , G06F16/27
Abstract: Embodiments of the present disclosure provide techniques for deduplicating files during internal stage replication using a directory table of the replicated internal stage that is modified as a cache for storing and retrieving original file-level metadata for the replicated files. An initial list of candidate files for loading from the internal stage to a table of the target deployment is prepared based on the files listed in the internal stage, and refined using a directory table lookup. If there is any inconsistency between the files registered in the directory table and the files listed in the internal stage, the target deployment will inspect the user-defined file-level metadata to obtain original file-level metadata for each file that is present in the internal stage but not in the directory table. This information may be used during deduplication to ensure that no duplicate files are loaded.
-
公开(公告)号:US20230409552A1
公开(公告)日:2023-12-21
申请号:US18338938
申请日:2023-06-21
Applicant: Snowflake Inc.
Inventor: Elliott Brossard , Chong Han , Igor Zinkovsky
IPC: G06F16/22 , G06F16/27 , G06F16/248 , G06F16/25
CPC classification number: G06F16/2291 , G06F16/27 , G06F16/248 , G06F16/258 , G06F16/221
Abstract: A method includes detecting, by at least one hardware processor, an upload of a user application within a database system. The user application includes user-defined function (UDF) code. A plurality of dependencies of the user application is determined by the at least one hardware processor. A plurality of execution environments corresponding to the plurality of dependencies is generated by the at least one hardware processor. The plurality of execution environments is associated with a corresponding plurality of data types of the UDF. A database query is decoded. The database query specifies database data of a data type of the plurality of data types of the UDF. The database query is processed using at least one of the plurality of execution environments to generate results data. The at least one of the plurality of execution environments corresponds to the data type of the plurality of data types of the UDF.
-
公开(公告)号:US11726976B1
公开(公告)日:2023-08-15
申请号:US18060085
申请日:2022-11-30
Applicant: Snowflake Inc.
Inventor: Elliott Brossard , Chong Han , Igor Zinkovsky
IPC: G06F16/22 , G06F16/248 , G06F16/27 , G06F16/25
CPC classification number: G06F16/2291 , G06F16/221 , G06F16/248 , G06F16/258 , G06F16/27
Abstract: A database query specifies a user-defined function (UDF) to apply to one or more tables in a distributed database. The one or more tables are identified in the distributed database. The one or more tables are associated with an initial set of data types native to the distributed database. The one or more tables are converted to database data comprising data types of the UDF. The UDF is executed on the database data to generate results data. The results data is converted to table results data. The table results data is associated with the initial set of data types. The table results data is stored in the distributed database.
-
公开(公告)号:US20230401229A1
公开(公告)日:2023-12-14
申请号:US18051657
申请日:2022-11-01
Applicant: Snowflake Inc.
Inventor: Robert Bengt Benedikt Gernhardt , Chong Han , Nithin Mahesh , Aravind Ramarathinam , Saurin Shah , Yanrui Zhang
CPC classification number: G06F16/27 , G06F16/256
Abstract: The distributed database can implement unstructured data replication using an internal or external storage location. Metadata, such as a directory table that lists the unstructured files, can be replicated across different deployments, followed by replication of the staged data. Replicating the staged data can be implemented by replication of only the stage metadata or replication of the database files between the deployments.
-
公开(公告)号:US20230273918A1
公开(公告)日:2023-08-31
申请号:US18051483
申请日:2022-10-31
Applicant: Snowflake Inc.
Inventor: Srilakshmi Chintala , Chong Han , Albert L. Hu , Nitya Kumar Sharma , Igor Zinkovsky
IPC: G06F16/242 , G06F16/248 , G06F16/27 , G06F8/41
CPC classification number: G06F16/2448 , G06F16/248 , G06F16/27 , G06F8/433
Abstract: A database dependency resolver system can identify different dependencies of a user application and integrate the identified dependencies in different execution environments of a distributed database system. The different execution environments can manage different versions of a given programming language, or other types of computational architectures (e.g., different CPU types). A database user can provide a database statement (e.g., query) that activates the different dependencies in the different environments to generate results data.
-
公开(公告)号:US20250068616A1
公开(公告)日:2025-02-27
申请号:US18944963
申请日:2024-11-12
Applicant: Snowflake Inc.
Inventor: Elliott Brossard , Chong Han , Igor Zinkovsky
IPC: G06F16/22 , G06F16/248 , G06F16/25 , G06F16/27
Abstract: A method to process UDFs includes performing, by at least one hardware processor of a database system, a resolution of application components to determine a plurality of resolved components of an application and a plurality of data types corresponding to the plurality of resolved components. The method includes instantiating a plurality of UDF clients. The plurality of UDF clients are associated with the plurality of data types. The method includes detecting a received database query specifies a data type of the plurality of data types. The method includes executing the received database query at a UDF client of the plurality of UDF clients.
-
公开(公告)号:US20240168947A1
公开(公告)日:2024-05-23
申请号:US18429286
申请日:2024-01-31
Applicant: Snowflake Inc.
Inventor: Srilakshmi Chintala , Chong Han , Albert L. Hu , Nitya Kumar Sharma , Igor Zinkovsky
IPC: G06F16/242 , G06F16/248 , G06F16/27
CPC classification number: G06F16/2448 , G06F16/248 , G06F16/27 , G06F8/433
Abstract: Multiple execution environments are established for processing database statements with diverse package dependencies. A database system receives a user application and identifies, via hardware processors, multiple package dependencies required by the application, including at least a first and a second package dependency. Upon receiving a database statement from a user that invokes these dependencies, the system retrieves the necessary packages from a repository. It then creates a first execution environment incorporating the first package dependency and a second execution environment with the second package dependency. Utilizing the multiple execution environments, the system generates results data in response to the database statement, effectively managing and executing user-defined functions that rely on different versions or types of package dependencies within a unified database framework.
-
-
-
-
-
-
-
-