Processing functionality to store sparse feature sets

    Publication No.: US12204553B2

    Publication date: 2025-01-21

    Application No.: US18458425

    Filing date: 2023-08-30

    Applicant: Snowflake Inc.

    Abstract: The subject technology generates, by a database system, cell data for a particular table based on values from a source table, the values being based on raw input data, the source table comprising multiple rows and multiple columns, the raw input data comprising values in a first format, the values comprising input features corresponding to datasets included in the raw input data for machine learning models, the source table being provided by an external environment, the external environment comprising an external system from the database system. The subject technology performs a database operation to generate the particular table including table metadata, column metadata, and the generated cell data, the generated particular table comprising a second format that causes more efficient processing of data by the database system using a single query on the particular table compared to processing the raw input data from the source table.

    DISTRIBUTING LARGE AMOUNTS OF GLOBAL METADATA USING OBJECT FILES

    Publication No.: US20250013663A1

    Publication date: 2025-01-09

    Application No.: US18897473

    Filing date: 2024-09-26

    Applicant: Snowflake Inc.

    Abstract: A data dictionary generation system automatically populates and updates a data dictionary for listings offering shared data. The data listing distribution component distributes the data dictionaries to various remote deployments in a data exchange by using a global messaging framework and replication method. For example, the data listing distribution component replicates a data dictionary generated for the listing and its shared data from a source deployment to one or more destination deployments associated with various geographic regions. The data listing distribution component distributes the listing to the various remote deployments to allow for the listing, including its shared data and data dictionary, to be accessed by users within the geographic region associated with the remote deployment.

    SYMMETRIC QUERY PROCESSING IN A DATABASE CLEAN ROOM

    Publication No.: US20250013648A1

    Publication date: 2025-01-09

    Application No.: US18892783

    Filing date: 2024-09-23

    Applicant: Snowflake Inc.

    Abstract: Disclosed herein are systems and methods for query processing with restrictions in a database clean room. In an embodiment, a system receives a query directed to a combination of a first source dataset from a first database account of a distributed database and a second source dataset from a second database account of the distributed database. The system generates an approved statements table that contains database statement language that can be executed against the combination of the first and second source datasets. Based on determining that the approved statements table includes the query, the system executes the query to produce results data, and stores the results data in the first database account.
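The gating step in this abstract (execute a query only if the approved statements table includes it, then store the results in the first account) can be illustrated with a minimal sketch. Everything here is an assumption made for illustration: the `CleanRoom` class, the `normalize` helper, the whitespace-only matching rule, and the `execute` callback are hypothetical and are not taken from the patent or from any Snowflake API.

```python
# Hypothetical sketch; class and method names are illustrative, not from the patent.

def normalize(statement: str) -> str:
    """Collapse runs of whitespace so reformatted copies of a statement compare equal."""
    return " ".join(statement.split())


class CleanRoom:
    def __init__(self, approved_statements):
        # Stand-in for the approved statements table: a set of normalized statement texts.
        self.approved = {normalize(s) for s in approved_statements}
        # Stand-in for per-account result storage.
        self.results_by_account = {}

    def run(self, query, execute, account):
        """Run `query` via `execute` only if the approved set contains it, then store the rows."""
        if normalize(query) not in self.approved:
            raise PermissionError("query is not in the approved statements table")
        rows = execute(query)
        self.results_by_account.setdefault(account, []).append(rows)
        return rows
```

A caller would construct `CleanRoom` with the agreed statements and pass its own `execute` function; any query whose text is not in the approved set raises `PermissionError` before anything runs.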

    Invoking specified functionality within a sandbox process

    Publication No.: US12190181B2

    Publication date: 2025-01-07

    Application No.: US18429367

    Filing date: 2024-01-31

    Applicant: Snowflake Inc.

    Abstract: A database system configured to manage and execute stored procedures within a secure sandbox process. The system receives a response to a database query and, through an Application Programming Interface (API) executing within the sandbox process, converts the response into a remote procedure call. The sandbox process is modified to restrict communication with external networks while enabling communication with a designated execution node, which facilitates interaction between the stored procedure and database system components. The stored procedure, executing within the confines of the sandbox process, directs the API to communicate with the execution node. The execution node, in turn, submits the database query to the database system component.

    HIDDEN DATABASE OBJECTS OVER EXTERNAL DATA

    Publication No.: US20240427794A1

    Publication date: 2024-12-26

    Application No.: US18513140

    Filing date: 2023-11-17

    Applicant: Snowflake Inc.

    Abstract: The subject technology provides techniques for enabling hidden database objects, which in an example are utilized for testing and verifying new database objects against existing workloads. Hidden database objects are a mechanism for bridging that gap by running user workloads on user data in advance of exposing the feature to users. This mechanism allows placing a database object as a hidden object nested beneath a user's visible object (e.g., table, column, view, and the like). Hidden database objects can be used to verify the functionality, parity, performance, and correctness of new unreleased features.

    UNIFIED STRUCTURED AND SEMI-STRUCTURED DATA TYPES IN DATABASE SYSTEMS

    Publication No.: US20240427790A1

    Publication date: 2024-12-26

    Application No.: US18497746

    Filing date: 2023-10-30

    Applicant: Snowflake Inc.

    Abstract: The subject technology receives a query, the query referencing a unified representation for structured type data and semi-structured type data, the unified representation being provided in storage and in memory during query processing, the unified representation comprising a set of structured type fields that include a set of semi-structured typed fields that enables type safety and enforcement for the set of structured type fields, and flexibility for the set of semi-structured typed fields in a same column, the unified representation in storage including type information for the semi-structured type data as part of the semi-structured type data, the unified representation being utilized for structured type data and semi-structured type data. The subject technology processes the query using the unified representation stored in the memory, the unified representation providing performance parity between structured type data and semi-structured type data.

    Directing queries to encrypted database files

    Publication No.: US12158970B2

    Publication date: 2024-12-03

    Application No.: US18111821

    Filing date: 2023-02-20

    Applicant: Snowflake Inc.

    Abstract: A method for directing queries to encrypted database files includes acquiring a mapping that links a first encrypted file with a different encrypted file. The first encrypted file is generated based on a first encryption key. The method includes generating, by one or more processors based on the mapping, an updated mapping to link a second encrypted file with the different encrypted file. The second encrypted file is generated based on a second encryption key. The method includes directing, using the mapping, a first query to the first encrypted file when the first query arrives before generating the updated mapping. The method also includes directing, using the updated mapping, a second query to the second encrypted file when the second query arrives after generating the updated mapping.
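One way to picture the routing behavior described in this abstract is a mapping that is replaced as a whole once a file has been re-encrypted under a new key. The sketch below is only an interpretation: the abstract does not say what the "different encrypted file" is, so it is modeled here as a stable lookup key (`anchor`), and `QueryRouter`, `rotate`, and the file names are hypothetical.

```python
# Hypothetical sketch; the anchor-keyed dictionary is an assumed reading of the abstract.

class QueryRouter:
    def __init__(self, mapping):
        # mapping: anchor file -> encrypted file that queries should currently read
        self._mapping = dict(mapping)

    def rotate(self, anchor, new_file):
        """Build an updated mapping from the current one and swap it in."""
        updated = dict(self._mapping)
        updated[anchor] = new_file  # new_file was produced with the second key
        self._mapping = updated

    def route(self, anchor):
        """Return the encrypted file a query arriving now is directed to."""
        return self._mapping[anchor]


router = QueryRouter({"anchor.enc": "data_key1.enc"})
before = router.route("anchor.enc")   # query arrives before the updated mapping exists
router.rotate("anchor.enc", "data_key2.enc")
after = router.route("anchor.enc")    # query arrives after the updated mapping exists
```

Building the updated mapping as a copy and assigning it in one step means a query sees either the old mapping or the new one, never a partially updated state; that choice belongs to this sketch and is not claimed by the patent.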

    FINE-GRAINED ACCESS CONTROL VIA DATABASE ROLES

    Publication No.: US20240394395A1

    Publication date: 2024-11-28

    Application No.: US18787930

    Filing date: 2024-07-29

    Applicant: Snowflake Inc.

    Abstract: Embodiments of the present disclosure relate to sharing data using database roles. Database roles are generated within a database container of a provider account. Grants to a particular subset of the plurality of data objects of the database container may be assigned to each of the database roles, and each of the database roles is granted to a share object. The share object is mounted within a consumer account to generate an imported copy of each of the database roles. The imported copy of one or more of the database roles is granted to each of one or more account level roles of the consumer account. When a new object is added to a particular database role, it is immediately available for consumption by any account level roles to which the imported copy of the particular database role has been granted.
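The grant chain in this abstract (objects granted to database roles, database roles granted to a share, imported roles granted to account-level roles) can be sketched as a small in-memory model. All class and method names below are hypothetical and are not Snowflake APIs. The sketch also assumes that the imported copy of a role resolves its grants through a reference to the provider's role, which is one way to get the "immediately available" behavior; the abstract does not specify the mechanism.

```python
# Hypothetical sketch; all names are illustrative, not Snowflake APIs.

class ProviderDatabase:
    """Database container holding database roles and their object grants."""
    def __init__(self):
        self.role_grants = {}  # database role name -> set of object names

    def grant_object(self, db_role, obj):
        self.role_grants.setdefault(db_role, set()).add(obj)


class Share:
    """Share object to which database roles are granted."""
    def __init__(self, database):
        self.database = database
        self.db_roles = set()

    def grant_role(self, db_role):
        self.db_roles.add(db_role)


class ConsumerAccount:
    """Consumer account that mounts a share and grants imported roles to account roles."""
    def __init__(self, share):
        self.share = share       # assumption: mounting keeps a reference, not a snapshot
        self.account_roles = {}  # account-level role -> set of imported database role names

    def grant_imported_role(self, db_role, account_role):
        if db_role not in self.share.db_roles:
            raise KeyError(f"{db_role} is not part of the share")
        self.account_roles.setdefault(account_role, set()).add(db_role)

    def visible_objects(self, account_role):
        """Resolve grants at lookup time, so newly granted objects appear without re-import."""
        objs = set()
        for db_role in self.account_roles.get(account_role, ()):
            objs |= self.share.database.role_grants.get(db_role, set())
        return objs
```

Because `visible_objects` resolves grants when it is called, an object the provider adds to a database role after the share is mounted shows up for every account-level role that holds the imported role.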
