QUERY PROCESSING USING HYBRID TABLE SECONDARY INDEXES

    Publication number: US20230401189A1

    Publication date: 2023-12-14

    Application number: US18171296

    Application date: 2023-02-17

    Applicant: Snowflake Inc.

    CPC classification number: G06F16/2272 G06F16/283

    Abstract: The subject technology obtains a read timestamp of a first transaction. The subject technology performs a first read operation on a parent table associated with the first transaction to determine a set of committed versions of the parent table. The subject technology determines whether a key exists in the parent table based on the first transaction. The subject technology, in response to the key existing in the parent table, performs a first write operation on a child table. The subject technology determines whether a duplicate key exists in the child table. The subject technology, in response to determining that there is no duplicate key in the child table, determines whether there is a conflict with the key. The subject technology, in response to determining that there is no conflict with the key, performs a second write operation on a secondary index table of the child table.
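The order of checks in this abstract can be sketched in a few lines of Python. This is only an illustrative toy, not the patented implementation: the `VersionedTable` class, the `insert_child_row` function, the status strings, and the choice to stage the first child-table write in a local variable until the checks pass are all assumptions made for the example.

```python
from dataclasses import dataclass, field


@dataclass
class VersionedTable:
    """Toy multi-version table (hypothetical): key -> list of (commit_ts, value)."""
    versions: dict = field(default_factory=dict)

    def committed_at(self, key, read_ts):
        """Return the newest value committed at or before read_ts, else None."""
        visible = [(ts, v) for ts, v in self.versions.get(key, []) if ts <= read_ts]
        return max(visible, key=lambda pair: pair[0])[1] if visible else None

    def has_newer_commit(self, key, read_ts):
        """True if some version of key was committed after read_ts."""
        return any(ts > read_ts for ts, _ in self.versions.get(key, []))

    def write(self, key, value, commit_ts):
        self.versions.setdefault(key, []).append((commit_ts, value))


def insert_child_row(parent, child, index, read_ts, commit_ts,
                     child_key, parent_key, indexed_value):
    """Walk through the abstract's steps and return a status string."""
    # Read the parent table as of the read timestamp (committed versions only)
    # and check that the referenced key exists.
    if parent.committed_at(parent_key, read_ts) is None:
        return "parent key missing"
    # First write operation on the child table. Assumption: it is staged
    # locally and only applied once the remaining checks pass.
    staged_row = {"parent": parent_key, "value": indexed_value}
    # Duplicate-key check on the child table.
    if child.committed_at(child_key, read_ts) is not None:
        return "duplicate key"
    # Conflict check: did another transaction commit this key after we read?
    if child.has_newer_commit(child_key, read_ts):
        return "conflict"
    child.write(child_key, staged_row, commit_ts)
    # Second write operation: the secondary index table of the child table.
    index.write((indexed_value, child_key), child_key, commit_ts)
    return "ok"


parent, child, index = VersionedTable(), VersionedTable(), VersionedTable()
parent.write("p1", {"name": "example"}, commit_ts=5)
print(insert_child_row(parent, child, index, 10, 11, "c1", "p1", "x"))
```

Keying the index on `(indexed_value, child_key)` is one common way to let several child rows share an indexed value; the abstract does not say how the secondary index table is keyed.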

    MULTIPLE USER DEFINED FUNCTIONS REGISTRATION
    Invention publication

    Publication number: US20230393816A1

    Publication date: 2023-12-07

    Application number: US18362114

    Application date: 2023-07-31

    Applicant: Snowflake Inc.

    CPC classification number: G06F8/315 G06F9/543

    Abstract: The subject technology identifies a set of functions included in a set of files corresponding to a library. The subject technology, for each function in the set of functions, registers the function as a user defined function (UDF). The subject technology generates a name for the function based at least in part on a predetermined prefix, wherein the predetermined prefix comprises an alphanumeric string. The subject technology generates, using at least a particular set of input parameters utilized by the function and a particular type of parameter of each input parameter of the particular set of input parameters, a particular set of source code. The subject technology stores information corresponding to the function in a metadata database. The subject technology provides access to the function in a different application.
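As a rough illustration of the registration steps in this abstract, the Python sketch below enumerates the functions of a library, builds a prefixed name for each, generates wrapper source code from the parameter names and types, and stores the result in a metadata store. The names `register_library`, `demo_lib`, and the dictionary used as the "metadata database" are hypothetical, and the wrapper format is an assumption; the abstract does not specify any of them.

```python
import inspect
import types


def register_library(library, prefix, metadata):
    """Register every plain function in `library` under a prefixed UDF name."""
    if not prefix.isalnum():
        raise ValueError("prefix must be an alphanumeric string")
    for fn_name, fn in inspect.getmembers(library, inspect.isfunction):
        udf_name = f"{prefix}_{fn_name}"
        # Collect (parameter name, parameter type) pairs from the signature.
        typed_params = []
        for param in inspect.signature(fn).parameters.values():
            if param.annotation is inspect.Parameter.empty:
                type_name = "object"
            else:
                type_name = getattr(param.annotation, "__name__", str(param.annotation))
            typed_params.append((param.name, type_name))
        arg_list = ", ".join(f"{name}: {type_name}" for name, type_name in typed_params)
        call_args = ", ".join(name for name, _ in typed_params)
        # Generate wrapper source code from the parameter names and types.
        source = (
            f"def {udf_name}({arg_list}):\n"
            f"    return {library.__name__}.{fn_name}({call_args})\n"
        )
        # A dict stands in for the metadata database.
        metadata[udf_name] = {"params": typed_params, "source": source}
    return metadata


# A tiny stand-in library with two functions.
demo_lib = types.ModuleType("demo_lib")


def add(a: int, b: int) -> int:
    return a + b


def shout(text: str) -> str:
    return text.upper()


demo_lib.add = add
demo_lib.shout = shout

registry = register_library(demo_lib, "udf1", {})
print(registry["udf1_add"]["source"])
```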

    Managing persistent database result sets

    Publication number: US11836132B2

    Publication date: 2023-12-05

    Application number: US16851800

    Application date: 2020-04-17

    Applicant: SNOWFLAKE INC.

    Abstract: A method, system, and apparatus for managing a persistent query result set are disclosed. A resource manager is disclosed that receives a query from a client over a computer network. A computer database server is disclosed that executes the query and generates a result set based on and in response to the received query. An execution platform is disclosed that divides the result set into a plurality of chunks, wherein each chunk comprises a portion of the generated result set. The resource manager adds metadata to each of the plurality of chunks. A first chunk of the plurality of chunks comprises metadata pointing to the plurality of chunks. The resource manager delivers at least the first chunk of the plurality of chunks to the client in response to the query. The execution platform offloads the remaining plurality of chunks to cloud storage in parallel.
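The chunking scheme described here can be sketched as follows. This is a minimal toy under stated assumptions: the functions `chunk_result_set` and `deliver`, the `chunk-N` key format, the metadata field names, and the dictionary standing in for cloud storage are all hypothetical, and a thread pool is just one way to model the parallel offload.

```python
from concurrent.futures import ThreadPoolExecutor


def chunk_result_set(rows, chunk_size):
    """Split rows into chunks; each chunk carries its own metadata."""
    chunks = []
    # max(..., 1) guarantees one (possibly empty) chunk for an empty result set.
    for start in range(0, max(len(rows), 1), chunk_size):
        part = rows[start:start + chunk_size]
        chunks.append({
            "metadata": {"key": f"chunk-{len(chunks)}", "row_count": len(part)},
            "rows": part,
        })
    # The first chunk's metadata points to every chunk in the result set.
    chunks[0]["metadata"]["chunk_keys"] = [c["metadata"]["key"] for c in chunks]
    return chunks


def deliver(chunks, cloud_storage):
    """Return the first chunk to the client; offload the rest in parallel."""
    first, rest = chunks[0], chunks[1:]

    def upload(chunk):
        cloud_storage[chunk["metadata"]["key"]] = chunk

    with ThreadPoolExecutor(max_workers=4) as pool:
        list(pool.map(upload, rest))
    return first


storage = {}
first_chunk = deliver(chunk_result_set(list(range(10)), chunk_size=4), storage)
print(first_chunk["metadata"]["chunk_keys"], sorted(storage))
```

With the pointer list in the first chunk, a client can start consuming rows immediately and fetch the remaining chunks from storage by key when it needs them.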

    OVERLAP QUERIES ON A DISTRIBUTED DATABASE
    Invention publication

    Publication number: US20230385284A1

    Publication date: 2023-11-30

    Application number: US17804434

    Application date: 2022-05-27

    Applicant: Snowflake Inc.

    CPC classification number: G06F16/24568 G06F16/2456 G06F16/244 G06F16/24564

    Abstract: Systems, methods, and machine-readable storage devices provide for identifying a user dataset on a distributed database. The system includes generating a similarity score dataset that indicates a similarity between the user dataset and a plurality of datasets of other users of the distributed database. The system generates a plurality of overlap queries that are configured to output overlap datasets between the user dataset and one or more of the plurality of datasets. The system further generates a results dataset by applying one or more of the plurality of overlap queries to a joined dataset comprising data from the user dataset and one of the plurality of datasets of other users on the distributed database.
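To make the idea of a similarity score dataset and an overlap query concrete, here is a small Python sketch over in-memory key sets. The abstract does not name a similarity measure or a query form, so the use of Jaccard similarity, set intersection as the overlap, and the function names `similarity_scores` and `overlap_query` are all assumptions chosen for illustration.

```python
def jaccard(a, b):
    """Jaccard similarity of two key collections (assumed measure)."""
    a, b = set(a), set(b)
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def similarity_scores(user_keys, other_datasets):
    """Score the user dataset against each other user's dataset."""
    return {name: jaccard(user_keys, keys) for name, keys in other_datasets.items()}


def overlap_query(user_keys, other_keys):
    """Return the keys present in both datasets, sorted for stable output."""
    return sorted(set(user_keys) & set(other_keys))


user = [1, 2, 3, 4]
others = {"account_b": [3, 4, 5, 6], "account_c": [9]}
scores = similarity_scores(user, others)
best = max(scores, key=scores.get)
print(best, overlap_query(user, others[best]))
```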

    Symmetric query processing in a database clean room

    Publication number: US11822554B2

    Publication date: 2023-11-21

    Application number: US18162708

    Application date: 2023-01-31

    Applicant: Snowflake Inc.

    CPC classification number: G06F16/24565 G06F16/2443 G06F16/27

    Abstract: First and second database accounts respectively share first and second source datasets with one another in a data clean room. The first database account stores a first approved-statements table that contains database statements that are permitted by the first database account to be executed against a combination of the shared first source dataset and the shared second source dataset. The second database account stores a second approved-statements table that contains database statements that are permitted by the second database account to be executed against the combination of the shared first source dataset and the shared second source dataset. One or more queries received from the first database account that are included in the second approved-statements table are processed, as are one or more queries received from the second database account that are included in the first approved-statements table.
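The symmetric check in this abstract, where each account's queries are validated against the other account's approved-statements table, can be sketched as below. The `Account` class, the `process_query` function, and the whitespace and case normalization used for matching are hypothetical; the abstract does not say how a query is compared with an approved statement.

```python
from dataclasses import dataclass, field


def normalize(statement):
    """Collapse whitespace and lowercase so formatting differences are ignored (assumption)."""
    return " ".join(statement.split()).lower()


@dataclass
class Account:
    name: str
    approved_statements: set = field(default_factory=set)

    def approve(self, statement):
        self.approved_statements.add(normalize(statement))


def process_query(requester, counterparty, query):
    """Run a requester's query only if the counterparty has approved it."""
    if normalize(query) in counterparty.approved_statements:
        return f"executed for {requester.name}"
    return "rejected"


first = Account("first")
second = Account("second")
second.approve("SELECT COUNT(*) FROM combined")
first.approve("SELECT region, COUNT(*) FROM combined GROUP BY region")

# Each side is checked against the other side's table.
print(process_query(first, second, "select count(*)  from combined"))
print(process_query(second, first, "SELECT COUNT(*) FROM combined"))
```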
