Scan-based merge for analytical query processing in HTAP systems using delete vectors

    Publication No.: US12135700B1

    Publication Date: 2024-11-05

    Application No.: US18460206

    Application Date: 2023-09-01

    Applicant: Snowflake Inc.

    Abstract: The subject technology receives a query, the query including a query range for processing the query and a set of requested columns. Based on the query range, the subject technology determines a set of blob files and a set of delete vectors. For each blob file, the subject technology stores each row, including the set of requested columns, into an array of rowsets. For each rowset, the subject technology generates a delete bitset to at least indicate whether each row has been deleted. For each delta file, the subject technology indicates a previous row of a visible row of the delta file as being deleted, based on a delete pointer of the visible row. The subject technology provides a set of rowsets, including a corresponding selection column set, as a result of the query.
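
    The merge described in this abstract can be illustrated with a minimal sketch. It is not the patented implementation: the `Row` type, the `(rowset index, row index)` form of the delete pointer, and the function name `scan_merge` are hypothetical, and every row is treated as visible for simplicity.

```python
from dataclasses import dataclass
from typing import Dict, List, Optional, Tuple


@dataclass
class Row:
    values: Dict[str, object]
    # Hypothetical delete pointer: (rowset index, row index) of the
    # previous version that this row supersedes, or None.
    delete_pointer: Optional[Tuple[int, int]] = None


def scan_merge(rowsets: List[List[Row]], requested: List[str]):
    """Return (projected_rows, selection_column) per rowset.

    Assumption: all rows are visible to the query, so every delete
    pointer is applied.
    """
    # One delete bitset per rowset, initially "not deleted".
    delete_bitsets = [[False] * len(rs) for rs in rowsets]

    # Mark the previous version of each row as deleted.
    for rs in rowsets:
        for row in rs:
            if row.delete_pointer is not None:
                target_rowset, target_row = row.delete_pointer
                delete_bitsets[target_rowset][target_row] = True

    # Project the requested columns and attach a selection column
    # (True means the row survives the merge).
    result = []
    for rs, bits in zip(rowsets, delete_bitsets):
        projected = [{c: row.values[c] for c in requested} for row in rs]
        selection = [not deleted for deleted in bits]
        result.append((projected, selection))
    return result


rowsets = [
    [Row({"id": 1, "v": "a"}), Row({"id": 2, "v": "b"})],
    [Row({"id": 1, "v": "a2"}, delete_pointer=(0, 0))],
]
merged = scan_merge(rowsets, ["id", "v"])
```

    In this example the newer version of `id = 1` in the second rowset marks the older version in the first rowset as deleted, so only the selection column changes and no rows are physically removed.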

    COLUMNAR CACHE IN HYBRID TRANSACTIONAL/ANALYTICAL PROCESSING (HTAP) WORKLOADS

    Publication No.: US20250068640A1

    Publication Date: 2025-02-27

    Application No.: US18787807

    Application Date: 2024-07-29

    Applicant: Snowflake Inc.

    Abstract: The subject technology receives, by an execution node, blob metadata from a key-value store, the blob metadata including information related to a set of blob files. The subject technology determines, by the execution node using the blob metadata, whether a copy of each of the set of blob files is stored in a local cache of the execution node. The subject technology transforms at least one blob file, retrieved from a blob store, to a second file in a column file format, the at least one blob file being in a first format that is different than the column file format, the transforming comprising at least converting a particular snapshot file from the at least one blob file to a particular set of rowsets and writing the set of rowsets into the second file in the column file format. The subject technology stores the second file in the local cache.
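
    The cache check and format conversion described in this abstract can be sketched as follows. This is only an illustration under stated assumptions: the blob store and local cache are modeled as plain dictionaries, a blob file is modeled as a list of row dictionaries with identical keys, and the names `to_columnar` and `ensure_cached` are hypothetical.

```python
from typing import Dict, List


def to_columnar(rows: List[Dict[str, object]]) -> Dict[str, List[object]]:
    """Convert row-oriented records into one list per column.

    Assumption: every row has the same set of keys.
    """
    columns: Dict[str, List[object]] = {}
    for row in rows:
        for name, value in row.items():
            columns.setdefault(name, []).append(value)
    return columns


def ensure_cached(blob_ids, blob_store, local_cache):
    """For each blob named in the metadata, reuse the cached columnar
    copy if present; otherwise fetch, convert, and cache it."""
    fetched = []
    for blob_id in blob_ids:
        if blob_id in local_cache:
            continue  # a columnar copy is already stored locally
        local_cache[blob_id] = to_columnar(blob_store[blob_id])
        fetched.append(blob_id)
    return fetched


blob_store = {
    "blob-1": [{"id": 1, "v": "a"}, {"id": 2, "v": "b"}],
    "blob-2": [{"id": 3, "v": "c"}],
}
local_cache = {"blob-2": {"id": [3], "v": ["c"]}}
fetched = ensure_cached(["blob-1", "blob-2"], blob_store, local_cache)
```

    Only `blob-1` is fetched and converted here, because `blob-2` is already present in the local cache.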

    Database processing using hybrid key-value tables

    Publication No.: US12222964B2

    Publication Date: 2025-02-11

    Application No.: US17661162

    Application Date: 2022-04-28

    Applicant: Snowflake Inc.

    Abstract: A distributed database system can include a transactional database and an object storage database. The data of the transactional database can be split into granules and replicated to the object storage database. The distributed database system can process transactional requests using the transactional database. The distributed database system can receive a request that reads more than a set size of data from the transactional database. The distributed database system can identify the granule data in the object storage database and transmit data to complete the read on one or more of a plurality of execution nodes.
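
    The routing idea in this abstract can be illustrated with a small sketch. It assumes, hypothetically, that read size is measured in rows and that granules are fixed-size slices of the table; the names `split_into_granules` and `route_read` are not taken from the patent.

```python
from typing import List


def split_into_granules(rows: List[dict], granule_size: int) -> List[List[dict]]:
    """Split the table data into fixed-size granules for replication."""
    return [rows[i:i + granule_size] for i in range(0, len(rows), granule_size)]


def route_read(rows_requested: int, set_size: int) -> str:
    """Send reads larger than the set size to the object storage copy;
    keep smaller reads on the transactional database."""
    if rows_requested > set_size:
        return "object_storage"
    return "transactional"


table = [{"id": i} for i in range(10)]
granules = split_into_granules(table, granule_size=4)
small_read = route_read(rows_requested=3, set_size=5)
large_read = route_read(rows_requested=8, set_size=5)
```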

    DATABASE PROCESSING USING HYBRID KEY-VALUE TABLES

    Publication No.: US20230350921A1

    Publication Date: 2023-11-02

    Application No.: US17661162

    Application Date: 2022-04-28

    Applicant: Snowflake Inc.

    Abstract: A distributed database system can include a transactional database and an object storage database. The data of the transactional database can be split into granules and replicated to the object storage database. The distributed database system can process transactional requests using the transactional database. The distributed database system can receive a request that reads more than a set size of data from the transactional database. The distributed database system can identify the granule data in the object storage database and transmit data to complete the read on one or more of a plurality of execution nodes.
