ATOMIC CACHE MANAGEMENT OF FILE COLLECTIONS
    11.
    发明公开

    公开(公告)号:US20230315693A1

    公开(公告)日:2023-10-05

    申请号:US17657229

    申请日:2022-03-30

    Applicant: Snowflake Inc.

    CPC classification number: G06F16/1774 G06F16/172 G06F16/1734

    Abstract: Systems and methods for caching file system collections atomically. The systems and methods perform operations comprising: receiving a request to access a collection of data comprising a plurality of files stored on a storage device; in response to receiving the request, transferring the collection of data from the storage device to a first subfolder in a cache associated with the storage device; generating a lock file comprising a reference to the first subfolder in the cache; and atomically controlling access to the collection of data in the first subfolder and removal of the collection of data in the first subfolder from the cache via the lock file.

    Parallel scan of single file using multiple threads

    公开(公告)号:US11586621B1

    公开(公告)日:2023-02-21

    申请号:US17586493

    申请日:2022-01-27

    Applicant: Snowflake Inc.

    Abstract: Multiple execution threads process a query directed to a database organized into a plurality of files. In processing the query, a first thread downloads a file from the plurality of files. The file comprises a set of blocks. A parallel scan of the set of blocks is performed by at least the first thread and a second thread to identify data that matches the query. A response to the query is provided based in part on the parallel scan of the set of blocks.

    EFFICIENT DATABASE QUERY EVALUATION

    公开(公告)号:US20210240670A1

    公开(公告)日:2021-08-05

    申请号:US16779366

    申请日:2020-01-31

    Applicant: Snowflake Inc

    Abstract: Data in a micro-partition of a table is stored in a compressed form. In response to a database query on the table comprising a filter, the portion of the data on which the filter operates is decompressed, without decompressing other portions of the data. Using the filter on the decompressed portion of the data, the portions of the data that are responsive to the filter are determined and decompressed. The responsive data is returned in response to the database query. When a query is run on a table that is compressed using dictionary compression, the uncompressed data may be returned along with the dictionary look-up values. The recipient of the data may use the dictionary look-up values for memoization, reducing the amount of computation required to process the returned data.

Patent Agency Ranking