MICRO-PARTITION CLUSTERING BASED ON EXPRESSION PROPERTY METADATA

    公开(公告)号:US20240354315A1

    公开(公告)日:2024-10-24

    申请号:US18302234

    申请日:2023-04-18

    Applicant: SNOWFLAKE INC.

    CPC classification number: G06F16/285 G06F16/24556

    Abstract: A method for selecting micro-partitions for a clustering operation includes: storing table data in a plurality of micro-partitions of a storage device, wherein each of the plurality of micro-partitions comprises a portion of the table data, wherein subsets of the plurality of micro-partitions are associated with a respective one of a plurality of expression property (EP) files, and wherein each of the plurality of EP files comprises an EP data region that represents the portions of the table data of the subset of the plurality of micro-partitions associated with the EP file; determining sub-ranges of the table data based on the EP data regions of the plurality of EP files; selecting a subset of the plurality of EP files for a clustering operation based on the sub-ranges of the table data; and performing the clustering operation on the micro-partitions associated with the subset of the EP files.

Patent Agency Ranking