Indexed geospatial predicate search

    公开(公告)号:US12050605B2

    公开(公告)日:2024-07-30

    申请号:US17804248

    申请日:2022-05-26

    Applicant: Snowflake Inc.

    Abstract: Provided herein are systems and methods for indexed geospatial predicate search. An example method performed by at least one hardware processor includes decoding a query with a geospatial predicate. The geospatial predicate is configured between a geography data column and a constant geography object. The method further includes computing a first covering for a data value of a plurality of data values in the geography data column. The first covering includes a first set of cells in a hierarchical grid representation of a geography. The first set of cells represents a surface of the geography associated with the data value. A second covering is computed for the constant geography object. A determination is made on whether to prune at least one partition of a database organized into a set of partitions and including the geography data column based on a comparison between the first covering and the second covering.

    REGULAR EXPRESSION SEARCH QUERY PROCESSING USING PRUNING INDEX

    公开(公告)号:US20230342362A1

    公开(公告)日:2023-10-26

    申请号:US18305993

    申请日:2023-04-24

    Applicant: Snowflake Inc.

    CPC classification number: G06F16/24557 G06F16/9035 G06F16/283 G06F16/2272

    Abstract: A query directed at a source table organized into a set of batch units is received. The query comprises a regular expression search pattern. The regular expression search pattern is converted to a pruning index predicate comprising a set of substring literals extracted from the regular expression search pattern. A set of N-grams is generated based on the set of substring literals extracted from the regular expression search pattern. A pruning index associated with the source table is accessed. The pruning index indexes distinct N-grams in each column of the source table. A subset of batch units to scan for data matching the query are identified based on the pruning index and the set of N-grams. The query is processed by scanning the subset of batch units.

    Indexed regular expression search with N-grams

    公开(公告)号:US11681708B2

    公开(公告)日:2023-06-20

    申请号:US17934977

    申请日:2022-09-23

    Applicant: Snowflake Inc.

    CPC classification number: G06F16/24557 G06F16/2272 G06F16/283 G06F16/9035

    Abstract: A query directed at a source table organized into a set of batch units is received. The query comprises a regular expression search pattern. The regular expression search pattern is converted to a pruning index predicate comprising a set of substring literals extracted from the regular expression search pattern. A set of N-grams is generated based on the set of substring literals extracted from the regular expression search pattern. A pruning index associated with the source table is accessed. The pruning index indexes distinct N-grams in each column of the source table. A subset of batch units to scan for data matching the query are identified based on the pruning index and the set of N-grams. The query is processed by scanning the subset of batch units.

    INDEXED REGULAR EXPRESSION SEARCH WITH N-GRAMS

    公开(公告)号:US20230084069A1

    公开(公告)日:2023-03-16

    申请号:US17934977

    申请日:2022-09-23

    Applicant: Snowflake Inc.

    Abstract: A query directed at a source table organized into a set of batch units is received. The query comprises a regular expression search pattern. The regular expression search pattern is converted to a pruning index predicate comprising a set of substring literals extracted from the regular expression search pattern. A set of N-grams is generated based on the set of substring literals extracted from the regular expression search pattern. A pruning index associated with the source table is accessed. The pruning index indexes distinct N-grams in each column of the source table. A subset of batch units to scan for data matching the query are identified based on the pruning index and the set of N-grams. The query is processed by scanning the subset of batch units.

    INDEXED GEOSPATIAL PREDICATE SEARCH

    公开(公告)号:US20220284025A1

    公开(公告)日:2022-09-08

    申请号:US17804248

    申请日:2022-05-26

    Applicant: Snowflake Inc.

    Abstract: Provided herein are systems and methods for indexed geospatial predicate search. An example method performed by at least one hardware processor includes decoding a query with a geospatial predicate. The geospatial predicate is configured between a geography data column and a constant geography object. The method further includes computing a first covering for a data value of a plurality of data values in the geography data column. The first covering includes a first set of cells in a hierarchical grid representation of a geography. The first set of cells represents a surface of the geography associated with the data value. A second covering is computed for the constant geography object. A determination is made on whether to prune at least one partition of a database organized into a set of partitions and including the geography data column based on a comparison between the first covering and the second covering.

Patent Agency Ranking