-
Publication No.: US12050605B2
Publication date: 2024-07-30
Application No.: US17804248
Filing date: 2022-05-26
Applicant: Snowflake Inc.
Inventor: Matthias Carl Adams, Mahmud Allahverdiyev, Ismail Oukid, Peter Popov, Alejandro Salinger
IPC: G06F16/22, G06F16/2455, G06F16/28, G06F16/9035, G06F17/18
CPC classification number: G06F16/24557, G06F16/2272, G06F16/283, G06F16/9035, G06F17/18
Abstract: Provided herein are systems and methods for indexed geospatial predicate search. An example method performed by at least one hardware processor includes decoding a query with a geospatial predicate. The geospatial predicate is configured between a geography data column and a constant geography object. The method further includes computing a first covering for a data value of a plurality of data values in the geography data column. The first covering includes a first set of cells in a hierarchical grid representation of a geography. The first set of cells represents a surface of the geography associated with the data value. A second covering is computed for the constant geography object. A determination is made on whether to prune at least one partition of a database organized into a set of partitions and including the geography data column based on a comparison between the first covering and the second covering.
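The covering comparison described in this abstract can be illustrated with a minimal sketch. It assumes, purely for illustration, that grid cells are quadkey-style strings in which a parent cell is a prefix of its children, and that coverings are already computed; the abstract does not specify a cell encoding, and the names `cells_overlap`, `coverings_intersect`, and `partitions_to_scan` are hypothetical.

```python
from typing import Dict, Iterable, List, Set


def cells_overlap(a: str, b: str) -> bool:
    """Assumed encoding: a cell is an ancestor of another iff it is a prefix of it.

    Two cells overlap when they are equal or one contains the other.
    """
    return a.startswith(b) or b.startswith(a)


def coverings_intersect(first: Iterable[str], second: Iterable[str]) -> bool:
    """Return True if any cell of the first covering overlaps any cell of the second."""
    second_cells = list(second)
    return any(cells_overlap(a, b) for a in first for b in second_cells)


def partitions_to_scan(
    partition_coverings: Dict[str, List[Set[str]]],
    constant_covering: Set[str],
) -> List[str]:
    """Keep a partition only if some data value's covering meets the constant's covering.

    partition_coverings maps a (hypothetical) partition name to the coverings
    of the geography values stored in that partition.
    """
    keep = []
    for name, value_coverings in partition_coverings.items():
        if any(coverings_intersect(c, constant_covering) for c in value_coverings):
            keep.append(name)
    return keep


if __name__ == "__main__":
    partitions = {
        "p0": [{"012", "013"}],
        "p1": [{"30"}],
        "p2": [{"2"}, {"1103"}],
    }
    constant = {"0131", "110"}
    print(partitions_to_scan(partitions, constant))  # p1 is pruned
```

Because a covering over-approximates the surface it represents, a non-empty intersection only means the partition might contain a match; an empty intersection is what justifies pruning.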
-
Publication No.: US20240330300A1
Publication date: 2024-10-03
Application No.: US18738252
Filing date: 2024-06-10
Applicant: Snowflake Inc.
Inventor: Matthias Carl Adams, Mahmud Allahverdiyev, Ismail Oukid, Peter Popov, Alejandro Salinger
IPC: G06F16/2455, G06F16/22, G06F16/28, G06F16/9035, G06F17/18
CPC classification number: G06F16/24557, G06F16/2272, G06F16/283, G06F16/9035, G06F17/18
Abstract: A method to perform an indexed geospatial search includes retrieving, by at least one hardware processor, a query specifying a geography data column and a constant geography object. A first plurality of hash functions of a first set of cells covering a surface associated with the geography data column is determined. A search index of a database including the geography data column is updated based on the first plurality of hash functions to obtain an updated search index. The query is executed on a reduced scan set of the database. The reduced scan set is based on the updated search index.
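One way to picture a hash-based search index over covering cells is a per-partition Bloom-style filter. The sketch below is an assumption-laden illustration, not the claimed method: it assumes quadkey-style cell strings (a parent is a prefix of its children), derives several hash functions by salting SHA-256, and uses hypothetical names (`CellFilter`, `reduced_scan_set`) and illustrative sizes.

```python
import hashlib
from typing import Dict, Iterable, List

NUM_BITS = 1024  # bits per partition filter (illustrative)
NUM_HASHES = 3   # number of hash functions (illustrative)


def bit_positions(term: str) -> List[int]:
    """Derive NUM_HASHES bit positions for a term by salting SHA-256 with a seed."""
    positions = []
    for seed in range(NUM_HASHES):
        digest = hashlib.sha256(f"{seed}:{term}".encode("utf-8")).digest()
        positions.append(int.from_bytes(digest[:8], "big") % NUM_BITS)
    return positions


class CellFilter:
    """Bloom-style filter over the covering cells of one partition."""

    def __init__(self) -> None:
        self.bits = 0  # a Python int used as a bit array

    def _add(self, term: str) -> None:
        for pos in bit_positions(term):
            self.bits |= 1 << pos

    def _might_contain(self, term: str) -> bool:
        return all((self.bits >> pos) & 1 for pos in bit_positions(term))

    def add_cell(self, cell: str) -> None:
        """Index the cell itself ("C:") and the cell plus its ancestors ("A:")."""
        self._add("C:" + cell)
        for depth in range(1, len(cell) + 1):
            self._add("A:" + cell[:depth])

    def might_overlap(self, query_cell: str) -> bool:
        """True if an indexed cell may equal, contain, or lie inside query_cell."""
        if self._might_contain("A:" + query_cell):
            return True  # an indexed cell is equal to or inside the query cell
        # Otherwise check whether an indexed cell is a proper ancestor of the query cell.
        return any(
            self._might_contain("C:" + query_cell[:depth])
            for depth in range(1, len(query_cell))
        )


def reduced_scan_set(index: Dict[str, CellFilter], query_covering: Iterable[str]) -> List[str]:
    """Return the partitions whose filter may overlap the query covering."""
    cells = list(query_covering)
    return [name for name, flt in index.items() if any(flt.might_overlap(c) for c in cells)]
```

A filter of this kind can report false positives but never false negatives, so a partition left out of the reduced scan set cannot contain an overlapping cell under the assumed encoding.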
-
Publication No.: US11989184B2
Publication date: 2024-05-21
Application No.: US18305993
Filing date: 2023-04-24
Applicant: Snowflake Inc.
Inventor: Thierry Cruanes, Ismail Oukid, Stefan Richter, Alejandro Salinger
IPC: G06F16/24, G06F16/22, G06F16/2455, G06F16/28, G06F16/9035
CPC classification number: G06F16/24557, G06F16/2272, G06F16/283, G06F16/9035
Abstract: A query directed at a source table organized into a set of batch units is received. The query comprises a regular expression search pattern. The regular expression search pattern is converted to a pruning index predicate comprising a set of substring literals extracted from the regular expression search pattern. A set of N-grams is generated based on the set of substring literals extracted from the regular expression search pattern. A pruning index associated with the source table is accessed. The pruning index indexes distinct N-grams in each column of the source table. A subset of batch units to scan for data matching the query are identified based on the pruning index and the set of N-grams. The query is processed by scanning the subset of batch units.
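The steps in this abstract (regular expression, substring literals, N-grams, pruning index lookup) can be sketched as follows. This is a simplified illustration under stated assumptions: the literal extractor handles only the metacharacters `. * + ? ^ $` and gives up (no pruning) on anything else, N is fixed at 3, the index is a plain set of N-grams per batch unit, and all function names are hypothetical.

```python
import re
from typing import Dict, List, Set

N = 3  # N-gram length (illustrative)
UNSUPPORTED = set("|()[]{}\\")  # constructs this sketch does not try to analyze


def extract_literals(pattern: str) -> List[str]:
    """Return substrings that every match of the pattern must contain.

    Returns an empty list (meaning: do not prune) for unsupported constructs.
    """
    if any(ch in UNSUPPORTED for ch in pattern):
        return []
    literals, current = [], ""
    for ch in pattern:
        if ch in "*?":
            current = current[:-1]  # the preceding character becomes optional
        if ch in "*?+.^$":
            if current:
                literals.append(current)
            current = ""
        else:
            current += ch
    if current:
        literals.append(current)
    return literals


def ngrams(text: str) -> Set[str]:
    """All distinct substrings of length N; empty if the text is shorter than N."""
    return {text[i:i + N] for i in range(len(text) - N + 1)}


def build_index(batch_units: Dict[str, List[str]]) -> Dict[str, Set[str]]:
    """Map each batch unit to the distinct N-grams of its column values."""
    return {
        name: set().union(*(ngrams(value) for value in values))
        for name, values in batch_units.items()
    }


def candidate_batch_units(index: Dict[str, Set[str]], pattern: str) -> List[str]:
    """Batch units that contain every N-gram required by the pattern."""
    required: Set[str] = set()
    for literal in extract_literals(pattern):
        required |= ngrams(literal)
    if not required:
        return list(index)  # nothing to prune on; scan everything
    return [name for name, grams in index.items() if required <= grams]


if __name__ == "__main__":
    data = {
        "b0": ["error: disk full", "warning: low memory"],
        "b1": ["all good", "status ok"],
        "b2": ["disk error at sector 7"],
    }
    pattern = "error.*disk"
    to_scan = candidate_batch_units(build_index(data), pattern)
    matches = [v for b in to_scan for v in data[b] if re.search(pattern, v)]
    print(to_scan, matches)
```

In this example the batch unit `b2` survives pruning even though none of its rows match, because the index records which N-grams occur but not their order; the final scan with the real regular expression removes such false positives.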
-
Publication No.: US20230342362A1
Publication date: 2023-10-26
Application No.: US18305993
Filing date: 2023-04-24
Applicant: Snowflake Inc.
Inventor: Thierry Cruanes, Ismail Oukid, Stefan Richter, Alejandro Salinger
IPC: G06F16/2455, G06F16/9035, G06F16/28, G06F16/22
CPC classification number: G06F16/24557, G06F16/9035, G06F16/283, G06F16/2272
Abstract: A query directed at a source table organized into a set of batch units is received. The query comprises a regular expression search pattern. The regular expression search pattern is converted to a pruning index predicate comprising a set of substring literals extracted from the regular expression search pattern. A set of N-grams is generated based on the set of substring literals extracted from the regular expression search pattern. A pruning index associated with the source table is accessed. The pruning index indexes distinct N-grams in each column of the source table. A subset of batch units to scan for data matching the query are identified based on the pruning index and the set of N-grams. The query is processed by scanning the subset of batch units.
-
Publication No.: US11681708B2
Publication date: 2023-06-20
Application No.: US17934977
Filing date: 2022-09-23
Applicant: Snowflake Inc.
Inventor: Thierry Cruanes, Ismail Oukid, Stefan Richter, Alejandro Salinger
IPC: G06F16/24, G06F16/2455, G06F16/9035, G06F16/28, G06F16/22
CPC classification number: G06F16/24557, G06F16/2272, G06F16/283, G06F16/9035
Abstract: A query directed at a source table organized into a set of batch units is received. The query comprises a regular expression search pattern. The regular expression search pattern is converted to a pruning index predicate comprising a set of substring literals extracted from the regular expression search pattern. A set of N-grams is generated based on the set of substring literals extracted from the regular expression search pattern. A pruning index associated with the source table is accessed. The pruning index indexes distinct N-grams in each column of the source table. A subset of batch units to scan for data matching the query are identified based on the pruning index and the set of N-grams. The query is processed by scanning the subset of batch units.
-
Publication No.: US20230084069A1
Publication date: 2023-03-16
Application No.: US17934977
Filing date: 2022-09-23
Applicant: Snowflake Inc.
Inventor: Thierry Cruanes, Ismail Oukid, Stefan Richter, Alejandro Salinger
IPC: G06F16/2455, G06F16/9035, G06F16/28, G06F16/22, G06F17/18
Abstract: A query directed at a source table organized into a set of batch units is received. The query comprises a regular expression search pattern. The regular expression search pattern is converted to a pruning index predicate comprising a set of substring literals extracted from the regular expression search pattern. A set of N-grams is generated based on the set of substring literals extracted from the regular expression search pattern. A pruning index associated with the source table is accessed. The pruning index indexes distinct N-grams in each column of the source table. A subset of batch units to scan for data matching the query are identified based on the pruning index and the set of N-grams. The query is processed by scanning the subset of batch units.
-
Publication No.: US20220284025A1
Publication date: 2022-09-08
Application No.: US17804248
Filing date: 2022-05-26
Applicant: Snowflake Inc.
Inventor: Matthias Carl Adams, Mahmud Allahverdiyev, Ismail Oukid, Peter Popov, Alejandro Salinger
IPC: G06F16/2455, G06F16/9035, G06F16/28, G06F17/18, G06F16/22
Abstract: Provided herein are systems and methods for indexed geospatial predicate search. An example method performed by at least one hardware processor includes decoding a query with a geospatial predicate. The geospatial predicate is configured between a geography data column and a constant geography object. The method further includes computing a first covering for a data value of a plurality of data values in the geography data column. The first covering includes a first set of cells in a hierarchical grid representation of a geography. The first set of cells represents a surface of the geography associated with the data value. A second covering is computed for the constant geography object. A determination is made on whether to prune at least one partition of a database organized into a set of partitions and including the geography data column based on a comparison between the first covering and the second covering.
-