Invention Grant
- Patent Title: Indexed regular expression search with N-grams
-
Application No.: US17934977Application Date: 2022-09-23
-
Publication No.: US11681708B2Publication Date: 2023-06-20
- Inventor: Thierry Cruanes , Ismail Oukid , Stefan Richter , Alejandro Salinger
- Applicant: Snowflake Inc.
- Applicant Address: US MT Bozeman
- Assignee: Snowflake Inc.
- Current Assignee: Snowflake Inc.
- Current Assignee Address: US MT Bozeman
- Agency: Schwegman Lundberg & Woessner, P.A.
- Main IPC: G06F16/24
- IPC: G06F16/24 ; G06F16/2455 ; G06F16/9035 ; G06F16/28 ; G06F16/22

Abstract:
A query directed at a source table organized into a set of batch units is received. The query comprises a regular expression search pattern. The regular expression search pattern is converted to a pruning index predicate comprising a set of substring literals extracted from the regular expression search pattern. A set of N-grams is generated based on the set of substring literals extracted from the regular expression search pattern. A pruning index associated with the source table is accessed. The pruning index indexes distinct N-grams in each column of the source table. A subset of batch units to scan for data matching the query are identified based on the pruning index and the set of N-grams. The query is processed by scanning the subset of batch units.
Public/Granted literature
- US20230084069A1 INDEXED REGULAR EXPRESSION SEARCH WITH N-GRAMS Public/Granted day:2023-03-16
Information query