-
公开(公告)号:US20210208888A1
公开(公告)日:2021-07-08
申请号:US17212560
申请日:2021-03-25
Applicant: Intel Corporation
Inventor: Hao CHANG , Xiang WANG , Yang HONG , Hongjun NI , Baoqian LI
Abstract: Methods, software, and apparatus for implementing a high-performance multi-literal matching algorithm. Under aspects of a method, multi-literal matching is used during front-end processing to identify match candidates based on suffix patterns. An extended SHIFT-OR algorithm is performed using the chunk of data as an input to identify match candidates for the suffix patterns, wherein the extended SHIFT-OR algorithm shifts match indicia in the plurality of rows across predetermined bit boundaries and aligns the match indicia corresponding to target suffixes into single columns. Match candidates are indicated when OR'ed column values yield a ‘0’. Match verification for one or more search strings containing the suffix patterns is then performed by a back-end. The scheme eliminates false positives at the predetermined bit boundaries, providing substantial improvement in front-end accuracy and performance.
-
公开(公告)号:US20240184578A1
公开(公告)日:2024-06-06
申请号:US18440818
申请日:2024-02-13
Applicant: Intel Corporation
Inventor: Hao CHANG , Geoffrey LANGDALE , Xiang WANG , Yang HONG , Wenjun ZHU , Kun QIU , Xusheng LU
CPC classification number: G06F9/30038 , G06F9/30032 , G06F9/30043 , G06F9/3887 , G06F15/8084
Abstract: Methods and embodiments of a high-performance parallel multi-literal matching algorithm called NeoHarry. A chunk of data comprising a character string comprising n bytes is sampled for a byte stream, and data in the sampled chunk are pre-shifted to create shifted copies of data at multiple sampled locations. A mask table is generated having column vectors containing match indicia identifying potential character matches. A look up of the mask table at multiple sampled locations using the pre-shifted data is performed for a target literal character pattern. The mask table lookup results are combined to generate match candidates and exact match verification is performed to identify any generated match candidates that match the target literal character pattern. NeoHarry uses a column-vector-based shift-or model and implements a cross-domain shift algorithm under which character patterns spanning two domains are identified.
-
公开(公告)号:US20220113969A1
公开(公告)日:2022-04-14
申请号:US17559752
申请日:2021-12-22
Applicant: Intel Corporation
Inventor: Hao CHANG , Xiang WANG , Yang HONG , Wenjun ZHU , Kun QIU , Baoqian LI
Abstract: Examples include techniques four use of a large scale multi-literal matching algorithm. Implementation of the large scale multi-literal matching algorithm includes processing a chunk of input data via performance of a SHIFT-OR operation using the chunk of input data to identify a match candidate for a target literal character pattern. A single input multiple data (SIMD) instruction may be utilized by a processor to perform the SHIFT-OR operation as a parallel table lookup of rows of SHIFT-OR mask table for the chunk of input data to facilitate identification of the match candidate.
-
公开(公告)号:US20220171628A1
公开(公告)日:2022-06-02
申请号:US17559696
申请日:2022-02-17
Applicant: Intel Corporation
Inventor: Yang HONG , Hao CHANG , Xiang WANG , Wenjun ZHU , Kun QIU
IPC: G06F9/38
Abstract: Examples include techniques to encode and decode for character class matching. A character class including a plurality of characters may be encoded in a mask table. A sampled chunk of input data is used to perform a decode operation to check, in a parallel manner, each character included in the input data with the encoded mask table. The checking of each character in the input data to determine whether at least one character in the input data matches a character included in the character class.
-
-
-