Invention Grant
- Patent Title: Trie-based normalization of field values for matching
-
Application No.: US15884732Application Date: 2018-01-31
-
Publication No.: US11016959B2Publication Date: 2021-05-25
- Inventor: Arun Kumar Jagota , Ajitesh Jain , Dmytro Kudriavtsev
- Applicant: salesforce.com, inc.
- Applicant Address: US CA San Francisco
- Assignee: salesforce.com, inc.
- Current Assignee: salesforce.com, inc.
- Current Assignee Address: US CA San Francisco
- Agency: Dergosits & Noah LLP
- Agent Todd A. Noah
- Main IPC: G06F16/23
- IPC: G06F16/23 ; G06F16/2458 ; G06F16/2457 ; G06F16/22 ; G06F16/2452

Abstract:
A system tokenizes values stored in a field by multiple records. The system creates a trie from the tokenized values, each branch in the trie labeled with one of the tokenized values, each node storing a count indicating the number of the multiple records associated with a tokenized value sequence beginning from a root of the trie. The system tokenizes a value stored in the field by a prospective record. Beginning from the root of the trie, the system identifies each node corresponding to a token value sequence for the prospective record's tokenized value. Beginning from the most recently identified node for the prospective record's token value sequence, the system identifies each extending node which stores a count that satisfies a threshold, each identified extending node corresponding to another token value sequence. The system uses the other token value sequence to identify one of the multiple records that matches the prospective record.
Public/Granted literature
- US20190236178A1 TRIE-BASED NORMALIZATION OF FIELD VALUES FOR MATCHING Public/Granted day:2019-08-01
Information query