发明申请
US20090210418A1 TRANSFORMATION-BASED FRAMEWORK FOR RECORD MATCHING 有权
用于记录匹配的基于变换的框架

TRANSFORMATION-BASED FRAMEWORK FOR RECORD MATCHING
摘要:
A transformation-based record matching technique. The technique provides a flexible way to account for synonyms and more general forms of string equivalences when performing record matching by taking as explicit input user-defined transformation rules (such as, for example, the fact that “Robert” and “Bob” that are synonymous). The input string and user-defined transformation rules are used to generate a larger set of strings which are used when performing record matching. Both the input string and data elements in a database can be transformed using the user-defined transformation rules in order to generate a larger set of potential record matches. These potential record matches can then be subjected to a threshold test in order to determine one or more best matches. Additionally, signature-based similarity functions are used to improve the computational efficiency of the technique.
公开/授权文献
信息查询
0/0