Invention Grant
- Patent Title: Transformation-based framework for record matching
- Application No.: US12031715
- Application Date: 2008-02-15
- Publication No.: US08032546B2
- Publication Date: 2011-10-04
- Inventor: Arvind Arasu, Surajit Chaudhuri
- Applicant: Arvind Arasu, Surajit Chaudhuri
- Applicant Address: US WA Redmond
- Assignee: Microsoft Corp.
- Current Assignee: Microsoft Corp.
- Current Assignee Address: US WA Redmond
- Agency: Lyon & Harr, LLP
- Agent: Katrina A. Lyon
- Main IPC: G06F7/00
- IPC: G06F7/00; G06F17/30

Abstract:
A transformation-based record matching technique. The technique provides a flexible way to account for synonyms and more general forms of string equivalences when performing record matching by taking as explicit input user-defined transformation rules (such as, for example, the fact that “Robert” and “Bob” are synonymous). The input string and user-defined transformation rules are used to generate a larger set of strings which are used when performing record matching. Both the input string and data elements in a database can be transformed using the user-defined transformation rules in order to generate a larger set of potential record matches. These potential record matches can then be subjected to a threshold test in order to determine one or more best matches. Additionally, signature-based similarity functions are used to improve the computational efficiency of the technique.
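
The flow described in the abstract (expand strings with user-defined rules, compare the expanded sets, apply a threshold) can be illustrated with a short Python sketch. This is not the patented method, only a minimal illustration under stated assumptions: the function names `expand`, `jaccard`, and `match`, the token-level rule dictionary, the choice of Jaccard similarity, and the 0.8 threshold are all hypothetical and do not come from the patent. The signature-based filtering mentioned in the abstract is omitted, so every record is compared exhaustively.

```python
from itertools import product


def expand(text, rules):
    """Return all variants of `text` obtained by optionally rewriting each token.

    `rules` maps a lowercase token to a set of replacement tokens
    (an assumed, simplified rule format: single token -> single token).
    """
    tokens = text.lower().split()
    # Each token may stay as-is or be replaced by any of its rule targets.
    choices = [{tok} | rules.get(tok, set()) for tok in tokens]
    return {" ".join(combo) for combo in product(*choices)}


def jaccard(a, b):
    """Token-set Jaccard similarity (an assumed similarity function)."""
    sa, sb = set(a.split()), set(b.split())
    union = sa | sb
    return len(sa & sb) / len(union) if union else 1.0


def match(query, records, rules, threshold=0.8):
    """Return (record, score) pairs whose best variant pair meets the threshold."""
    query_variants = expand(query, rules)
    results = []
    for record in records:
        # Both the query and the stored record are transformed.
        record_variants = expand(record, rules)
        score = max(
            jaccard(q, r) for q in query_variants for r in record_variants
        )
        if score >= threshold:
            results.append((record, score))
    # Best matches first.
    return sorted(results, key=lambda pair: -pair[1])


if __name__ == "__main__":
    rules = {"bob": {"robert"}}  # hypothetical rule: "Bob" -> "Robert"
    print(match("Bob Smith", ["Robert Smith", "Alice Smith"], rules))
    # prints [('Robert Smith', 1.0)]
```

Without the rule, "Bob Smith" and "Robert Smith" share only one of three distinct tokens and fall below the threshold; with it, one generated variant of the query matches the record exactly. Enumerating every variant grows exponentially with the number of rewritable tokens, which is the kind of cost the abstract's signature-based similarity functions are meant to reduce.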
Public/Granted literature
- US20090210418A1 TRANSFORMATION-BASED FRAMEWORK FOR RECORD MATCHING Public/Granted day: 2009-08-20