发明申请
- 专利标题: EXAMPLE-DRIVEN DESIGN OF EFFICIENT RECORD MATCHING QUERIES
- 专利标题(中): 实例 - 有效记录匹配查询的驱动设计
-
申请号: US11758202申请日: 2007-06-05
-
公开(公告)号: US20080306945A1公开(公告)日: 2008-12-11
- 发明人: Surajit Chaudhuri , Bee-Chung Chen , Venkatesh Ganti , Shriraghav Kaushik
- 申请人: Surajit Chaudhuri , Bee-Chung Chen , Venkatesh Ganti , Shriraghav Kaushik
- 申请人地址: US WA Redmond
- 专利权人: MICROSOFT CORPORATION
- 当前专利权人: MICROSOFT CORPORATION
- 当前专利权人地址: US WA Redmond
- 主分类号: G06F17/30
- IPC分类号: G06F17/30
摘要:
Example-driven creation of record matching queries. The disclosed architecture employs techniques that exploit the availability of positive (or matching) and negative (non-matching) examples to search through this space and suggest an initial record matching query. The record matching task is modeled as that of designing an operator tree obtained by composing a few primitive operators. This ensures that record matching programs be executable efficiently and scalably over large input relations. The architecture joins records across multiple (e.g., two) relations (e.g., R and S). The architecture exploits the monotonicity property of similarity functions for record matching in the relations, in that, any pair of matching records have a higher similarity value than non-matching record pairs on at least one similarity function.
公开/授权文献
- US08046339B2 Example-driven design of efficient record matching queries 公开/授权日:2011-10-25
信息查询