发明申请
- 专利标题: HIGH PRECISION SET EXPANSION FOR LARGE CONCEPTS
- 专利标题(中): 高精度扩展大概念
-
申请号: US13325072申请日: 2011-12-14
-
公开(公告)号: US20130159317A1公开(公告)日: 2013-06-20
- 发明人: Jiewen Huang , Zhimin Chen , Arvind Arasu , Vivek Narasayya
- 申请人: Jiewen Huang , Zhimin Chen , Arvind Arasu , Vivek Narasayya
- 申请人地址: US WA Redmond
- 专利权人: MICROSOFT CORPORATION
- 当前专利权人: MICROSOFT CORPORATION
- 当前专利权人地址: US WA Redmond
- 主分类号: G06F17/30
- IPC分类号: G06F17/30
摘要:
A set expansion system is described herein that improves precision, recall, and performance of prior set expansion methods for large sets of data. The system maintains high precision and recall by 1) identifying the qualify of particular lists and applying that quality through a weight, 2) allowing for the specification or negative examples in a set of seeds to reduce the introduction of bad entities into the set, and 3) applying a cutoff to eliminate lists that include a low number of positive matches. The system may perform multiple passes to first generate a good candidate result set and then refine the set to find a set with highest quality. The system may also apply Map Reduce or other distributed processing techniques to allow calculation in parallel. Thus, the system efficiently expands large concept sets from a potentially small set of initial seeds from readily available web data.
公开/授权文献
- US09547718B2 High precision set expansion for large concepts 公开/授权日:2017-01-17
信息查询