发明申请
- 专利标题: ENTITY RECOGNITION USING PROBABILITIES FOR OUT-OF-COLLECTION DATA
- 专利标题(中): 使用无法收集数据的概率的实体识别
-
申请号: US13162563申请日: 2011-06-16
-
公开(公告)号: US20120323839A1公开(公告)日: 2012-12-20
- 发明人: Emre Kiciman , Abulimiti Aji , Kuansan Wang
- 申请人: Emre Kiciman , Abulimiti Aji , Kuansan Wang
- 申请人地址: US WA Redmond
- 专利权人: Microsoft Corporation
- 当前专利权人: Microsoft Corporation
- 当前专利权人地址: US WA Redmond
- 主分类号: G06N5/02
- IPC分类号: G06N5/02
摘要:
A classifier that disambiguates among entities based on a dictionary, such as corpus of documents about those entities, is built by incorporating probabilities that an entity exists that is not in the dictionary. Given a document it is associated by the classifier with an entity. By incorporating out of collection probabilities into the classifier, a higher level of confidence in the match between an entity and a document is achieved.
公开/授权文献
信息查询