发明申请
- 专利标题: REFINING A DICTIONARY FOR INFORMATION EXTRACTION
- 专利标题(中): 修改信息提取的词典
-
申请号: US13598946申请日: 2012-08-30
-
公开(公告)号: US20130318076A1公开(公告)日: 2013-11-28
- 发明人: Laura Chiticariu , Vitaly Feldman , Frederick R. Reiss , Sudeepa Roy , Huaiyu Zhu
- 申请人: Laura Chiticariu , Vitaly Feldman , Frederick R. Reiss , Sudeepa Roy , Huaiyu Zhu
- 申请人地址: US NY Armonk
- 专利权人: International Business Machines Corporation
- 当前专利权人: International Business Machines Corporation
- 当前专利权人地址: US NY Armonk
- 主分类号: G06F17/30
- IPC分类号: G06F17/30
摘要:
A method for refining a dictionary for information extraction, the operations including: inputting a set of extracted results from execution of an extractor comprising the dictionary on a collection of text, wherein the extracted results are labeled as correct results or incorrect results; processing the extracted results using an algorithm configured to set a score of the extractor above a score threshold, wherein the score threshold balances a precision and a recall of the extractor; and outputting a set of candidate dictionary entries corresponding to a full set of dictionary entries, wherein the candidate dictionary entries are candidates to be removed from the dictionary based on the extracted results.
公开/授权文献
- US08775419B2 Refining a dictionary for information extraction 公开/授权日:2014-07-08
信息查询