Patent application
- Title: Document comparison using multiple similarity measures
- Title (Chinese): 文献比较采用多重相似度测度
- Application number: US11304029
- Filing date: 2005-12-15
- Publication number: US20070143322A1
- Publication date: 2007-06-21
- Inventors: Ravi Kothari, Sougata Mukherjea
- Applicants: Ravi Kothari, Sougata Mukherjea
- Applicant address: Armonk, NY, US
- Assignee: International Business Machines Corporation
- Current assignee: International Business Machines Corporation
- Current assignee address: Armonk, NY, US
- Main classification: G06F7/00
- IPC classification: G06F7/00
Abstract:
Disclosed herein is a method for comparing documents. The method includes the steps of: determining a plurality of similarity measures; and determining an overall similarity measure for the plurality of documents, based on the plurality of similarity measures. In one embodiment, the similarity measures are chosen from the group of similarity measures consisting of semantic and reference similarity measures. When comparing documents from the chemical, biochemical or pharmaceutical domains, the determination of the similarity utilizes a determination of structural similarity of the chemical formulas described in the plurality of documents.
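The two-step idea in the abstract (compute several similarity measures, then derive one overall measure from them) can be illustrated with a minimal Python sketch. Everything concrete here is an assumption made for illustration and is not taken from the patent: bag-of-words cosine similarity stands in for the semantic measure, Jaccard overlap of cited references stands in for the reference measure, and a weighted average is used as the combination rule. The function names and weights are hypothetical.

```python
import math
from collections import Counter


def semantic_similarity(text_a: str, text_b: str) -> float:
    """Cosine similarity of word counts (assumed stand-in for a semantic measure)."""
    a = Counter(text_a.lower().split())
    b = Counter(text_b.lower().split())
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def reference_similarity(refs_a: set, refs_b: set) -> float:
    """Jaccard overlap of cited references (assumed stand-in for a reference measure)."""
    union = refs_a | refs_b
    return len(refs_a & refs_b) / len(union) if union else 0.0


def overall_similarity(measures: dict, weights: dict) -> float:
    """Weighted average of the individual measures (assumed combination rule)."""
    total = sum(weights[name] for name in measures)
    if total == 0:
        raise ValueError("weights must not sum to zero")
    return sum(weights[name] * value for name, value in measures.items()) / total


if __name__ == "__main__":
    measures = {
        "semantic": semantic_similarity("aspirin inhibits cox", "aspirin blocks cox"),
        "reference": reference_similarity({"ref1", "ref2"}, {"ref2", "ref3"}),
    }
    # Hypothetical weights; the abstract does not say how measures are combined.
    print(overall_similarity(measures, {"semantic": 0.6, "reference": 0.4}))
```

A structural similarity measure for chemical formulas, as mentioned for chemical, biochemical or pharmaceutical documents, would slot in as a third entry in `measures` with its own weight.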
Published/granted documents
- US07472121B2 Document comparison using multiple similarity measures (publication/grant date: 2008-12-30)