Granted invention patent
- Patent title: Method and apparatus for constructing a compact similarity structure and for using the same in analyzing document relevance
- Patent title (Chinese): 用于构建紧凑型相似度结构并用于分析文档相关性的方法和装置
- Application number: US11298500
- Filing date: 2005-12-12
- Publication number: US07472131B2
- Publication date: 2008-12-30
- Inventors: James G. Shanahan, Norbert Roma, David A. Evans
- Applicants: James G. Shanahan, Norbert Roma, David A. Evans
- Applicant address: Pittsburgh, PA, US
- Assignee: JustSystems Evans Research, Inc.
- Current assignee: JustSystems Evans Research, Inc.
- Current assignee address: Pittsburgh, PA, US
- Agent: Blaney Harper; Jones Day
- Main classification: G06F7/00
- IPC classification: G06F7/00; G06F17/00; G06F17/30
Abstract:
A computer-readable medium comprises a data structure for providing information about levels of similarity between pairs of N documents. The data structure comprises a plurality of entries of similarity values representing levels of similarity for a plurality of pairs of the documents. Each of the similarity values represents a level of similarity of one document of a given pair relative to the other document of the given pair. The similarity value of each entry is greater than a threshold similarity value that is greater than zero. The similarity-value entries are fewer than $N^2 - N$ in number if the similarity values are asymmetric with regard to document pairing, and fewer than $(N^2 - N)/2$ in number if the similarity values are symmetric with regard to document pairing. A method and apparatus for generating the data structure are described.
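
The thresholded structure described in the abstract can be illustrated with a minimal sketch. This is not the patented method: the Jaccard measure on word sets, the function names `jaccard` and `build_sparse_similarity`, the sample documents, and the threshold of 0.2 are all assumptions chosen for illustration. The sketch covers only the symmetric case, where keeping entries above a positive threshold leaves fewer than $(N^2 - N)/2$ stored values.

```python
from itertools import combinations


def jaccard(a, b):
    """Symmetric similarity between two token sets (an illustrative choice)."""
    if not a and not b:
        return 0.0
    return len(a & b) / len(a | b)


def build_sparse_similarity(docs, threshold):
    """Return {(i, j): similarity} for i < j, keeping only values above threshold."""
    if threshold <= 0:
        raise ValueError("threshold must be greater than zero")
    token_sets = [set(d.lower().split()) for d in docs]
    entries = {}
    # Visit each unordered pair once; this is valid because Jaccard is symmetric.
    for i, j in combinations(range(len(token_sets)), 2):
        sim = jaccard(token_sets[i], token_sets[j])
        if sim > threshold:
            entries[(i, j)] = sim
    return entries


# Hypothetical sample documents, not taken from the patent.
docs = [
    "sparse similarity matrix for documents",
    "sparse similarity structure for documents",
    "weather forecast for tomorrow",
    "stock market news today",
]
entries = build_sparse_similarity(docs, threshold=0.2)

n = len(docs)
full_symmetric = (n * n - n) // 2  # number of pairs in a dense symmetric table
print(entries)
print(len(entries), "stored entries versus", full_symmetric, "possible pairs")
```

With these sample inputs only the pair (0, 1) exceeds the threshold, so one entry is stored instead of six. For an asymmetric measure, the same idea would apply to ordered pairs, with $N^2 - N$ as the dense upper bound.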