发明授权
US06038574A Method and apparatus for clustering a collection of linked documents
using co-citation analysis
失效
使用共同引用分析对链接文档集合进行聚类的方法和装置
- 专利标题: Method and apparatus for clustering a collection of linked documents using co-citation analysis
- 专利标题(中): 使用共同引用分析对链接文档集合进行聚类的方法和装置
-
申请号: US44693申请日: 1998-03-18
-
公开(公告)号: US06038574A公开(公告)日: 2000-03-14
- 发明人: James E. Pitkow , Peter L. Pirolli , Jock D. Mackinlay , Stuart K. Card
- 申请人: James E. Pitkow , Peter L. Pirolli , Jock D. Mackinlay , Stuart K. Card
- 申请人地址: CT Stamford
- 专利权人: Xerox Corporation
- 当前专利权人: Xerox Corporation
- 当前专利权人地址: CT Stamford
- 主分类号: G06F17/30
- IPC分类号: G06F17/30 ; G06F17/21
摘要:
The method and apparatus of the present invention generates clusters of documents in a collection of linked documents based on co-citation analysis. The frequency linkage is determined for each document in the collection. In other words, the number of times each document is linked to by another document in the collection is determined. Further, a minimum frequency linkage (link frequency threshold) is specified based on a predetermined minimum frequency of document linkage. Additionally, a list of pairs of documents that are linked to by the same document is created so that each of the pairs of documents has a count of the number of times (co-citation frequency) that they are both linked to by another document. Pairs of linked documents are clustered using a suitable co-citation technique.
公开/授权文献
- US8599P Lagerstroemia indica cv. Monink 公开/授权日:1994-02-15