- 专利标题: Apparatus for retrieving similar documents and apparatus for extracting relevant keywords
-
申请号: US09892700申请日: 2001-06-28
-
公开(公告)号: US06671683B2公开(公告)日: 2003-12-30
- 发明人: Yuji Kanno
- 申请人: Yuji Kanno
- 优先权: JP2000-195075 20000628
- 主分类号: G06F1730
- IPC分类号: G06F1730
摘要:
Three kinds of data, i.e., a keyword frequency-of-appearance, a document length, and a keyword weight, are produced. Then, a document profile vector and a keyword profile vector are calculated. Then, by independently performing the weighted principal component analysis considering the document length and the keyword weight, a document feature vector and a keyword feature vectors are obtained. Then, documents and keywords having higher similarity to the feature vectors calculated with reference to the retrieval and extracting conditions are obtained and displayed.
公开/授权文献
信息查询