• 专利标题: Apparatus for retrieving similar documents and apparatus for extracting relevant keywords
  • 申请号: US09892700
    申请日: 2001-06-28
  • 公开(公告)号: US06671683B2
    公开(公告)日: 2003-12-30
  • 发明人: Yuji Kanno
  • 申请人: Yuji Kanno
  • 优先权: JP2000-195075 20000628
  • 主分类号: G06F1730
  • IPC分类号: G06F1730
Apparatus for retrieving similar documents and apparatus for extracting relevant keywords
摘要:
Three kinds of data, i.e., a keyword frequency-of-appearance, a document length, and a keyword weight, are produced. Then, a document profile vector and a keyword profile vector are calculated. Then, by independently performing the weighted principal component analysis considering the document length and the keyword weight, a document feature vector and a keyword feature vectors are obtained. Then, documents and keywords having higher similarity to the feature vectors calculated with reference to the retrieval and extracting conditions are obtained and displayed.
信息查询
0/0