Invention application
US20050021508A1 Method and apparatus for calculating similarity among documents (no longer in force)

Method and apparatus for calculating similarity among documents
Abstract:
Information indicating which individual elements (characteristic character strings) representative of a registered document's characteristics appear in that document is stored in advance. To calculate similarity against the registered document, a query designated by a searcher is analyzed and represented as a characteristic vector whose individual elements take the relations among a plurality of words into account. The appearance information for the individual words contained in the query is counted, and the counted appearance information is compared with a searching index to calculate the similarity between documents.
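The flow described in the abstract can be illustrated with a minimal Python sketch. It is not the claimed method: the abstract does not say how characteristic character strings are extracted or which similarity measure is used, so the sketch assumes that elements are single words plus adjacent word pairs (one simple way to reflect relations among words) and that similarity is the cosine between count vectors. All function names (`characteristic_strings`, `build_index`, `cosine`, `rank`) and the sample documents are hypothetical.

```python
import math
from collections import Counter


def characteristic_strings(text):
    """Hypothetical extractor (assumption): single words plus adjacent word pairs."""
    words = text.lower().split()
    pairs = [f"{a} {b}" for a, b in zip(words, words[1:])]
    return words + pairs


def build_index(documents):
    """Store in advance how often each element appears in each registered document."""
    return {
        doc_id: Counter(characteristic_strings(text))
        for doc_id, text in documents.items()
    }


def cosine(u, v):
    """Cosine similarity between two count vectors (assumed measure, not from the source)."""
    dot = sum(count * v[element] for element, count in u.items() if element in v)
    norm_u = math.sqrt(sum(c * c for c in u.values()))
    norm_v = math.sqrt(sum(c * c for c in v.values()))
    if norm_u == 0 or norm_v == 0:
        return 0.0
    return dot / (norm_u * norm_v)


def rank(query, index):
    """Count the query's elements and compare them with every indexed document."""
    query_vector = Counter(characteristic_strings(query))
    scores = {doc_id: cosine(query_vector, vec) for doc_id, vec in index.items()}
    return sorted(scores.items(), key=lambda item: item[1], reverse=True)


# Made-up registered documents, for illustration only.
registered = {
    "d1": "document similarity search with a query vector",
    "d2": "cooking recipes for a quiet weekend",
}
index = build_index(registered)
ranking = rank("similarity search", index)
```

Here `ranking` lists the document identifiers from most to least similar to the query. Including word pairs means a document that contains the query words next to each other scores higher than one that merely contains them separately.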