Granted invention patent
- Title: Method and system for searching documents with numbers
- Title (Chinese): 用数字搜索文件的方法和系统
- Application number: US10134406
- Filing date: 2002-04-26
- Publication number: US07010520B2
- Publication date: 2006-03-07
- Inventors: Rakesh Agrawal, Ramakrishnan Srikant
- Applicants: Rakesh Agrawal, Ramakrishnan Srikant
- Applicant address: Armonk, NY, US
- Assignee: International Business Machines Corporation
- Current assignee: International Business Machines Corporation
- Current assignee address: Armonk, NY, US
- Agent: John L. Rogitz
- Main classification: G06F17/30
- IPC classification: G06F17/30
Abstract:
A system and method for using numbers to query a corpus of documents, particularly but not exclusively for data spaces that have low reflectivity, i.e., data spaces in which, for a point x_i described by one or more numbers, few permutations of those numbers are also present. For each document to be searched, each query number is matched with one and only one document number, preferably using a bipartite graph or a heuristic rule, such that a distance function is minimized. The distance function can, but need not, take into account attribute names and unit names. A limiting algorithm can be used to limit the number of documents that must be searched.
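The matching step described in the abstract can be illustrated with a minimal sketch. It is not the patented implementation: the relative-difference distance, the exhaustive search over assignments (a stand-in for a bipartite-graph matching algorithm, practical only for small inputs), and the names `relative_distance`, `match_score`, and `rank_documents` are all assumptions made for illustration. Attribute names, unit names, and the limiting algorithm are left out.

```python
from itertools import permutations


def relative_distance(q, d):
    """Assumed per-number distance: relative difference, 0.0 when equal."""
    if q == d:
        return 0.0
    return abs(q - d) / max(abs(q), abs(d))


def match_score(query, doc_numbers):
    """Find the one-to-one assignment of query numbers to document numbers
    with the smallest total distance.

    Returns (total_distance, matched_document_numbers). A document with
    fewer numbers than the query cannot be matched and scores infinity.
    Exhaustive search is used here in place of a bipartite matching
    algorithm, so this only suits short queries and short documents.
    """
    if len(doc_numbers) < len(query):
        return float("inf"), ()
    best = (float("inf"), ())
    for candidate in permutations(doc_numbers, len(query)):
        total = sum(relative_distance(q, d) for q, d in zip(query, candidate))
        if total < best[0]:
            best = (total, candidate)
    return best


def rank_documents(query, corpus):
    """Order document ids by increasing minimum matching distance."""
    scored = [(match_score(query, nums)[0], doc_id)
              for doc_id, nums in corpus.items()]
    return [doc_id for _, doc_id in sorted(scored)]


if __name__ == "__main__":
    # Hypothetical corpus: each document is reduced to the numbers it contains.
    corpus = {"a": [256, 3.3, 100], "b": [128, 5.0], "c": [3.3]}
    print(rank_documents([3.3, 256], corpus))
```

Because each query number is bound to a distinct document number, a document that merely repeats one close value cannot satisfy several query numbers at once. For larger inputs, a polynomial-time assignment solver would be the natural replacement for the exhaustive loop.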
Published/granted documents
- US20030204494A1 Method and system for searching documents with numbers (publication date: 2003-10-30)