发明申请
US20070162445A1 System and method for searching and matching data having ideogrammatic content
有权
用于搜索和匹配具有表意文字内容的数据的系统和方法
- 专利标题: System and method for searching and matching data having ideogrammatic content
- 专利标题(中): 用于搜索和匹配具有表意文字内容的数据的系统和方法
-
申请号: US11603413申请日: 2006-11-22
-
公开(公告)号: US20070162445A1公开(公告)日: 2007-07-12
- 发明人: Anthony Scriffignano , Kevin Nedd , Peihsin Shao , Gan Simpeng , Sarah Lu , Masayuki Okada , Mayako Kasai , Julian Prower , Nicholas Teoh , Jeremy Sy , Warwick Matthews
- 申请人: Anthony Scriffignano , Kevin Nedd , Peihsin Shao , Gan Simpeng , Sarah Lu , Masayuki Okada , Mayako Kasai , Julian Prower , Nicholas Teoh , Jeremy Sy , Warwick Matthews
- 专利权人: Dun and Bradstreet
- 当前专利权人: Dun and Bradstreet
- 主分类号: G06F17/30
- IPC分类号: G06F17/30
摘要:
A method of searching and matching non-phonetic or ideogrammatic input data to stored data, including the steps of receiving input data comprising a search string having a plurality of elements, converting a subset of the elements into a set of terms, generating an optimized plurality of keys from the set of terms, retrieving stored data based on the optimized keys corresponding to most likely candidates for match, and selecting a best match from the plurality of candidates. At least some of the ideogrammatic elements form part of an ideogrammatic writing system. The method may also include dividing the search string into a plurality of overlapping sub-segments and identifying sub-segments having inferred semantic meaning as well as sub-segments having no semantic meaning in the ideogrammatic writing system, and using the various sub-segments to generate the optimized keys.
公开/授权文献
信息查询