Granted invention patent
- Patent title: Index and method for extending and querying index
- Patent title (Chinese): 扩展和查询索引的索引和方法
- Application number: US11562495
- Filing date: 2006-11-22
- Publication number: US07689574B2
- Publication date: 2010-03-30
- Inventors: Wei Zhu Chen, Zhong Su, Rui Wang, Li Zhang
- Applicants: Wei Zhu Chen, Zhong Su, Rui Wang, Li Zhang
- Applicant address: US NY Armonk
- Assignee: International Business Machines Corporation
- Current assignee: International Business Machines Corporation
- Current assignee address: US NY Armonk
- Agency: Gibb I.P. Law Firm, LLC
- Priority: CN200510124283 20051129
- Main classification: G06F17/00
- IPC classification: G06F17/00; G06F15/16; G06F3/00
Abstract:
A method, system, and program storage device are provided for extending an inverted index that comprises first and second inverted index subfiles, increasing the speed of establishing and updating inverted index files. The method performs ordered keyword indexing operations that generate an inverted index from data sources: the frequency of occurrence of keywords in each data source is calculated, and each keyword, the data sources, and the frequency of occurrence of each keyword in the corresponding data sources are written to the inverted index. If the number of data sources involved in the indexing operations reaches a first threshold, the contents of the inverted index are written as a smallest grid into the first inverted index subfile. If the number of smallest grids in the first inverted index subfile reaches a second threshold, the smallest grids are merged into a merged grid, which is written into the second inverted index subfile. If the number of merged grids in the second inverted index subfile reaches a third threshold, the merged grids are further merged into a larger merged grid, which is written back into the first inverted index subfile.
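The threshold-driven flow in the abstract can be sketched roughly as follows. This is an illustrative model only, not the patented implementation: the class name `TieredInvertedIndex`, the helper `merge_grids`, the use of in-memory lists in place of the two subfiles, whitespace tokenization, the removal of smallest grids from the first subfile once they are merged, and the `query` method are all assumptions made for the example.

```python
from collections import Counter, defaultdict


def merge_grids(grids):
    """Combine several grids (keyword -> {source_id: frequency}) into one."""
    merged = defaultdict(dict)
    for grid in grids:
        for keyword, postings in grid.items():
            merged[keyword].update(postings)
    return dict(merged)


class TieredInvertedIndex:
    """Hypothetical model of the two-subfile scheme; lists stand in for files."""

    def __init__(self, t1=2, t2=2, t3=2):
        self.t1, self.t2, self.t3 = t1, t2, t3  # first, second, third thresholds
        self.buffer = defaultdict(dict)         # in-memory inverted index
        self.buffered_sources = 0
        self.first_subfile = []                 # entries: (kind, grid)
        self.second_subfile = []                # entries: merged grids

    def add_source(self, source_id, text):
        # Record each keyword's frequency of occurrence in this data source.
        for keyword, freq in Counter(text.lower().split()).items():
            self.buffer[keyword][source_id] = freq
        self.buffered_sources += 1
        if self.buffered_sources >= self.t1:
            self._flush()

    def _flush(self):
        # First threshold reached: write the buffer out as a smallest grid.
        self.first_subfile.append(("smallest", dict(self.buffer)))
        self.buffer = defaultdict(dict)
        self.buffered_sources = 0

        smallest = [g for kind, g in self.first_subfile if kind == "smallest"]
        if len(smallest) >= self.t2:
            # Second threshold reached: merge smallest grids into the second
            # subfile. Dropping them from the first subfile is an assumption.
            self.first_subfile = [
                entry for entry in self.first_subfile if entry[0] != "smallest"
            ]
            self.second_subfile.append(merge_grids(smallest))

            if len(self.second_subfile) >= self.t3:
                # Third threshold reached: merge again and write the larger
                # merged grid back into the first subfile.
                larger = merge_grids(self.second_subfile)
                self.first_subfile.append(("larger", larger))
                self.second_subfile = []

    def query(self, keyword):
        # Assumed lookup: collect postings from the buffer and both subfiles.
        result = dict(self.buffer.get(keyword, {}))
        for _, grid in self.first_subfile:
            result.update(grid.get(keyword, {}))
        for grid in self.second_subfile:
            result.update(grid.get(keyword, {}))
        return result
```

With all three thresholds set to 2, indexing eight data sources triggers four flushes, two second-level merges, and one third-level merge, leaving a single larger merged grid in the first subfile and an empty second subfile.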
Published/granted documents
- US20070124277A1 Index and Method for Extending and Querying Index (publication/grant date: 2007-05-31)