发明授权
US08682901B1 Index server architecture using tiered and sharded phrase posting lists
有权
索引服务器架构使用分层和分层的短语发布列表
- 专利标题: Index server architecture using tiered and sharded phrase posting lists
- 专利标题(中): 索引服务器架构使用分层和分层的短语发布列表
-
申请号: US13332278申请日: 2011-12-20
-
公开(公告)号: US08682901B1公开(公告)日: 2014-03-25
- 发明人: Pei Cao , Nadav Eiron , Soham Mazumdar , Anna L. Patterson , Russell Power , Yonatan Zunger
- 申请人: Pei Cao , Nadav Eiron , Soham Mazumdar , Anna L. Patterson , Russell Power , Yonatan Zunger
- 申请人地址: US CA Mountain View
- 专利权人: Google Inc.
- 当前专利权人: Google Inc.
- 当前专利权人地址: US CA Mountain View
- 主分类号: G01F7/00
- IPC分类号: G01F7/00
摘要:
An information retrieval system uses phrases to index, retrieve, organize and describe documents. Phrases are extracted from the document collection. Documents are the indexed according to their included phrases, using phrase posting lists. The phrase posting lists are stored in an cluster of index servers. The phrase posting lists can be tiered into groups, and sharded into partitions. Phrases in a query are identified based on possible phrasifications. A query schedule based on the phrases is created from the phrases, and then optimized to reduce query processing and communication costs. The execution of the query schedule is managed to further reduce or eliminate query processing operations at various ones of the index servers.
信息查询