发明授权
- 专利标题: Efficient retrieval algorithm by query term discrimination
- 专利标题(中): 通过查询词辨别的有效检索算法
-
申请号: US11804627申请日: 2007-05-18
-
公开(公告)号: US07822752B2公开(公告)日: 2010-10-26
- 发明人: Chenxi Lin , Lei Ji , Huajun Zeng , Benyu Zhang , Zheng Chen , Jian Wang
- 申请人: Chenxi Lin , Lei Ji , Huajun Zeng , Benyu Zhang , Zheng Chen , Jian Wang
- 申请人地址: US WA Redmond
- 专利权人: Microsoft Corporation
- 当前专利权人: Microsoft Corporation
- 当前专利权人地址: US WA Redmond
- 主分类号: G06F7/00
- IPC分类号: G06F7/00 ; G06F17/30
摘要:
Described is an efficient retrieval mechanism that quickly locates documents (e.g., corresponding to online advertisements) based on query term discrimination. A topmost subset (e.g., two) of search terms is selected according to their ranked importance, e.g., as ranked by inverted document frequency. The topmost terms are then used to narrow the number of rows of an inverted query index that are searched to find document identifiers and associated scores, such as computed offline by a BM25 algorithm. For example, for each document identifier of each important term, a fast search within each of the narrowed subset of rows (that also contain that document identifier) may be performed by comparing document identifiers to jump a pointer within each other row, followed by a binary search to locate a particular document. The scores of the set of particular documents may then be used to rank their relative importance for returning as results.
公开/授权文献
信息查询