发明申请
- 专利标题: Ranking similar passages
- 专利标题(中): 排名相似的段落
-
申请号: US12134145申请日: 2008-06-05
-
公开(公告)号: US20090055389A1公开(公告)日: 2009-02-26
- 发明人: William Noah Schilit , Okan Kolak , Justin John Paul Vincent-Foglesong
- 申请人: William Noah Schilit , Okan Kolak , Justin John Paul Vincent-Foglesong
- 申请人地址: US CA Mountain View
- 专利权人: Google Inc.
- 当前专利权人: Google Inc.
- 当前专利权人地址: US CA Mountain View
- 主分类号: G06F17/30
- IPC分类号: G06F17/30
摘要:
Passages in a digital corpus are scored and ranked based at least in part on characteristics of instances of the passages occurring in the corpus. Such characteristics include the popularity of the author, the characteristics of the words introducing and following the similar passage, frequency of appearance of the passage in the digital corpus, the length of the similar passage, the words of the similar passage, the usage of punctuation with the similar passage, and the diffusion of the similar passage within the digital corpus. The characteristics are scored and weighted to produce ranking scores for the associated passages. The ranking scores are used for purposes including selecting passages to display in association with a document and ranking passages displayed in response to a search.
信息查询