- Patent title: Anchor tag indexing in a web crawler system
- Application number: US13300516
- Filing date: 2011-11-18
- Publication number: US09305091B2
- Publication date: 2016-04-05
- Inventors: Huican Zhu, Jeffrey Dean, Sanjay Ghemawat, Bwolen Po-Jen Yang, Anurag Acharya
- Applicants: Huican Zhu, Jeffrey Dean, Sanjay Ghemawat, Bwolen Po-Jen Yang, Anurag Acharya
- Applicant address: US CA Mountain View
- Assignee: Google Inc.
- Current assignee: Google Inc.
- Current assignee address: US CA Mountain View
- Main classification: G06F17/00
- IPC classification: G06F17/00; G06F17/30; G06F17/27; G06F17/22
Abstract:
Provided is a method and system for indexing documents in a collection of linked documents. A link log, including one or more pairings of source documents and target documents, is accessed. A sorted anchor map, containing one or more target document to source document pairings, is generated. The pairings in the sorted anchor map are ordered based on target document identifiers.
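The inversion described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation. The names `LinkRecord`, `build_sorted_anchor_map`, and the `anchor_text` field are hypothetical, as are the integer document identifiers and the tie-break on source identifier; the abstract only states that source-to-target pairings are read from a link log and that the resulting target-to-source pairings are ordered by target document identifier.

```python
from typing import Iterable, List, NamedTuple, Tuple


class LinkRecord(NamedTuple):
    """One link-log entry: a source document linking to a target document.

    The anchor_text field is an illustrative assumption; the abstract only
    mentions source/target pairings.
    """
    source_id: int
    target_id: int
    anchor_text: str


def build_sorted_anchor_map(
    link_log: Iterable[LinkRecord],
) -> List[Tuple[int, int, str]]:
    """Invert source->target pairings into target->source pairings,
    ordered by target identifier (ties broken by source identifier,
    which is an assumption made here for determinism)."""
    pairings = [(rec.target_id, rec.source_id, rec.anchor_text) for rec in link_log]
    pairings.sort(key=lambda p: (p[0], p[1]))
    return pairings


# Hypothetical link log with made-up identifiers.
link_log = [
    LinkRecord(source_id=7, target_id=42, anchor_text="crawler docs"),
    LinkRecord(source_id=3, target_id=15, anchor_text="home"),
    LinkRecord(source_id=9, target_id=42, anchor_text="indexing"),
    LinkRecord(source_id=7, target_id=15, anchor_text="start page"),
]

anchor_map = build_sorted_anchor_map(link_log)
for target_id, source_id, text in anchor_map:
    print(target_id, source_id, text)
```

Ordering by target identifier places all pairings that point at the same document next to each other, so an indexer could read every inbound link for one target in a single contiguous pass.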
Published/granted documents:
- US20120066576A1 Anchor Tag Indexing in a Web Crawler System (publication/grant date: 2012-03-15)