Invention Application
- Patent Title: MULTI-LEVEL COVERAGE FOR CRAWLING SELECTION
- Patent Title (Chinese, translated): Multi-Level Search Selection
- Application No.: US12958611
- Application Date: 2010-12-02
- Publication No.: US20120143844A1
- Publication Date: 2012-06-07
- Inventor: Taifeng Wang, Tie-Yan Liu, Bin Gao
- Applicant: Taifeng Wang, Tie-Yan Liu, Bin Gao
- Applicant Address: US WA Redmond
- Assignee: Microsoft Corporation
- Current Assignee: Microsoft Corporation
- Current Assignee Address: US WA Redmond
- Main IPC: G06F17/30
- IPC: G06F17/30

Abstract:
Some implementations provide techniques for determining which URLs to select for crawling from a pool of URLs. For example, the selection of URLs for crawling may be made based on maintaining a high coverage of the known URLs and/or high discoverability of the World Wide Web. Some implementations provide a multi-level coverage strategy for crawling selection. Further, some implementations provide techniques for discovering unseen URLs.