Invention Grant
- Patent Title: Page selection for indexing
- Patent Title (中): 索引的页面选择
- Application No.: US12959060
- Application Date: 2010-12-02
- Publication No.: US08645288B2
- Publication Date: 2014-02-04
- Inventor: Taifeng Wang, Bin Gao, Tie-Yan Liu
- Applicant: Taifeng Wang, Bin Gao, Tie-Yan Liu
- Applicant Address: US WA Redmond
- Assignee: Microsoft Corporation
- Current Assignee: Microsoft Corporation
- Current Assignee Address: US WA Redmond
- Agency: Lee & Hayes, PLLC
- Main IPC: G06F15/18
- IPC: G06F15/18

Abstract:
Some implementations provide techniques for selecting web pages for inclusion in an index. For example, some implementations apply regularization to select a subset of the crawled web pages for indexing based on link relationships between the crawled web pages, features extracted from the crawled web pages, and user behavior information determined for at least some of the crawled web pages. Further, in some implementations, the user behavior information may be used to sort a training set of crawled web pages into a plurality of labeled groups. The labeled groups may be represented in a directed graph that indicates relative priorities for being selected for indexing.
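
The grouping step described in the abstract can be illustrated with a minimal sketch. The abstract does not name the user behavior signal, the number of groups, or any cut-offs, so the visit counts, the three labels, the thresholds, and every function name below are assumptions chosen for illustration only. The regularization-based selection itself is not shown.

```python
from collections import defaultdict

# Hypothetical labels, ordered from highest to lowest indexing priority.
GROUP_ORDER = ["high", "medium", "low"]


def label_page(visits, high=100, medium=10):
    """Assign a label from a visit count (thresholds are assumptions)."""
    if visits >= high:
        return "high"
    if visits >= medium:
        return "medium"
    return "low"


def group_pages(visit_counts):
    """Sort a training set of pages into labeled groups."""
    groups = defaultdict(list)
    for url, visits in visit_counts.items():
        groups[label_page(visits)].append(url)
    return dict(groups)


def priority_edges(groups):
    """Return directed edges (a, b): group a takes priority over group b."""
    present = [g for g in GROUP_ORDER if g in groups]
    return [(a, b) for i, a in enumerate(present) for b in present[i + 1:]]


# Made-up training data: URL -> observed visit count.
visit_counts = {
    "a.example/home": 250,
    "a.example/faq": 40,
    "a.example/old": 2,
    "a.example/news": 120,
}
groups = group_pages(visit_counts)
edges = priority_edges(groups)
```

Here `edges` plays the role of the directed graph: each edge points from a group that should be preferred for indexing to one that should be considered later. A learned model would then be trained to respect these pairwise priorities.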
Public/Granted literature
- US20120143792A1, PAGE SELECTION FOR INDEXING, Public/Granted day: 2012-06-07