Invention Application
US20120143792A1 PAGE SELECTION FOR INDEXING 有权
页面选择索引

PAGE SELECTION FOR INDEXING
Abstract:
Some implementations provide techniques for selecting web pages for inclusion in an index. For example, some implementations apply regularization to select a subset of the crawled web pages for indexing based on link relationships between the crawled web pages, features extracted from the crawled web pages, and user behavior information determined for at least some of the crawled web pages. Further, in some implementations, the user behavior information may be used to sort a training set of crawled web pages into a plurality of labeled groups. The labeled groups may be represented in a directed graph that indicates relative priorities for being selected for indexing.
Public/Granted literature
Information query
Patent Agency Ranking
0/0