Abstract:
Methods, systems, and apparatus, including computer programs encoded on computer storage media, for associating resources with entities. One of the methods includes clustering a plurality of first documents into one or more first document groups, wherein each of the one or more first document groups is associated with a proper name of an author; receiving a query that specifies a particular proper name of a particular author; generating a result list of one or more documents that satisfy the query, the documents being listed in order of rank; ranking the one or more first document groups based on the one or more documents that satisfy the query; and providing the one or more first document groups, wherein the one or more first document groups are presented in an order based on the ranking.
Abstract:
The disclosed implementations provide a method of searching for a known item. The method includes receiving a lookup request for the known item. The lookup request includes information identifying estimated values for a plurality of attributes of the known item. In accordance with the received lookup request, two or more estimated attribute-value pairs for the known item are estimated. In accordance with the received lookup request, a plurality of queries corresponding to the estimated attribute-value pairs are formulated in accordance with a plurality of predefined query types, each query having a corresponding position in a query type hierarchy. One or more candidate items are identified by executing one or more of the plurality of queries in accordance with the query type hierarchy. At least one of the candidate items is returned in response to the lookup request for the known item.
Abstract:
Systems and methods are provided for obtaining a plurality of documents. A respective document in the plurality of documents is associated with a score and each document in the plurality of documents is from a different data structure in a plurality of data structures. Each data structure in the plurality of data structures represents a different portion of a document address space. A first document in the plurality of documents is selected in accordance with the score associated with the first document. The first document has a fingerprint that indicates that the first document has substantially identical content to every other document in the plurality of documents. In accordance with the score, the first document is indexed thereby producing an indexed first document. With respect to the plurality of documents, the indexed first document is included in a document index as representative of each document in the plurality of documents.
Abstract:
Methods, systems, and apparatus, including computer programs encoded on computer storage media, for associating resources based on resource associations. One of the methods includes receiving a first profile, wherein the first profile is for a first author, wherein the first profile links to one or more first documents, wherein the first author is an author of each of the one or more first documents; identifying, one or more second authors, wherein each of the one or more second authors is a co-author of one or more of the first documents; calculating, respective co-author scores for each of the one or more second authors; ranking, the one or more second authors based on their respective co-author scores; and associating, the one or more second authors with the first profile, the first profile includes a listing of the one or more second authors in an order according to the ranking.
Abstract:
A system and method identifies a primary version out of different versions of the same document. The system selects a priority of authority for each document version based on a priority rule and information associated with the document version, and selects a primary version based on the priority of authority and information associated with the document version.
Abstract:
Methods, systems, and apparatus, including computer programs encoded on computer storage media, for associating resources based on resource associations. One of the methods includes receiving a first profile, wherein the first profile is for a first author, wherein the first profile links to one or more first documents, wherein the first author is an author of each of the one or more first documents; identifying, one or more second authors, wherein each of the one or more second authors is a co-author of one or more of the first documents; calculating, respective co-author scores for each of the one or more second authors; ranking, the one or more second authors based on their respective co-author scores; and associating, the one or more second authors with the first profile, the first profile includes a listing of the one or more second authors in an order according to the ranking.
Abstract:
Methods, systems, and apparatus, including computer programs encoded on computer storage media, for associating resources with entities. One of the methods includes clustering a plurality of first documents into one or more first document groups, wherein each of the one or more first document groups is associated with a proper name of an author; receiving a query that specifies a particular proper name of a particular author; generating a result list of one or more documents that satisfy the query, the documents being listed in order of rank; ranking the one or more first document groups based on the one or more documents that satisfy the query; and providing the one or more first document groups, wherein the one or more first document groups are presented in an order based on the ranking.