Abstract:
An online search system generates an index of documents using index information received from a library. Some documents have restricted access; some documents may not be available online. The search system provides links to documents in the library as well as other sites based on a search, and may include link resolvers received from the library. The search system provides access links to the link resolvers if an identifier, such as a user identification or IP address, matches an affiliation list from the library.
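The affiliation check described above can be sketched as follows. This is a minimal illustration, not the patented implementation: the `Library` record, its field names, the resolver URL format, and the example addresses are all assumptions made for the example.

```python
import ipaddress
from dataclasses import dataclass, field


@dataclass
class Library:
    # Hypothetical record; field names are illustrative only.
    name: str
    resolver_url: str
    user_ids: set = field(default_factory=set)     # affiliated user identifications
    ip_ranges: list = field(default_factory=list)  # affiliated networks in CIDR form


def is_affiliated(library, user_id=None, ip=None):
    """Return True if either identifier matches the library's affiliation list."""
    if user_id is not None and user_id in library.user_ids:
        return True
    if ip is not None:
        addr = ipaddress.ip_address(ip)
        return any(addr in ipaddress.ip_network(r) for r in library.ip_ranges)
    return False


def access_links(doc_id, libraries, user_id=None, ip=None):
    """Build link-resolver links only for libraries the requester is affiliated with."""
    return [
        f"{lib.resolver_url}?id={doc_id}"  # assumed link format
        for lib in libraries
        if is_affiliated(lib, user_id, ip)
    ]


lib = Library(
    name="Example University",
    resolver_url="https://resolver.example.edu/openurl",
    user_ids={"alice"},
    ip_ranges=["192.0.2.0/24"],
)
on_campus = access_links("doc42", [lib], ip="192.0.2.17")
off_campus = access_links("doc42", [lib], ip="198.51.100.5")
```

Here a request from inside the library's network receives a resolver link, while an unaffiliated request receives none.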
Abstract:
Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for query modification. In one aspect, a method includes receiving an original query including a first limitation. First search results responsive to a modified query are obtained, where the first limitation has been omitted from the modified query. One or more common characteristics shared by two or more resources are identified. Each of the two or more resources corresponds to a different highly-ranked result of the first search results. A second modified query including the original query and a second limitation representing the one or more common characteristics is generated. Second search results responsive to the second modified query are obtained. The second search results are provided in a response to the original query.
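The query-modification steps above can be sketched roughly as follows. The toy `search` scoring, the `tags` field standing in for "common characteristics", and the `top_k` cutoff are assumptions for illustration only; the abstract does not specify them.

```python
from collections import Counter


def search(terms, corpus):
    """Toy ranking (an assumption): score = number of query terms found in text or tags."""
    scored = []
    for doc in corpus:
        words = set(doc["text"].split()) | set(doc["tags"])
        score = sum(1 for t in terms if t in words)
        if score:
            scored.append((score, doc))
    scored.sort(key=lambda pair: -pair[0])  # stable sort keeps corpus order on ties
    return [doc for _, doc in scored]


def refine(original, omitted, corpus, top_k=3):
    """Drop one limitation, mine the top results, and add what they share back in."""
    modified = [t for t in original if t != omitted]       # first modified query
    first_results = search(modified, corpus)[:top_k]       # highly ranked results
    counts = Counter(tag for doc in first_results for tag in set(doc["tags"]))
    common = [tag for tag, n in counts.items() if n >= 2]  # shared by two or more
    second_query = original + common                       # original + second limitation
    return second_query, search(second_query, corpus)


corpus = [
    {"id": "d1", "text": "cheap flights paris", "tags": ["travel"]},
    {"id": "d2", "text": "paris hotel flights deals", "tags": ["travel"]},
    {"id": "d3", "text": "paris fashion week", "tags": ["fashion"]},
]
second_query, second_results = refine(["flights", "paris", "2009"], "2009", corpus)
```

In this example the two top-ranked results share the `travel` tag, so it is appended to the original query as the second limitation.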
Abstract:
A system and method identify a primary version out of different versions of the same document. The system selects a priority of authority for each document version based on a priority rule and information associated with the document version, and selects a primary version based on the priority of authority and information associated with the document version.
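One way to picture the two-stage selection is sketched below. The priority rule (a ranking of source types) and the tie-breaking information (full-text availability, then length) are assumptions chosen for the example; the abstract does not say which rule or information is used.

```python
from dataclasses import dataclass


@dataclass
class Version:
    url: str
    source_type: str     # e.g. "publisher", "repository", "homepage"
    has_full_text: bool
    length: int          # document length in characters


# Hypothetical priority rule: higher number means higher authority.
PRIORITY_RULE = {"publisher": 3, "repository": 2, "homepage": 1}


def priority_of_authority(version):
    """Map a version to its priority using the rule; unknown sources rank lowest."""
    return PRIORITY_RULE.get(version.source_type, 0)


def select_primary(versions):
    """Pick the version with the highest priority, breaking ties on version information."""
    return max(
        versions,
        key=lambda v: (priority_of_authority(v), v.has_full_text, v.length),
    )


versions = [
    Version("https://homepage.example.org/paper.pdf", "homepage", True, 42000),
    Version("https://publisher.example.com/abs/123", "publisher", False, 1500),
    Version("https://publisher.example.com/full/123", "publisher", True, 41000),
]
primary = select_primary(versions)
```

Both publisher versions share the top priority here, so the one with full text is chosen.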
Abstract:
A search query containing at least one term is received at a search controller from a query server and is preferably normalized and hashed into a representation of the search query. The representation is transmitted towards a cache containing multiple query result entries, each of which contains a list of documents associated with a previously searched query. The cache is then searched, and the query result entries for the search query are sent from the cache to the search controller. Subsequently, it is determined whether the query result entries are current versions for the search query. If they are not, current versions of the query result entries are obtained.
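The flow above (normalize, hash, look up, check currency, refresh) might look like the following sketch. The normalization (lowercasing and sorting terms), the use of SHA-256, and the index version counter used to decide whether an entry is current are all assumptions; the abstract leaves these details open.

```python
import hashlib


def query_key(query):
    """Normalize a query and hash it into a fixed-size representation."""
    # Assumed normalization: case-insensitive and term-order-insensitive.
    normalized = " ".join(sorted(query.lower().split()))
    return hashlib.sha256(normalized.encode("utf-8")).hexdigest()


class SearchController:
    """Hypothetical controller that fronts a backend search with a result cache."""

    def __init__(self, backend_search):
        self.cache = {}            # key -> {"docs": [...], "version": int}
        self.index_version = 1     # assumed freshness signal
        self.backend_search = backend_search

    def lookup(self, query):
        key = query_key(query)
        entry = self.cache.get(key)
        # A missing entry or one built against an older index is not current.
        if entry is None or entry["version"] != self.index_version:
            entry = {"docs": self.backend_search(query), "version": self.index_version}
            self.cache[key] = entry
        return entry["docs"]
```

With this sketch, `"Red  Shoes"` and `"shoes red"` map to the same cache entry, and bumping `index_version` forces the next lookup to fetch current results.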
Abstract:
A system provides client access to customized news content. The system includes a custom news source server and a news search server. The custom news source server periodically sends one or more customized search queries to the news search server. The news search server fetches news content from multiple news source servers and aggregates the news content. The news search server also periodically receives the one or more search queries from the custom news source server, searches the aggregated news content based on the one or more search queries, and periodically provides selected news content to the custom news source server based on results of the searches. The custom news source server permits access to clients, from across a network, to the selected news content provided by the news search server.
Abstract:
A system for reducing the possibility of crawling duplicate document identifiers partitions a plurality of document identifiers into multiple clusters, each cluster having a cluster name and a set of document parameters. The system generates an equivalence rule for each cluster of document identifiers, the rule specifying which document parameters associated with the cluster are content-relevant. Next, the system groups each cluster of document identifiers into one or more equivalence classes in accordance with its associated equivalence rule, each equivalence class including one or more document identifiers that correspond to a document content and having a representative document identifier identifying the document content.
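The grouping step can be illustrated with URLs as the document identifiers. In this sketch the cluster name is taken to be host plus path, the equivalence rules are supplied by hand rather than generated, and the representative is simply the shortest URL in each class; all three choices are assumptions for the example.

```python
from collections import defaultdict
from urllib.parse import urlsplit, parse_qsl


def equivalence_classes(urls, rules):
    """Group URLs whose content-relevant parameters match within the same cluster.

    `rules` maps a cluster name to the set of parameter names treated as
    content-relevant (hypothetical, hand-written rules).
    """
    classes = defaultdict(list)
    for url in urls:
        parts = urlsplit(url)
        cluster = parts.netloc + parts.path      # assumed cluster name
        relevant = rules.get(cluster)            # None means keep every parameter
        kept = tuple(sorted(
            (k, v) for k, v in parse_qsl(parts.query)
            if relevant is None or k in relevant
        ))
        classes[(cluster, kept)].append(url)
    # Assumed choice of representative: shortest URL, ties broken alphabetically.
    return {min(m, key=lambda u: (len(u), u)): m for m in classes.values()}


urls = [
    "http://example.com/item?id=7&sid=abc",
    "http://example.com/item?id=7&sid=xyz",
    "http://example.com/item?id=8",
]
rules = {"example.com/item": {"id"}}  # "sid" is treated as content-irrelevant
classes = equivalence_classes(urls, rules)
```

A crawler using such classes would fetch only the representative of each class, so the two `id=7` URLs above would be crawled once.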
Abstract:
A system customizes a news document associated with a user of a news aggregation service. The system includes multiple news source servers that store news content and a remote news aggregation server. The news aggregation server creates a customized news document based on one or more personalized search queries received from a user. The news aggregation server fetches the news content from the multiple news source servers, aggregates the news content, and searches the aggregated news content based on the one or more personalized search queries. The news aggregation server provides selected news content to the customized news document based on results of the search.
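The fetch, aggregate, and search pipeline above can be sketched as follows. The article dictionaries, the de-duplication by URL during aggregation, and the title-only matching are assumptions for illustration; the abstract does not describe how aggregation or matching is performed.

```python
def aggregate(sources):
    """Merge articles from all sources, dropping repeats by URL (an assumption)."""
    seen, merged = set(), []
    for articles in sources:
        for article in articles:
            if article["url"] not in seen:
                seen.add(article["url"])
                merged.append(article)
    return merged


def customized_document(sources, queries):
    """Build a per-user document: one section of matching titles per personalized query."""
    pool = aggregate(sources)
    document = {}
    for query in queries:
        terms = query.lower().split()
        document[query] = [
            a["title"] for a in pool
            if all(t in a["title"].lower() for t in terms)  # toy matching on titles only
        ]
    return document


source_a = [
    {"url": "https://a.example/1", "title": "Solar power adoption rises"},
    {"url": "https://a.example/2", "title": "Local election results"},
]
source_b = [
    {"url": "https://a.example/1", "title": "Solar power adoption rises"},
    {"url": "https://b.example/9", "title": "New solar farm opens"},
]
doc = customized_document([source_a, source_b], ["solar", "election results"])
```

Each personalized query becomes one section of the customized news document, filled from the aggregated pool.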
Abstract:
A search query containing one or more terms is received from a client system. In response to receiving the search query, one or more snippets obtained in response to a prior execution of said search query are requested from a cache. For a respective snippet received from the cache, it is determined whether the respective snippet is a current version. In response to a determination that the respective snippet is not the current version, the current version of the respective snippet is obtained from a corresponding document in which one or more terms from said search query are located and the snippet stored in the cache is updated using the obtained current version. Search query results including the respective snippet are transmitted to the client.
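The snippet-refresh logic above can be sketched as follows. The cache keyed by (query, document id), the per-document `version` number used to decide whether a snippet is current, and the fixed-width snippet window are assumptions for the example. A cache miss is handled here the same way as a stale entry.

```python
def make_snippet(text, terms, width=30):
    """Return the text surrounding the first query term found in the document."""
    lowered = text.lower()
    for term in terms:
        pos = lowered.find(term.lower())
        if pos != -1:
            start = max(0, pos - width)
            return text[start:pos + len(term) + width].strip()
    return text[:2 * width].strip()  # fallback when no term is present


def snippets_for(query, doc_ids, documents, cache):
    """Serve snippets from the cache, regenerating any that are not current."""
    terms = query.split()
    results = []
    for doc_id in doc_ids:
        doc = documents[doc_id]
        entry = cache.get((query, doc_id))
        # Assumed currency test: the snippet was built from this document version.
        if entry is None or entry["version"] != doc["version"]:
            entry = {"snippet": make_snippet(doc["text"], terms), "version": doc["version"]}
            cache[(query, doc_id)] = entry  # update the cached snippet
        results.append(entry["snippet"])
    return results


documents = {"d1": {"text": "Old text about caching", "version": 1}}
cache = {}
first = snippets_for("caching", ["d1"], documents, cache)
documents["d1"] = {"text": "New text about caching", "version": 2}
second = snippets_for("caching", ["d1"], documents, cache)
```

After the document changes, the second call detects the version mismatch, rebuilds the snippet from the new text, and overwrites the cached copy.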