Abstract:
Methods and apparatus are described for identifying implicitly temporally qualified queries, i.e., queries for which a time period is implied but not explicitly stated, and for expanding such queries to include one or more temporal references.
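As a rough illustration of the idea in this abstract, the sketch below flags a query as implicitly temporally qualified when it mentions a recurring-event term but no explicit year, and then expands it with candidate years. The term list, the year pattern, and the function names are hypothetical and are not taken from the abstract.

```python
import re

# Hypothetical terms that usually imply a time period (an assumption).
RECURRING_EVENT_TERMS = {"olympics", "election", "world cup", "tax forms"}

# Matches an explicit four-digit year such as 1998 or 2024.
YEAR_PATTERN = re.compile(r"\b(19|20)\d{2}\b")


def is_implicitly_temporal(query):
    """Return True if the query implies a time period but states no year."""
    text = query.lower()
    if YEAR_PATTERN.search(text):
        return False  # the time period is already explicit
    return any(term in text for term in RECURRING_EVENT_TERMS)


def expand_query(query, candidate_years):
    """Append each candidate year to an implicitly temporal query."""
    if not is_implicitly_temporal(query):
        return [query]
    return [f"{query} {year}" for year in candidate_years]
```

For example, `expand_query("world cup winner", [2006, 2010])` yields one expanded query per candidate year, while a query with no implied time period is returned unchanged.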
Abstract:
A familiarity level classifier comprises a stopwords engine for analyzing the stopwords (e.g., introductory level stopwords and advanced level stopwords) in a document (e.g., a website), and a familiarity level classifier module for generating a document familiarity level based on the stopwords analysis. The classifier may reside in an indexing module, a search engine, a user computer, or elsewhere in a computer network. The classifier may also include a reading level engine for conducting a reading level analysis of the document, in which case the familiarity level classifier module is configured to generate the document familiarity level based on the reading level analysis as well. The classifier may also include a document features engine for conducting a feature analysis of the document, in which case the familiarity level classifier module is configured to generate the document familiarity level based on the feature analysis as well.
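The stopwords analysis can be pictured with a minimal sketch that counts introductory and advanced stopwords and picks the more frequent kind. Both word lists and the tie-breaking rule are assumptions made for illustration. The reading level and document features engines are not modeled here.

```python
import re

# Hypothetical stopword lists (assumptions for illustration only).
INTRODUCTORY_STOPWORDS = {"you", "your", "what", "how", "so", "basically"}
ADVANCED_STOPWORDS = {"thus", "hence", "whereby", "therein", "respectively"}


def stopword_counts(text):
    """Count introductory and advanced stopwords in the text."""
    tokens = re.findall(r"[a-z']+", text.lower())
    intro = sum(1 for t in tokens if t in INTRODUCTORY_STOPWORDS)
    advanced = sum(1 for t in tokens if t in ADVANCED_STOPWORDS)
    return intro, advanced


def familiarity_level(text):
    """Label a document by whichever stopword kind is more frequent."""
    intro, advanced = stopword_counts(text)
    if intro == advanced:
        return "unknown"  # no evidence either way
    return "introductory" if intro > advanced else "advanced"
```

A production classifier would presumably combine these counts with other signals in a trained model. The majority rule here only shows how stopword usage can carry a familiarity signal.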
Abstract:
A system and method are disclosed for determining, using geo-spatial information, a prediction measurement, or measure, that can be used to determine whether or not to include a type of information in search results. The prediction measure is a measure of the likelihood that an item of that information type will be selected, or clicked on, by a user if the item is included in the search results. Without limitation, one such information type is news.
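One simple way to picture such a prediction measure is a smoothed click-through rate per region, compared against a threshold. The log format, the smoothing constants, the threshold, and the function names below are all assumptions. The abstract does not say how the measure is computed.

```python
from collections import defaultdict


def click_likelihood_by_region(log, prior_clicks=1.0, prior_views=10.0):
    """Estimate a smoothed click likelihood for each region.

    `log` is an iterable of (region, clicked) pairs, one per time a news
    item was shown in search results (a hypothetical log format).
    """
    views = defaultdict(int)
    clicks = defaultdict(int)
    for region, clicked in log:
        views[region] += 1
        clicks[region] += int(clicked)
    # Additive smoothing keeps sparsely observed regions from extreme values.
    return {
        region: (clicks[region] + prior_clicks) / (views[region] + prior_views)
        for region in views
    }


def should_include_news(region, likelihoods, threshold=0.2, default=0.1):
    """Include news only if the predicted click likelihood is high enough."""
    return likelihoods.get(region, default) >= threshold
```

Regions missing from the log fall back to the `default` likelihood, so with these values news is omitted for them.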
Abstract:
Systems and methods according to embodiments leverage click data to predict a relevance judgment for a given query-content item pair. An initial training phase utilizes a training set of query-content item pairs coupled with click data and relevance data (e.g., relevance judgments or labels) to train a model of the relationship between relevance and clicks. Accordingly, given an unlabeled query-content item pair as input to the model, a relevance judgment or label is provided. These relevance labels, in turn, may be used in conjunction with the query-content item pairs with which they are associated to train a model to determine a content item relevance function. When a user provides a query to a given search engine, the search engine applies the content item relevance function to the query and the content items in a responsive result set to provide a relevance-ordered result set to the user.
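The first stage, learning how clicks relate to relevance and then labeling unlabeled pairs, can be sketched with a deliberately tiny model: a single click-through-rate cutoff fitted on labeled pairs. The single-feature threshold model and all names below are assumptions. The abstract does not specify the model family.

```python
def fit_ctr_threshold(ctrs, labels):
    """Pick the CTR cutoff that best separates relevant (1) from not (0).

    Each observed CTR is tried as a cutoff, and the first one with the
    highest training accuracy is kept.
    """
    best_threshold, best_correct = 0.0, -1
    for candidate in sorted(set(ctrs)):
        correct = sum(
            (ctr >= candidate) == bool(label)
            for ctr, label in zip(ctrs, labels)
        )
        if correct > best_correct:
            best_threshold, best_correct = candidate, correct
    return best_threshold


def predict_labels(threshold, unlabeled):
    """Label (query, item) pairs from their CTR using the fitted cutoff.

    `unlabeled` maps (query, item) tuples to click-through rates.
    """
    return {pair: int(ctr >= threshold) for pair, ctr in unlabeled.items()}
```

The predicted labels could then serve as training targets for a separate ranking model, which corresponds to the second stage described in the abstract and is not shown here.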
Abstract:
The present invention is directed towards systems and methods for generating one or more related queries with respect to a given query. The method may comprise receiving a query comprising one or more terms, generating a corpus of candidate reformulations, and identifying one or more queries or terms from the corpus of candidate reformulations related to the received query. One or more related queries are generated on the basis of the identified one or more queries or terms from the corpus of candidate reformulations.
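As a minimal sketch of this pipeline, the code below builds a corpus of candidate reformulations from consecutive queries in search sessions and returns the most frequent rewrites of a given query. Using session logs as the source of the corpus is an assumption, as are the function names. The abstract does not say how the corpus is generated.

```python
from collections import Counter


def build_reformulation_corpus(sessions):
    """Count (query, next_query) pairs over sessions (hypothetical input).

    Each session is a list of query strings in the order they were issued.
    """
    corpus = Counter()
    for session in sessions:
        for first, second in zip(session, session[1:]):
            if first != second:  # ignore exact repeats
                corpus[(first, second)] += 1
    return corpus


def related_queries(query, corpus, top_k=3):
    """Return up to top_k rewrites of `query`, most frequent first."""
    candidates = Counter(
        {rewrite: n for (source, rewrite), n in corpus.items() if source == query}
    )
    return [rewrite for rewrite, _ in candidates.most_common(top_k)]
```

Raw frequency is the simplest possible selection rule. A statistical association measure could be substituted without changing the overall structure.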
Abstract:
A system and method are disclosed for identifying the tasks that a user is attempting to perform when submitting a series of search queries. A task may be categorized hierarchically into a mission and a goal in order to classify a series of queries and determine the user's intent. The user's mission may be an extended task and the user's goal may be a specific task. A pair of queries may be analyzed to determine whether those queries have the same mission and/or the same goal. For a series of queries, boundaries may be identified between the queries to identify where a user has changed her goal or changed her mission. The identification of goals, missions and boundaries may be automated based on an analysis of features of the queries that are compared.
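Boundary identification from pairwise query features can be illustrated with a single feature, word overlap between consecutive queries. The Jaccard measure, the threshold, and the function names are assumptions. Telling a goal change from a mission change would need richer features than this sketch uses.

```python
def jaccard(a, b):
    """Word-level Jaccard similarity between two queries."""
    words_a, words_b = set(a.lower().split()), set(b.lower().split())
    if not words_a or not words_b:
        return 0.0
    return len(words_a & words_b) / len(words_a | words_b)


def goal_boundaries(queries, min_overlap=0.25):
    """Return each index i at which query i appears to start a new goal.

    A boundary is declared when the overlap with the previous query falls
    below `min_overlap` (a hypothetical threshold).
    """
    return [
        i
        for i in range(1, len(queries))
        if jaccard(queries[i - 1], queries[i]) < min_overlap
    ]
```

For the series "cheap flights boston", "cheap flights boston to paris", "paris hotels", "pizza dough recipe", boundaries are reported before the third and fourth queries. A human reader would probably call the first of these a goal change within one trip-planning mission and the second a change of mission.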
Abstract:
A system and method are described for receiving a query and determining candidate alternate queries. A query may be parsed into concepts, and candidates for alternate terms may then be determined for either the whole query or the determined concepts. Candidates are scored for their respective similarity to the original query. After scoring, at least one candidate is chosen and may be offered to the user as a possible alternate query or used to serve bidded advertisements to the user.
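The parse, substitute, and score steps can be sketched as follows, treating each word as one concept and scoring a candidate by the product of its per-term similarities. The alternates table, its similarity values, and the scoring rule are hypothetical and are not taken from the abstract.

```python
from itertools import product

# Hypothetical alternates per concept, each with a similarity in (0, 1].
ALTERNATES = {
    "cheap": [("cheap", 1.0), ("discount", 0.8)],
    "flights": [("flights", 1.0), ("airfare", 0.7)],
}


def candidate_queries(query):
    """Return (alternate_query, score) pairs, best score first."""
    concepts = query.lower().split()  # naive parse: one concept per word
    original = " ".join(concepts)
    # Concepts with no known alternates are kept unchanged.
    options = [ALTERNATES.get(c, [(c, 1.0)]) for c in concepts]
    candidates = []
    for combo in product(*options):
        text = " ".join(term for term, _ in combo)
        if text == original:
            continue  # skip the unmodified query
        score = 1.0
        for _, similarity in combo:
            score *= similarity
        candidates.append((text, score))
    return sorted(candidates, key=lambda c: c[1], reverse=True)
```

Calling `candidate_queries("cheap flights")` ranks "discount flights" first, since it changes only the term with the closest alternate.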