摘要:
Online spelling correction/phrase completion is described herein. A computer-executable application receives a phrase prefix from a user, wherein the phrase prefix includes a first character sequence. A transformation probability is retrieved responsive to receipt of the phrase prefix, wherein the transformation probability indicates a probability that a second character sequence has been transformed into a first character sequence. A search is then executed over a trie to locate a most probable phrase completion based at least in part upon the transformation probability.
摘要:
Systems, methods and computer-storage media are provided for identifying query formulation suggestions in response to receiving a search query. A portion of a search query is received. Query formulation suggestions are identified by semantically analyzing the search query. The query formulation suggestions are used to further formulate the received search query. The query formulation suggestions include semantic-pattern-based query suggestions that are derived from semantic query patterns, one or more entities, and information associated with these entities. The query formulation suggestions are transmitted for presentation.
摘要:
Architecture that employs a modeling technique based on language modeling to estimate a probability of a document matching the user need as expressed in the query. The modeling technique is based on the data mining results that various portions of a document (e.g., body, title, URL, anchor text, user queries) use different styles of human languages. Thus, the results based on a language can be adapted individually to match the language of query. Since the approach is based on adaptation, the framework also provides a natural means to progressively revise the model as user data are collected. Different styles of languages in a document can be recognized and adapted individually. Background language models are also employed that offer a fallback approach in case the document has incomplete fields of data, and can utilize topical or semantic hierarchy of the knowledge domain.
摘要:
Systems, methods and computer-storage media are provided for identifying low-match search queries and determining comparable item matches to suggest to the user in response to a low-match query. “Low-match queries” are queries for which an insufficient number of exact item matches are available. In embodiments, exact and/or comparable item matches may be determined via semantic analysis. Also provided are systems, methods and computer-storage media for informing the user, by way of a presented indicator, or the like, that a presented item was selected for presentation based upon a similarity metric rather than being determined an exact match for the input query.
摘要:
A resource selection system is described for assisting a user in performing a task that includes multiple actions. At each stage of the task, the system presents a set resources from which the user may select to perform a subsequent action in the task. The system implicitly selects the set of resources based on context information that identifies the user's current informational needs. For example, the context information may be derived from textual information that is being presented on a user device, which the user is presumed to be viewing at the current time. In one implementation, the system selects the set of resources by computing language models for respective domains and respective entities. The system uses the language models to determine the relevance of the context information to each of the domains. The system then selects resources associated with domains that have been assessed as relevant.
摘要:
A resource selection system is described for assisting a user in performing a task that includes multiple actions. At each stage of the task, the system presents a set resources from which the user may select to perform a subsequent action in the task. The system implicitly selects the set of resources based on context information that identifies the user's current informational needs. For example, the context information may be derived from textual information that is being presented on a user device, which the user is presumed to be viewing at the current time. In one implementation, the system selects the set of resources by computing language models for respective domains and respective entities. The system uses the language models to determine the relevance of the context information to each of the domains. The system then selects resources associated with domains that have been assessed as relevant.
摘要:
Architecture that employs a modeling technique based on language modeling to estimate a probability of a document matching the user need as expressed in the query. The modeling technique is based on the data mining results that various portions of a document (e.g., body, title, URL, anchor text, user queries) use different styles of human languages. Thus, the results based on a language can be adapted individually to match the language of query. Since the approach is based on adaptation, the framework also provides a natural means to progressively revise the model as user data are collected. Different styles of languages in a document can be recognized and adapted individually. Background language models are also employed that offer a fallback approach in case the document has incomplete fields of data, and can utilize topical or semantic hierarchy of the knowledge domain.
摘要:
Technologies pertaining to providing completions to proffered prefixes are disclosed herein. A suggested completion to a proffered prefix is retrieved by walking nodes of a trie data structure, wherein a node includes one or more characters that are used to extend a character sequence represented by its parent. Each node in the trie data structure is assigned a score, wherein the score maps to a best score assigned to its descendants. The nodes of the trie data structure are sorted based upon score, and the nodes are walked based upon scores assigned thereto.
摘要:
The present invention is directed to presenting a suggested search query. Responsive to receiving a user-devised search parameter, a suggested search query is identified. The user-devised search parameter might have been previously received by a search system, or alternatively, might be a unique query that has not been previously received. A suggested search query might be generated using various techniques, such as by applying an n-gram language model. A classification of the suggested search query is determined, and the suggested search query is presented together with a visual indicator, which signifies the classification.
摘要:
A method of entering information into a mobile device includes receiving a multi-word speech input from a user, performing speech recognition on the speech input to obtain a multi-word speech recognition result, and sequentially displaying, in a display, words in the speech recognition result for user confirmation or correction, by adding one word at a time to the display. A next word is only displayed after user confirmation or correct has been received for a previously displayed word that is immediately preceding the next word in the speech recognition result. The method also includes calculating a hypothesis lattice indicative of a plurality of speech recognition hypotheses based on the speech input and, prior to finishing calculating the hypothesis lattice and while continuing to calculate the hypothesis lattice, calculating a preliminary hypothesis lattice indicative of only partial speech recognition hypotheses based on the speech input and outputting the preliminary hypotheses lattice.