摘要:
Architecture that uses content from off-page data sources such as feeds (e.g., yellow pages, coupons, social networks, commerce, etc.) to present additional, relevant information in association with search results. The additional and relevant information is directly relevant to the implicit task the user is trying to accomplish. The architecture includes online and offline mechanisms that identify an entity represented on a web page and look-up information related to that entity in disparate data sources. Relevance heuristics are employed to determine which of the available entity data to show in the caption given the user query, the web page, and the underlying user task (other known information about the user such as geographic location).
摘要:
Architecture that inserts one or more label items in search result entries. In addition to the typical search result caption (title, snippet, and link), the architecture includes the label component of one or more of the label items in the result entry. The number and type of label annotations are based on the query. When a particular label item is selected (e.g., hover, mouse click), a presentation component (e.g., expansion object, pop-up window) launches proximate to a label item in response to interaction with the label item and presents additional information from the target webpage. The additional information can include an action and data related to the search result entry and the target webpage. The data can be obtained from a data source other than the target webpage.
摘要:
Architecture that inserts one or more label items in search result entries. In addition to the typical search result caption (title, snippet, and link), the architecture includes the label component of one or more of the label items in the result entry. The number and type of label annotations are based on the query. When a particular label item is selected (e.g., hover, mouse click), a presentation component (e.g., expansion object, pop-up window) launches proximate to a label item in response to interaction with the label item and presents additional information from the target webpage. The additional information can include an action and data related to the search result entry and the target webpage. The data can be obtained from a data source other than the target webpage.
摘要:
Architecture that uses content from off-page data sources such as feeds (e.g., yellow pages, coupons, social networks, commerce, etc.) to present additional, relevant information in association with search results. The additional and relevant information is directly relevant to the implicit task the user is trying to accomplish. The architecture includes online and offline mechanisms that identify an entity represented on a web page and look-up information related to that entity in disparate data sources. Relevance heuristics are employed to determine which of the available entity data to show in the caption given the user query, the web page, and the underlying user task (other known information about the user such as geographic location).
摘要:
Architecture that generates and presents a separator interface element in association with subsnippets (of search results) to indicate to the user the closeness relationship between subsnippets. The closeness can be presented in terms of bytes, words, paragraphs, semantic distance, rhetorical relationship, and so on, which assist the user in determining how closely related the parts of the document are in which the search terms appear. The element can be a box, for example, that is sized in length proportional to an inter-subsnippet closeness relationship.
摘要:
Answers are provided to users in response to queries as a supplement to any responsive documents. Query formats for entity and attribute combinations are identified. The query formats can be substituted with entity and attribute combinations that have a corresponding attribute value to form a list of answered queries. The attribute value corresponding to an answered query can be provided when a query is received that matches an answered query.
摘要:
Answers are provided to users in response to queries as a supplement to any responsive documents. Query formats for entity and attribute combinations are identified. The query formats can be substituted with entity and attribute combinations that have a corresponding attribute value to form a list of answered queries. The attribute value corresponding to an answered query can be provided when a query is received that matches an answered query.
摘要:
Technologies are described herein for coreference resolution in an ambiguity-sensitive natural language processing system. Techniques for integrating reference resolution functionality into a natural language processing system can processes documents to be indexed within an information search and retrieval system. Ambiguity awareness features, as well as ambiguity resolution functionality, can operate in coordination with coreference resolution. Annotation of coreference entities, as well as ambiguous interpretations, can be supported by in-line markup within text content or by external entity maps. Information expressed within documents can be formally organized in terms of facts, or relationships between entities in the text. Expansion can support applying multiple aliases, or ambiguities, to an entity being indexed so that all of the possibly references or interpretations for that entity are captured into the index. Alternative stored descriptions can support retrieval of a fact by either the original description or a coreferential description.
摘要:
Architecture that enables an optional display of a longer version of each subsnippet in response to user interactions such as clicking, hovering, or other suitable form of interaction. More specifically, options are provided to display additional text from a search result at the point where a subsnippet (a subsegment in a snippet that is delimited by ellipses) ends. Selecting suitable boundaries for both initial subsnippets and expanded subsnippets enables relevant information to be presented and increased readability.
摘要:
The present technology is related to identifying, from within a corpus of documents, a subject (e.g., person, location, date, etc.) that is relevant to a topic and that is usable to enhance a topic-describing document. Documents within the corpus of documents share a link structure, such that some documents include hyperlinks that enable navigation to the topic-describing document, and the topic-describing document includes hyperlinks that enable navigation to other documents. Text of documents within the corpus is parsed to identify the subject, and a context of the subject suggests a degree of relevance of the subject to the topic. An enhancement type of the subject is determined, and a version of the topic-describing document is enhanced to include a presentation of the subject.