Abstract:
Concept disambiguation is provided for search queries by analyzing search results in conjunction with an ontology of concepts. An ontology of concepts is identified, and at least one document is associated with each concept. The document associated with a concept is representative of the concept and used to generate a concept signature. When a search query is received, it is processed to obtain search results. The search results are used to generate a search results signature, which is compared to the concept signatures to identify one or more concepts that are relevant to the search query.
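The comparison of a search results signature against concept signatures could be sketched as follows. This is a minimal illustration, not the method the abstract claims: it assumes a signature is a bag-of-words term-frequency vector and that comparison is cosine similarity, neither of which the abstract specifies. The function names and the sample data are hypothetical.

```python
import math
from collections import Counter


def signature(texts):
    """Build a term-frequency signature from a list of texts (assumed representation)."""
    counts = Counter()
    for text in texts:
        counts.update(text.lower().split())
    return counts


def cosine(a, b):
    """Cosine similarity between two term-frequency signatures."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def rank_concepts(result_texts, concept_docs):
    """Score every concept against the search results signature, best first."""
    results_sig = signature(result_texts)
    scores = {
        concept: cosine(results_sig, signature(docs))
        for concept, docs in concept_docs.items()
    }
    return sorted(scores.items(), key=lambda kv: kv[1], reverse=True)


# Hypothetical ontology: each concept has one representative document.
concept_docs = {
    "jaguar_animal": ["jaguar big cat rainforest predator"],
    "jaguar_car": ["jaguar luxury car engine sedan"],
}
# Hypothetical result snippets returned for the query "jaguar".
results = ["jaguar car dealer sedan prices", "jaguar engine review"]
ranked = rank_concepts(results, concept_docs)
print(ranked[0][0])
```

Any weighting scheme (for example TF-IDF) or similarity measure could be swapped in without changing the overall flow of signature generation followed by comparison.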
Abstract:
Referring expressions are identified for concepts by analyzing search query and result selection information. An ontology of concepts is identified, and at least one document is associated with each concept. The document associated with a concept is representative of the concept. Search query information from a search engine is analyzed to identify search queries that resulted in user selections of documents associated with the concepts. Referring expressions that refer to the concepts are identified based on the search queries that resulted in user selections of documents corresponding with the concepts. After identifying referring expressions for concepts, search queries may be mapped to referring expressions to identify concepts to which the search queries pertain, and search result pages may be generated based on knowledge of the concepts.
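One way to picture the mining of referring expressions from query and selection data is the sketch below. It assumes the search log is a list of (query, clicked document) pairs and that a query counts as a referring expression once it has led to a minimum number of selections; the threshold, the function names, and the sample log are all hypothetical and are not taken from the abstract.

```python
from collections import Counter, defaultdict


def referring_expressions(click_log, doc_to_concept, min_clicks=2):
    """Collect queries that repeatedly led to clicks on a concept's documents."""
    counts = defaultdict(Counter)
    for query, clicked_doc in click_log:
        concept = doc_to_concept.get(clicked_doc)
        if concept is not None:  # ignore clicks on documents outside the ontology
            counts[concept][query.strip().lower()] += 1
    return {
        concept: {q for q, n in query_counts.items() if n >= min_clicks}
        for concept, query_counts in counts.items()
    }


def concepts_for_query(query, expressions):
    """Map a new query to the concepts it is a known referring expression for."""
    q = query.strip().lower()
    return sorted(c for c, exprs in expressions.items() if q in exprs)


# Hypothetical log of (query, selected document) pairs.
click_log = [
    ("big apple", "doc_nyc"),
    ("Big Apple", "doc_nyc"),
    ("nyc", "doc_nyc"),
    ("nyc", "doc_nyc"),
    ("apple", "doc_fruit"),
    ("big apple", "doc_other"),
]
doc_to_concept = {"doc_nyc": "New York City", "doc_fruit": "Apple (fruit)"}

expressions = referring_expressions(click_log, doc_to_concept)
print(concepts_for_query("Big Apple", expressions))
```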
Abstract:
Technologies are described herein for coreference resolution in an ambiguity-sensitive natural language processing system. Techniques for integrating reference resolution functionality into a natural language processing system can process documents to be indexed within an information search and retrieval system. Ambiguity awareness features, as well as ambiguity resolution functionality, can operate in coordination with coreference resolution. Annotation of coreference entities, as well as ambiguous interpretations, can be supported by in-line markup within text content or by external entity maps. Information expressed within documents can be formally organized in terms of facts, or relationships between entities in the text. Expansion can support applying multiple aliases, or ambiguities, to an entity being indexed so that all of the possible references or interpretations for that entity are captured in the index. Alternative stored descriptions can support retrieval of a fact by either the original description or a coreferential description.
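The expansion step, in which a fact is indexed under every alias of its entities, might look like the sketch below. It assumes a fact is a (subject, relation, object) triple and that coreference resolution has already produced an external entity map from each mention to its aliases; the names `index_facts`, `facts`, and `aliases` are hypothetical.

```python
from collections import defaultdict


def index_facts(facts, aliases):
    """Index each fact under the original entity text and under all of its aliases."""
    index = defaultdict(set)
    for fact_id, (subject, _relation, obj) in enumerate(facts):
        for entity in (subject, obj):
            # Expansion: the original description plus every coreferential one.
            for name in {entity} | aliases.get(entity, set()):
                index[name.lower()].add(fact_id)
    return index


# Hypothetical fact extracted from "He founded the company."
facts = [("he", "founded", "the company")]
# Hypothetical entity map produced by coreference resolution.
aliases = {"he": {"John Smith"}, "the company": {"Acme"}}

index = index_facts(facts, aliases)
print(sorted(index))
```

With this layout, a lookup for either "he" or "john smith" returns the same fact, which mirrors retrieval by the original or the coreferential description.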
Abstract:
Methods and computer-readable media for associating words or groups of words distilled from content, such as reported speech or an attitude report, of a document to form semantic relationships collectively used to generate a semantic representation of the content are provided. Semantic representations may include elements identified or parsed from a text portion of the content, the elements of which may be associated with other elements that share a semantic relationship, such as an agent, location, or topic relationship. Relationships may also be developed by associating one element that is in relation to, or is about, another element, thereby allowing for rapid and effective comparison of associations found in a semantic representation with associations derived from queries. The semantic relationships may be determined based on semantic information, such as potential meanings and grammatical functions of each element within the text portion of the content.
Abstract:
Architecture that enables an optional display of a longer version of each subsnippet in response to user interactions such as clicking, hovering, or another suitable form of interaction. More specifically, options are provided to display additional text from a search result at the point where a subsnippet (a subsegment of a snippet that is delimited by ellipses) ends. Selecting suitable boundaries for both initial and expanded subsnippets allows relevant information to be presented and improves readability.
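Boundary selection for an expanded subsnippet could be approximated as in the following sketch. It assumes the subsnippet is given as character offsets into the result text and that a good boundary is the last sentence end inside a fixed expansion window, with a word boundary as fallback; the abstract does not say how boundaries are chosen, so the heuristic and the name `expand_subsnippet` are hypothetical.

```python
def expand_subsnippet(text, start, end, extra=80):
    """Return a longer subsnippet that ends on a sentence or word boundary."""
    limit = min(len(text), end + extra)
    # Prefer the last sentence boundary inside the expansion window.
    cut = text.rfind(". ", end, limit)
    if cut != -1:
        new_end = cut + 1  # keep the period, drop the following space
    elif limit == len(text):
        new_end = limit  # the window reaches the end of the text
    else:
        # Fall back to the last word boundary so no word is cut in half.
        space = text.rfind(" ", end, limit)
        new_end = space if space != -1 else limit
    return text[start:new_end]


text = (
    "Search engines show snippets. "
    "Each snippet may hold several subsnippets. "
    "Users can expand them."
)
initial_end = len("Search engines show")  # the initial subsnippet stops mid-sentence
print(expand_subsnippet(text, 0, initial_end, extra=20))
```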
Abstract:
Architecture that generates and presents a separator interface element in association with subsnippets (of search results) to indicate to the user the closeness relationship between subsnippets. The closeness can be presented in terms of bytes, words, paragraphs, semantic distance, rhetorical relationship, and so on, which helps the user determine how closely related the parts of the document containing the search terms are. The element can be a box, for example, whose length is proportional to the inter-subsnippet closeness relationship.
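Sizing a separator in proportion to inter-subsnippet distance can be illustrated with the sketch below. It assumes closeness is measured as the character gap between consecutive subsnippet spans and that widths are scaled against the largest gap; the scaling rule, the `max_width` parameter, and the function name are hypothetical choices rather than details from the abstract.

```python
def separator_widths(spans, max_width=20):
    """Return one separator width per gap between consecutive subsnippet spans.

    spans: list of (start, end) character offsets, sorted by start.
    """
    gaps = [max(0, nxt[0] - cur[1]) for cur, nxt in zip(spans, spans[1:])]
    largest = max(gaps, default=0)
    if largest == 0:
        return [0 for _ in gaps]
    # Scale each gap against the largest one; any non-zero gap gets at least width 1.
    return [max(1, round(max_width * g / largest)) if g else 0 for g in gaps]


# Hypothetical subsnippet offsets within one document.
spans = [(0, 50), (60, 120), (520, 580)]
widths = separator_widths(spans)
for width in widths:
    print("[" + "-" * width + "]")
```

The same function works for other distance units (words, paragraphs) as long as the spans are expressed in that unit.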
Abstract:
A method and system are provided to report the findings of an expert's analysis of image data. The method and system are based on a reporting system that forms the basis of an image management system that can efficiently and systematically generate image reports, facilitate data entry into searchable databases for data mining, and expedite billing and collections for the expert's services. The expert identifies a significant finding on an image and attaches a location:description code to the location of that finding in order to create a significant finding and an entry into a database. Further descriptions of that finding, such as dimensional measurements, may be automatically appended to the finding as secondary attributes. After the evaluation, the system sorts the findings in the database and presents the findings by prioritized categories. The expert edits and approves a multimedia report which may be delivered by electronic means to an end-user.
Abstract:
Techniques are provided for segmenting text into categorized discourse constituents and attaching discourse constituents into a structural representation of discourse. Techniques for determining hybrid structural and non-structural summaries of a text are also provided. A text is segmented based on a theory of discourse analysis into at least a main discourse constituent containing spatio-temporal information about a single event in a possible world view. The discourse constituents are then inserted into a structural representation of discourse. Non-structural techniques are used to determine relevance scores, from which important discourse constituents are identified. Relevance scores are percolated through the structural representation of discourse to determine supporting preceding discourse constituents that preserve grammaticality. A hybrid text summary is then determined based on the structural representation of the discourse and the relevance scores.
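The percolation of relevance scores through a discourse structure can be pictured with the rough sketch below. It assumes the structure is a tree in which each constituent points to the preceding constituent it attaches to, and that percolation raises every ancestor's score to the highest score among its descendants so that supporting constituents survive into the summary. The abstract does not define the percolation rule, so this rule, the threshold, and all names are hypothetical.

```python
def percolate(scores, parent):
    """Raise each ancestor's score to the maximum score among its descendants."""
    out = dict(scores)
    for node, score in scores.items():
        p = parent.get(node)
        while p is not None:
            out[p] = max(out[p], score)
            p = parent.get(p)
    return out


def summarize(order, scores, parent, threshold):
    """Keep constituents whose percolated score meets the threshold, in text order."""
    final = percolate(scores, parent)
    return [node for node in order if final[node] >= threshold]


# Hypothetical discourse tree: c2 and c4 attach to c1, and c3 attaches to c2.
order = ["c1", "c2", "c3", "c4"]
parent = {"c1": None, "c2": "c1", "c3": "c2", "c4": "c1"}
# Hypothetical relevance scores from a non-structural technique.
scores = {"c1": 0.2, "c2": 0.1, "c3": 0.9, "c4": 0.3}

summary = summarize(order, scores, parent, threshold=0.5)
print(summary)
```

Here only c3 is important on its own, but percolation pulls in c2 and c1, the preceding constituents it depends on, while c4 is dropped.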