摘要:
Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for information retrieval. In one aspect, a method includes receiving a fact query; determining an expected type of answer to the fact query; identifying search results responsive to the fact query; identifying phrases from the resources corresponding to the search results that correspond to a form of the expected type; determining a score for each of the identified phrases; and presenting an answer to the fact query using the scores.
摘要:
Methods, systems, and apparatus, including computer programs stored on computer storage media, for retrieval and display of information from an electronic document collection. One aspect can be embodied in machine-implemented methods that include the actions of receiving description data describing a preexisting structured presentation, comparing characteristics of the preexisting structured presentation with content of electronic documents in an unstructured collection of electronic documents to locate electronic documents that identify a new attribute that is relevant to the preexisting structured presentation, adding an identifier of the new attribute to the preexisting structured presentation to form an expanded structured presentation, and outputting instructions for presenting the expanded structured presentation.
摘要:
Methods, systems, and apparatus, including computer programs stored on computer storage media, for retrieval and display of information from an electronic document collection. In one aspect, a machine-implemented method includes the actions of receiving description data describing a preexisting structured presentation, comparing characteristics of the preexisting structured presentation with content of electronic documents in an unstructured collection of electronic documents to locate electronic documents that identify a new instance that is relevant to the preexisting structured presentation, adding an identifier of the new instance to the preexisting structured presentation to form an expanded structured presentation, and outputting instructions for presenting the expanded structured presentation.
摘要:
Methods, systems, and apparatus, including computer programs stored on computer storage media, for populating a structured presentation with new values. One aspect can be embodied in machine-implemented methods that include the actions of receiving description data describing a preexisting structured presentation, comparing characteristics of the preexisting structured presentation with content of electronic documents in an unstructured collection of electronic documents to locate electronic documents that identify a new value that is relevant to the preexisting structured presentation, adding the new value to the preexisting structured presentation to form a new structured presentation, and outputting instructions for visually presenting the new structured presentation.
摘要:
Methods, systems, and apparatus, including computer programs stored on computer storage media, for retrieval and display of information from an electronic document collection. One aspect can be embodied in machine-implemented methods that include the actions of receiving description data describing a preexisting structured presentation, comparing characteristics of the preexisting structured presentation with content of electronic documents in an unstructured collection of electronic documents to locate electronic documents that identify a new attribute that is relevant to the preexisting structured presentation, adding an identifier of the new attribute to the preexisting structured presentation to form an expanded structured presentation, and outputting instructions for presenting the expanded structured presentation.
摘要:
Methods, systems, and apparatus, including computer programs stored on computer storage media, for retrieval and display of information from an unstructured electronic document collection. One aspect can be embodied in machine-implemented methods that include the actions of receiving a machine-readable search query from a user and responding to the search query with instructions for presenting the user with a structured presentation of instances relevant to the search query. A visual presentation of the structured presentation denotes associations between the instances and values that characterize attributes of the instances by virtue of an arrangement of identifiers of the instances and the values.
摘要:
Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for identifying a group of related instance identifiers. In one aspect, a computer storage medium is encoded with a computer program. The program comprises instructions that when executed by data processing apparatus cause the data processing apparatus to perform operations. The operations include receiving a search query at a data processing apparatus, the search query specifying attributes shared by a group of related instances, searching an electronic document collection to identify instance identifiers that are responsive to the search query, representing features of the instance identifiers in a vertex-edge graph, and scoring relevance of the instance identifiers to the search query according to the features represented in the vertex-edge graph.
摘要:
Methods, systems, and apparatus, including computer programs stored on computer storage media, for accessing a search interface from a structured presentation. One aspect can be embodied in machine-implemented methods that include the actions of displaying a structured presentation on a display device, receiving data characterizing a user interaction with the displayed structured presentation, the data including a specification of a first instance and a first attribute of the structured presentation, and displaying a formerly concealed search interface on the display device in response to receiving the data.
摘要:
Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for improving search with user corrections. In one aspect, a methods performed by a data processing apparatus include the actions of receiving a value result set, accessing historical records of user corrections stored at one or more data storage devices, the historical records describing user corrections of the characterization of instance attributes by values, determining that the historical records of user corrections describe a first user correction involving a first value in the value result set, and changing a confidence parameter embodying a confidence that the first value correctly characterizes the attribute of the instance. The value result set comprises a collection of one or more values. The values are candidates for characterizing an attribute of an instance. The first value is involved in the correction as either a corrected value or an uncorrected value.
摘要:
A corpus contains information including text from books and metadata about the books. The book texts mention entities of various types, such as location, date, event, and person entities. An entity importance engine analyzes the information in the corpus to identify the entities mentioned therein, and ranks the entities using query-independent importance scores. The importance scores can be based in part on the contexts in which the entities are mentioned by the books. A presentation engine enables searching of the corpus to identify books satisfying a search query. The presentation engine presents information about a designated book. Entities mentioned in the book are presented in a style selected to enhance comprehension and utility. For example, location entities can be presented on a map with markers showing the locations of the entities. The number of entities presented are limited and ordered based on the query-independent importance scores.