Abstract:
Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for providing a representation based on structured data in resources. The methods, systems, and apparatus include actions of obtaining search results for a search query. Each search result references a respective resource, the respective resources including a first resource and a second resource. Additional actions include determining a consistent value, that is consistent with both a value for a particular property as indicated by structured data in the first resource and a value for the particular property as indicated by structured data in the second resource. Further actions include in response to determining a consistent value, providing (i) a representation of the consistent value as a possible answer to the search query, and (ii) the search results.
Abstract:
Techniques for identifying documents sharing common underlying structures in a large collection of documents and processing the documents using the identified structures are disclosed. Images of the document collection are processed to detect occurrences of a predetermined set of image features that are common or similar among forms. The images are then indexed in an image index based on the detected image features. A graph of nodes is built. Nodes in the graph represent images and are connected to nodes representing similar document images by edges. Documents sharing common underlying structures are identified by gathering strongly inter-connected nodes in the graph. The identified documents are processed based at least in part on the resulting clusters.
Abstract:
Systems and methods offer a search system with third-party provided search applications that are triggered in response to specified queries and run at the search system. For example, a method may include determining that a query triggers a third party search application hosted at the search system, extracting a parameter from the query based on a query template, executing the third party search application with the parameter in a sandboxed manner at the search system to generate a third-party formatted answer for the query, and providing the third-party formatted answer as a search result for the query. The third party may provide the query template, parameter attributes, and the third party formatted answer. The third party search application may be stored at the search system and include the query template, a data store, the parameter attributes, and instructions for accessing the data store using the parameter.
Abstract:
Techniques for identifying documents sharing common underlying structures in a large collection of documents and processing the documents using the identified structures are disclosed. Images of the document collection are processed to detect occurrences of a predetermined set of image features that are common or similar among forms. The images are then indexed in an image index based on the detected image features. A graph of nodes is built. Nodes in the graph represent images and are connected to nodes representing similar document images by edges. Documents sharing common underlying structures are identified by gathering strongly inter-connected nodes in the graph. The identified documents are processed based at least in part on the resulting clusters.