Abstract:
A system comprises a first speech recognition engine, a second speech recognition engine, and evaluation logic coupled to the first and second speech recognition engines. The evaluation logic evaluates the first and second speech recognition engines based on evaluation voice signals from a user and, based on the evaluation, selects one of said speech recognition engines to process additional speech signals from the user.
Abstract:
A method for determining a logical structure of a document. At least one candidate table of contents page of the document is determined, wherein the table of contents page comprises a plurality of table of contents words. The document comprises at least one table of contents page and a plurality of body pages, wherein at least one body page comprises a plurality of body words. A dictionary is built for the candidate table of contents page, the dictionary comprising at least one table of contents word. One body page is analyzed for at least one text match between the plurality of body words and the table of contents words of the dictionary. Based on an evaluation the text match, it is determined whether the body page is a title page.
Abstract:
In at least some embodiments, a system may comprise a user voice interface, a processor coupled to the user voice interface, wherein the processor interprets words spoken by a user, and a memory coupled to the processor, wherein the memory stores an email application, wherein the email application summarizes email messages and navigates a plurality of email messages according to the words spoken by the user.
Abstract:
Embodiments enable searching of portions of objects in images, including programmatically analyzing each image in a collection in order to determine image data that, for individual images in the collection, represents one or more visual characteristics of a portion of an object shown in that image. A user is enabled to specify one or more search criteria that includes image data, and a search result may be determined based on one or more images in the collection that show a corresponding object that has a portion that satisfies a threshold. The threshold is defined at least in part by the one or more search criteria.
Abstract:
Various embodiments enable a computing device to perform tasks such as processing an image to recognize text or an object in an image to identify a particular product or related products associated with the text or object. In response to recognizing the text or the object as being associated with a product available for purchase from an electronic marketplace, one or more advertisements or product listings associated with the product can be displayed to the user. Accordingly, additional information for the associated product can be displayed, enabling the user to learn more about and purchase the product from the electronic marketplace through the portable computing device.
Abstract:
A user-specified layout of graphic objects is received. A relative layout specification is derived from the user-specified layout. The relative layout specification describes relative positions of the graphic objects. A final layout of the graphic objects is determined in accordance with the relative layout specification. The final layout includes a specification of size dimensions and positions of the graphic objects on a page. The final layout of graphic objects is output.
Abstract:
Images are analyzed by programmatic mechanisms for assessing one or more remote web pages to retrieve content on display at remote web pages. The retrieved images may be analyzed to determine information about an object shown in a corresponding images of the content on display. At least a portion of the object shown in the corresponding image of the content on display may be made selectable and associated with the determined information. This determined information may subsequently be used, in for example, search applications.
Abstract:
Embodiments described herein provide a system and method for providing merchandise items at a network site. According to an embodiment, an image of a merchandise item is obtained. The image is programmatically analyzed to determine information about the merchandise item. The information is used to generate a presentation that includes the merchandise item.
Abstract:
Embodiments described herein provide for a system for creating a data collection of recognized images. The system includes an image analysis module that is configured to programmatically analyze individual images in a collection of images in order to determine information about each image in the collection. The system may also include a manual interface that is configured to (i) interface with one or more human editors, and (ii) displays a plurality of panels concurrently. Individual panels may be provided for one or more analyzed images, and individual panels may be configured to display information that is at least indicative of the one or more images of that panel and/or of the information determined from the one or more images. Additionally, the manual interface enables the one or more human editors to view the plurality of panels concurrently and to interact with each of the plurality of panels in order to correct or remove any information that is incorrectly determined from the image of that panel.
Abstract:
In at least some embodiments, a system may comprise a user voice interface, a processor coupled to the user voice interface, wherein the processor interprets words spoken by a user, and a memory coupled to the processor, wherein the memory stores an email application, wherein the email application summarizes email messages and navigates a plurality of email messages according to the words spoken by the user.