Abstract:
Various technologies and techniques detect lists in vector graphics based documents and use them in meaningful ways. The system detects at least one list in a vector graphics based document using a set of rules. Pattern detection logic identifies characters, symbols, numbers, letters, and/or images that may start a list. Additional pattern detection logic determines if a list exists. The system can identify and parse bulleted lists, numbered or lettered lists, and nested lists that are any combination of both. Once identified, the content is translated into a modified format. The content can be output to a destination application in the modified format that is more suitable for output or use by the destination application.
Abstract:
A content application employs page links to merge pages of articles. The content application retrieves an initial page of an article. An article such as a web article spread into multiple pages is retrieved for analysis. A page link of a following page of the article is detected within the initial page. The page link is a top choice among candidates sorted based on a weight score. The following page is retrieved using the page link and appended into the initial page to form an aggregate article. The aggregate article is presented for consumption.
Abstract:
A reader application (104) presents fixed format documents (112) in reflowed format (108). A selection control (220) is presented to display a reflow view after identifying text-based sections within a displayed content. The reflow view is generated using the sections upon an action on the selection control to display the reflow view. The reflow view is displayed in a location associated with the section within the displayed content or in place of the displayed content The reflow view is adjusted based on a screen size of the reader application.
Abstract:
Metadata based enhanced search capabilities are provided in conjunction with e-reader experience. Metadata associated with content and objects within displayed content is created based on default and/or custom rules. The metadata such as attributes, tags, etc. is indexed and used to augment search queries based on selected portions of the displayed content. Additional search augmentation is performed based on user-defined metadata or processing of objects such as image recognition of image objects and similar processes.
Abstract:
Flat content (or flat book) is converted to rich content (or a rich book) in an e-reader environment through key term identification, context-based search using identified key terms, ranking of search results, and comparison of the ranked search results against a predefined threshold, customizable by a user or an administrator. Textual, graphic, audio, video, and comparable objects from search results above the predefined threshold may be augmented in suitable locations within the flat content such as interactive text, audio/video object, images, additional text, or other interactive elements.
Abstract:
Context based enhanced search capabilities are provided in conjunction with e-reader experience. A search query may be augmented based on a context of the displayed content, user information, and similar background data such that relevancy and context of search results may be focused in presentation to the user. Context of content may include terms or objects near a selected portion of the content for search, a type of the displayed content, etc. Search results may be presented from a variety of sources categorized and/or prioritized based on system default parameters and/or user preferences.
Abstract:
Speaker authentication is performed by determining a similarity score for a test utterance and a stored training utterance. Computing the similarity score involves determining the sum of a group of functions, where each function includes the product of a posterior probability of a mixture component and a difference between an adapted mean and a background mean. The adapted mean is formed based on the background mean and the test utterance. The speech content provided by the speaker for authentication can be text-independent (i.e., any content they want to say) or text-dependent (i.e., a particular phrase used for training).
Abstract:
A content application determines images of an article for extraction. The content application identifies an initial image associated with a content of the article. A caption and a credit line associated with the initial image is detected and the initial image is extracted along with the caption and the credit line. A second image of the article associated with a video is also detected and extracted along with the video. In addition, the content application extracts a slideshow detected within the article.
Abstract:
An e-reader application displays a page of a fixed format document resized to match a size of a window pane in a fit page format. The e-reader application presents selection controls associated with a selection action in response to detecting the selection action on a text of the page. In addition, an operation associated with a navigation action is executed in response to detecting the navigation action on the page.
Abstract:
An e-reader application preserves a layout of a region of content during a modification. A detected modification of a region of the content initiates an analysis for how to preserve the region. A configuration located within the content determines a behavior for preserving the region. The configuration is evaluated and compared to other configurations to determine overriding attributes to apply to the region during the modification. The attributes are applied to the associated objects in the region while applying the modification to other objects to maintain a pre-modification appearance of the region after the modification.