摘要:
A system and method for authoring a media presentation including a media presentation environment representation having a portion defined as a hot spot associated with a media presentation device. Various embodiments include a hyper-slide listing portion, a media presentation authoring portion, and/or a media presentation device listing portion. Various embodiments include an integrated presentation authoring preview environment. The method includes selecting a physical device for a presentation unit in the media presentation environment, manipulating a visual representation of the presentation unit, recording a display of the presentation unit, and previewing the presentation in an augmented reality environment, a virtual reality environment, or both. Various embodiments operate with a plurality of types of media presentation devices and a plurality of each type of device.
摘要:
A method, information system, and computer-readable medium is provided for segmenting a plurality of data, such as multimedia data, and in particular an image document stream. Segment boundary points may be used for retrieving and/or browsing the plurality of data. Similarly, segment boundary points may be used to summarize the plurality of data. Examples of image document streams include video, PowerPoint slides, and NoteLook pages. A genetic method having a fitness or evaluation function using information retrieval concepts, such as importance and precedence, is used to obtain segment boundary points. The genetic method is able to evaluate a large amount of data in a cost effective manner. The genetic method is also able to run incrementally on streaming video and adapt to usage patterns by considering frequently accessed images.
摘要:
A semi-automatic system for scanning a document includes an image capture device, such as a digital or video camera, which records a sequence of images while a user waves a document in front of the device. The user can present multiple pages of a document to the image capture device, after which the total sequence of images are processed to identify a clear image of each page from the sequence of images. The system further includes image processing techniques to correct for motion blurring, acceleration and perspective errors. The system is capable of processing any size or shape of document without destroying the organization or format of the original.
摘要:
Described is system that characterizes segments of document with one or more keyphrases and then uses keyphrases to help users find interesting parts of document. Keyphrases are displayed with information about the location of the phrase in the document and are used as pointers to quickly move to from overview to section of potential interest. In another implementation, when there are many documents in a collection, inventive multi-document view can be used to reduce number of documents presented, helping user to more efficiently find documents of interest. In this view, a user (possibly repeatedly) filters documents displayed based on metadata values. In one implementation, icons corresponding to documents are displayed on a display device together with metadata corresponding to the documents. When the value of the metadata is selected by the user, display state of the icons corresponding to document is varied based on selected value of metadata.
摘要:
Systems and methods for interactive, user-driven detection, creation and completion of form fields in a digital document are provided. A document with form fields that require completion by a user is received, after which form fields are detected at the direction of the user. Once the user selects a possible form field, the system creates the appropriate fillable form field based on size, type, location, related text and other parameters of the form field and surrounding document. Additional levels of interaction include predictive text, pattern development and automatic completion of previously completed fields.
摘要:
Systems and methods provide for mixed use of physical documents and a computer, and more specifically provide for detailed interactions with fine-grained content of physical documents that are integrated with operations on a computer to provide for improved user interactions between the physical documents and the computer. The system includes a camera which processes the physical documents and detects gestures made by a user with respect to the physical documents, a projector which provides visual feedback on the physical document, and a computer with a display to coordinate the interactions of the user with the computer and the interactions of the user with the physical document. The system, which can be portable, is capable of detecting interactions with fine-grained content of the physical document and translating interactions at the physical document with the computer display, and vice versa.
摘要:
The present invention relates to a method to make effective use of display space. In an embodiment of the invention, given a heterogeneous set of images along with metadata or nearby text, similar images are recursively clustered into a k-tree using the k-means algorithm. In an embodiment of the invention, the invention is particularly useful for showing image search results on small mobile devices.
摘要:
Recorded video is accessed from printed notes or summaries derived from the video. Summaries may be created automatically by analyzing the recorded video, and annotations are made by a user on a device for note-taking with digital ink and video. The notes and/or summaries are printed along with data glyphs that provide time based indexes or offsets into the recorded video. The indexes or offsets are retrieved by scanning the glyph on the printout. The glyph information can be embedded in the printouts in many ways. One method is to associate block glyphs with annotations or images on the printed pages. Another method is to provide an address carpet in an annotated timeline. Yet another method is to provide a two-dimensional address carpet with X-Y position mapped to time which can be used to provide selected access to the video. The accessed video may be played back on the note-taking device on a pen computer, or on a summary interface on a Web browser-type device.
摘要:
Described is a system that characterizes segments of a document with one or more keyphrases and then uses the keyphrases to help users find interesting parts of a document. The keyphrases are displayed with information about the location of the phrase in the document and are used as pointers to quickly move to from an overview to a section of potential interest.
摘要:
Techniques are provided for determining relevant information from a document based on document structure. A document is selected and structural elements within the document having a dominance relationship are determined. A first location within the document is selected. The structural element surrounding the first location is determined and the surrounding and non-surrounding structural elements are characterized. Additional documents are associated with the first location in the surrounding structural element based on the surrounding structural element characterization and the non-surrounding structural element characterization. Techniques for dynamically determining annotations for images based on document structure are also provided.