Abstract:
A method and apparatus for providing multi-resolution video to multiple users under hybrid human and automatic control. Initial environment and close-up images are captured using a first camera and a PTZ camera. The initial images are then stored in memory. Current environment and close-up images are captured, and an estimated difference between the initial and current images and the true image is determined. The estimated differences are weighted and compared, and the stored images are updated. A close-up image is then provided to each user of the system. The close-up camera is then directed to a portion of the environment image having high distortion, and current environment and close-up images are captured again.
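The last step, directing the close-up camera to the most distorted portion of the environment image, might be sketched as follows. This is a minimal illustration only: it uses the squared pixel difference between the stored and current images as a stand-in for the estimated distortion, and the function name `pick_tile`, the square tiling, and the per-tile `weights` grid are all assumptions that do not come from the abstract.

```python
def pick_tile(stored, current, weights, tile):
    """Return (tile_row, tile_col) of the tile with the largest weighted error.

    stored, current: 2-D lists of grayscale pixel values with equal shape.
    weights: 2-D list with one weight per tile (hypothetical, e.g. user interest).
    tile: edge length of a square tile in pixels.
    """
    rows, cols = len(stored), len(stored[0])
    best, best_score = None, -1.0
    for r in range(0, rows, tile):
        for c in range(0, cols, tile):
            # Squared difference summed over the tile: a proxy for distortion.
            err = sum(
                (stored[y][x] - current[y][x]) ** 2
                for y in range(r, min(r + tile, rows))
                for x in range(c, min(c + tile, cols))
            )
            score = weights[r // tile][c // tile] * err
            if score > best_score:
                best, best_score = (r // tile, c // tile), score
    return best


stored = [[0] * 4 for _ in range(4)]
current = [[0] * 4 for _ in range(4)]
current[0][0] = 2   # small change in the top-left tile
current[3][3] = 3   # larger change in the bottom-right tile

uniform = [[1, 1], [1, 1]]
biased = [[5, 1], [1, 1]]   # top-left tile weighted more heavily
target_uniform = pick_tile(stored, current, uniform, 2)
target_biased = pick_tile(stored, current, biased, 2)
```

With uniform weights the camera would be pointed at the tile with the largest raw change. Raising one tile's weight can redirect it, which is one way to read the weighting and comparison described above.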
Abstract:
A method, system, and apparatus for easily creating a video collage from a video are provided. By segmenting the video into a set number of video segments and providing an interface for a user to select images which represent the video segments and insert the selected images into a video collage template, a video collage may be created in a short amount of time. The system is designed to assign values to the video inserted in a video collage and compact the video based on these values, thereby creating a small file which may be easily stored or transmitted.
Abstract:
A system for automatically generating indexes for handwritten notes captured as digital ink in a computer is disclosed. Ink words are identified, and features of the ink words are computed. Pairwise distances or match scores, which measure the distance in the features between two ink words, are calculated. A clustering technique selects equivalence classes of ink words. Index terms, which are non-uniform throughout the notes, are selected from the equivalence classes of ink words. The system generates an index from the index terms, including displaying page numbers where the index terms are located in the notes as well as hyperlinking the index terms. A technique to identify a threshold for use in clustering the ink words is also disclosed.
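The clustering and indexing steps could look roughly like the sketch below. It is a sketch under stated assumptions: the abstract does not name the features, the distance measure, or the clustering technique, so Euclidean distance over hypothetical feature tuples and single-linkage grouping with a fixed threshold are used here purely for illustration. The names `cluster_ink_words` and `build_index` are likewise invented.

```python
import math
from itertools import combinations


def cluster_ink_words(features, threshold):
    """Group ink words into equivalence classes (single linkage, union-find).

    features: one feature tuple per ink word (hypothetical representation).
    threshold: two words are linked when their distance is below this value.
    Returns a sorted list of classes, each a list of word indices.
    """
    parent = list(range(len(features)))

    def find(i):
        # Follow parent links to the class representative, compressing the path.
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    for i, j in combinations(range(len(features)), 2):
        if math.dist(features[i], features[j]) < threshold:
            parent[find(i)] = find(j)

    classes = {}
    for i in range(len(features)):
        classes.setdefault(find(i), []).append(i)
    return sorted(classes.values())


def build_index(classes, pages):
    """List the pages for every class that contains more than one ink word."""
    return [sorted({pages[i] for i in cls}) for cls in classes if len(cls) > 1]


features = [(0.0, 0.0), (0.1, 0.0), (5.0, 5.0), (5.1, 5.0), (10.0, 0.0)]
pages = [1, 2, 1, 3, 2]   # page on which each ink word appears
classes = cluster_ink_words(features, threshold=0.5)
index = build_index(classes, pages)
```

Treating every repeated class as an index term is a simplification. The abstract selects terms by how non-uniformly they occur across the notes, which this sketch does not attempt.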
Abstract:
The present invention analyzes recorded video from a video camera to identify camera and object motion in the recorded video. Keyframes representative of clips of the recorded video are displayed on a user interface that allows a user to manipulate an order of the keyframes. Editing rules are then applied to the keyframes to intelligently splice together portions of the representative clips into a final output video.
Abstract:
In an embodiment of the invention, an electronic document (e-document) can be searched and found by capturing an image of the printed document. Instead of typing in a file name or searching through multiple directories, the user simply takes a picture of the document with a camera and the system uses the document image to locate the e-document. In an alternative embodiment of the invention, an image of a printed document can be useful for remote document sharing. In various embodiments of the invention, sharing an image of a printed document can be used to email a high quality paper document, send a high quality fax, or open a document to a page containing an annotation. Through co-design of the feature extraction and search algorithm in the system, the image feature detection robustness and search speed are improved at the same time.
Abstract:
Systems and methods for bookmarking multimedia documents include displaying multiple multimedia streams, creating bookmarks comprising time signatures and snapshots of each multimedia stream based upon single action cues from a user, associating snapshots with portions of multimedia streams, displaying bookmarks and displaying portions of a multimedia stream associated with selected snapshots.
Abstract:
A method of extracting audio excerpts comprises: segmenting audio data into a plurality of audio data segments; setting fitness criteria for the plurality of audio data segments; analyzing the plurality of audio data segments based on the fitness criteria; and selecting one of the plurality of audio data segments that satisfies the fitness criteria. In various exemplary embodiments, the method of extracting audio excerpts further comprises associating the selected one of the plurality of audio data segments with video data. In such embodiments, associating the selected one of the plurality of audio data segments with video data may comprise associating the selected one of the plurality of audio data segments with a keyframe.
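The segment, analyze, and select sequence can be illustrated with a short sketch. The abstract does not say what the fitness criteria are, so this example assumes fixed-length segments and a single criterion, a minimum mean energy. The function names and the tie-breaking rule (prefer the highest-energy qualifying segment) are hypothetical.

```python
def segment_audio(samples, size):
    """Split a list of samples into consecutive segments of at most `size` samples."""
    return [samples[i:i + size] for i in range(0, len(samples), size)]


def mean_energy(segment):
    """Average squared amplitude of a segment."""
    return sum(x * x for x in segment) / len(segment)


def select_excerpt(samples, size, min_energy):
    """Return the index of the qualifying segment with the highest energy.

    A segment qualifies when its mean energy is at least `min_energy`
    (the assumed fitness criterion). Returns None when no segment qualifies.
    """
    best = None
    for idx, seg in enumerate(segment_audio(samples, size)):
        e = mean_energy(seg)
        if e >= min_energy and (best is None or e > best[1]):
            best = (idx, e)
    return None if best is None else best[0]


samples = [0, 0, 0, 0, 1, -1, 1, -1, 0.5, 0.5, 0.5, 0.5]
chosen = select_excerpt(samples, size=4, min_energy=0.2)
none_fit = select_excerpt(samples, size=4, min_energy=2.0)
```

The returned index could then be paired with a keyframe covering the same time span, in the spirit of the embodiments that associate the excerpt with video data.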
Abstract:
The multimedia content browsing system for small mobile devices smoothly blends three key tasks: querying the multimedia content by keywords, exploring the search results by viewing keyframes of the multimedia content, and playing a stream of the multimedia content, e.g., videos or video segments. Videos can be stored in a segment-based multimedia content database, which is designed to support the browsing, retrieval, and reuse of videos. A layered imaging model is introduced where each layer may have its own transparency value set individually, continuously, and interactively, and the layers can overlap on top of each other when rendered on the screen. Since a small mobile device alone may not have enough resources to handle the entire task of multimedia content browsing, a scalable architecture can be adopted to break up the task among the small mobile device, a Hard Disk Drive (HDD), and a resource-rich computing device.
Abstract:
A heuristically derived unsuitability score is computed for each video segment selected from video recorded by a video camera and used as an input to a system of metaphorical springs. Each selected segment is associated with a metaphorical spring that maintains the segment at an optimal length while being responsive to a global system spring, whose spring strength determines the final length of the edited output video. Accordingly, user-specified changes to the final length of the output video automatically lengthen or shorten the individual segments in such a way that high quality video segments having low unsuitability scores are emphasized over low quality video segments having high unsuitability scores.
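One way to picture the spring metaphor is as springs joined end to end, each with a rest length equal to the segment's optimal length. The sketch below is a simplified illustration and not the patented formulation. It replaces the global system spring with a hard target length, and it assumes a direction-dependent compliance: when shortening, a segment gives up length in proportion to its unsuitability; when lengthening, it gains length in inverse proportion. The function name and both of these rules are assumptions.

```python
def fit_segment_lengths(optimal, unsuitability, target_total):
    """Distribute a target total length across segments.

    optimal: rest (optimal) length of each segment, in seconds.
    unsuitability: positive score per segment; higher means lower quality.
    target_total: requested length of the final video, in seconds.
    Lengths are not clamped, so an aggressive target can drive a poor
    segment's length below zero; a real system would need to handle that.
    """
    slack = target_total - sum(optimal)
    if slack < 0:
        # Shortening: low-quality segments are more compliant and shrink more.
        compliance = list(unsuitability)
    else:
        # Lengthening: high-quality segments are more compliant and grow more.
        compliance = [1.0 / u for u in unsuitability]
    # A common "force" acts on all springs; each moves by force * compliance.
    force = slack / sum(compliance)
    return [length + force * c for length, c in zip(optimal, compliance)]


optimal = [10.0, 10.0]
unsuitability = [1.0, 3.0]   # second segment is lower quality
shorter = fit_segment_lengths(optimal, unsuitability, 16.0)
longer = fit_segment_lengths(optimal, unsuitability, 24.0)
```

In both directions the better segment ends up with the larger share of the final video, which is the behavior the abstract describes as emphasizing segments with low unsuitability scores.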