摘要:
A document organizing system extracts annotations made to a document along with the context surrounding each annotation and organizes the annotations based upon the annotation attributes and/or context. The annotations are created by grouping marks based upon their proximity in time and space. The document is segmented to determine a minimum context associated with each annotation. A list of the annotations sorted by the attributes are then displayed to the user. The context provided by the invention for each annotation allows the user to fully understand the annotation.
摘要:
When dynamically grouping a plurality of graphic objects, such as displayed on a graphic input display apparatus, a cluster tree is formed for the plurality of graphic objects. The cluster tree is based on a plurality of different types of distance measures. These include a time distance and a spatial distance. These distances are combined to form a distance metric indicting a distance between a pair of the graphic objects. Each level of the cluster tree defines a new cluster of the graphic objects. At least one of the graphic objects is selected. The different cluster levels of the cluster tree containing the selected graphic object are displayable. The displayed cluster of the graphic objects can be modified to increase or decrease the cluster level of the cluster containing the selected graphic object.
摘要:
In one aspect, the present invention is directed to a method and an apparatus for organizing digital media, particularly digital photos, using face recognition. According to a first aspect of the present invention, a computer-based method for organizing digital photos comprises: extracting objects of interest from a plurality of photographs; cropping said plurality of photographs to generate images of isolated objects of interest; applying a recognition algorithm to determine the similarity of isolated objects of interest with a reference; displaying a plurality of objects arranged as a function of the determined similarity; and receiving user input to associate said objects with a particular classification.
摘要:
The multimedia content browsing system for small mobile devices smoothly blends three key tasks: querying the multimedia contents by keywords, exploring the search results by viewing keyframes of the multimedia contents, and playing a stream of the multimedia contents, e.g., videos or video segments. Videos can be stored in a segment-based multimedia content database, which is designed to support the browsing, retrieval, and reuse of videos. A layered imaging model is introduced where each layer may have its own transparent value set individually, continuously, and interactively, and the layers can overlap on top of each other when rendered on the screen. Since a small mobile device alone may not have enough resources to handle the entire task of multimedia content browsing, a scalable architecture can be adopted to break up the task among the small mobile device, a Hard Disk Drive (HDD), and a resource-rich computing device.
摘要:
The invention provides for quickly browsing through a large set of video clips to locate video clips of interest. In an embodiment of the present invention, hierarchical clustering of the video clips can be undertaken enabling the user to successively identify the subgroup of video clips of interest. This approach generates a video summary for the contents of each cluster by selecting representative video clips from individual videos and lower level clusters within the cluster. Links are added between the more general, higher-level clusters and the elements they contain. Thus, starting at the top of the set of videos being browsed or returned by the search engine and continuing at each subsequent cluster level, the user is presented with video summaries for the relevant parts of videos and those of next lower-level clusters. The user can then follow the navigational link to the desired video or lower-level cluster.
摘要:
Embodiments of the present invention provide the ability to navigate, view, and manipulate a collection of digital images utilizing a GUI that has the familiar context of a calendar. Graphical objects representative of digital images are displayed within a particular day displayed in a calendar-based GUI. A user may group digital images into groups, modify the date with which a digital image is associated and perform various other manipulations using embodiments of a calendar-based GUI.
摘要:
A method, system, and apparatus for easily creating a video collage from a video is provided. By segmenting the video into a set number of video segments and providing an interface for a user to select images which represent the video segments and insert the selected images into a video collage template, a video collage may be easily created in a short amount of time. The system is designed to assign values to the video inserted in a video collage and compact the video based on these values thereby creating a small file which may be easily stored or transmitted.
摘要:
The present invention analyzes recorded video from a video camera to identify camera and object motion in the recorded video. Keyframes representative of clips of the recorded video are displayed on a user interface that allows a user to manipulate an order of the keyframes. Editing rules are then applied to the keyframes to intelligently splice together portions of the representative clips into a final output video.
摘要:
Detection of video shot boundaries using a Video Segmenting Hidden Markov Model to model the sequence of states of a video. The Video Segmenting Hidden Markov Model determines the state sequence based on feature values. Using Hidden Markov Model techniques allows for automatic learning and use of multiple features including motion vectors, audio differences and histogram differences, without the need for manual adjustments of these thresholds.
摘要:
A method for segmenting audio data, comprising speech from a plurality of individual speakers, according to speaker is provided. The method comprises providing individual HMMs for each individual speaker, each individual HMM including at least one state, and constructing a speaker network HMM by connecting the individual HMMs in parallel. The audio data is then divided into segments by determining a most likely sequence of states through the speaker network HMM, each of the segments being associated with one of the individual HMMs. Afterward, the speaker of each of the segments is identified. The segmented data may be used to form an index into the audio data according to speaker.