摘要:
A method for determining a semantic concept associated with an audio signal captured using an audio sensor. A data processor is used to automatically analyze the audio signal using a plurality of semantic concept detectors to determine corresponding preliminary semantic concept detection values, each semantic concept detector being adapted to detect a particular semantic concept. The preliminary semantic concept detection values are analyzed using a joint likelihood model based on predetermined pair-wise likelihoods that particular pairs of semantic concepts co-occur to determine updated semantic concept detection values. One or more semantic concepts are determined based on the updated semantic concept detection values. The semantic concept detectors and the joint likelihood model are trained together with a joint training process using training audio signals, at least some of which are known to be associated with a plurality of semantic concepts.
摘要:
A method for identifying a set of key video frames from a video sequence comprising extracting feature vectors for each video frame and applying a group sparsity algorithm to represent the feature vector for a particular video frame as a group sparse combination of the feature vectors for the other video frames. Weighting coefficients associated with the group sparse combination are analyzed to determine video frame clusters of temporally-contiguous, similar video frames. A summary is formed based on the determined video frame clusters.
摘要:
A method for identifying a set of key video frames from a video sequence comprising extracting feature vectors for each video frame and applying a group sparsity algorithm to represent the feature vector for a particular video frame as a group sparse combination of the feature vectors for the other video frames. Weighting coefficients associated with the group sparse combination are analyzed to determine video frame clusters of temporally-contiguous, similar video frames. A set of key video frames are selected based on the determined video frame clusters.
摘要:
A method of identifying one or more particular images from an image collection, includes indexing the image collection to provide image descriptors for each image in the image collection such that each image is described by one or more of the image descriptors; receiving a query from a user specifying at least one keyword for an image search; and using the keyword(s) to search a second collection of tagged images to identify co-occurrence keywords. The method further includes using the identified co-occurrence keywords to provide an expanded list of keywords; using the expanded list of keywords to search the image descriptors to identify a set of candidate images satisfying the keywords; grouping the set of candidate images according to at least one of the image descriptors, and selecting one or more representative images from each grouping; and displaying the representative images to the user.
摘要:
A method of classifying a set of semantic concepts on a second multimedia collection based upon adapting a set of semantic concept classifiers and updating concept affinity relations that were developed to classify the set of semantic concepts for a first multimedia collection. The method comprises providing the second multimedia collection from a different domain and a processor automatically classifying the semantic concepts from the second multimedia collection by adapting the semantic concept classifiers and updating the concept affinity relations to the second multimedia collection based upon the local smoothness over the concept affinity relations and the local smoothness over data affinity relations.
摘要:
A system and method for semantic event detection in digital image content records is provided in which an event-level “Bag-of-Features” (BOF) representation is used to model events, and generic semantic events are detected in a concept space instead of an original low-level visual feature space based on the BOF representation.
摘要:
A display system and method for operating a display and a collection of digital multimedia objects are provided. A first selection set of predefined organizational metaphors is presented and a selection of a first organizational metaphor from the first selection set is received. A second selection set of predefined organizational metaphors other than the first selected organizational metaphor is presented and a selection of a second organizational metaphor from the second selection set is received. A result is presented on the display having one of at least two group icons, each group icon indicating a group of digital multimedia objects chosen from the collection according to rules associated with the selected organizational metaphors and the content of the digital multimedia objects or any metadata associated with the digital multimedia objects. Wherein the group of digital multimedia objects indicated by each group icon are chosen according to result presentation rules.
摘要:
A method, system and software program for automatically organizing digital images obtained from a plurality of hardcopy media. A plurality of hardcopy media are scanned so as to obtain both the image side and non-image side the of hardcopy media including capturing any watermark present on non-image side. The watermark on the non-mage side is used for automatically organizing digital images.
摘要:
A method for automatically classifying images into a final set of events including receiving a first plurality of images having date-time and a second plurality of images with incomplete date-time information; determining one or more time differences of the first plurality of images based on date-time clustering of the images and classify the first plurality of images into a first set of possible events; analyzing the second plurality of images using scene content and metadata cues and selecting images which correspond to different events in the first set of possible events and combining them into their corresponding possible events to thereby produce a second set of possible events; and using image scene content to verify the second set of possible events and to change the classification of images which correspond to different possible events to thereby provide the final set of events.
摘要:
In a method for classifying a sequence of records into events based upon feature values, such as time and/or location, associated with each of the records, feature differences between consecutive records are determined. The feature differences are ranked. A sequence of three or more clusters of feature differences is computed. The clusters are arranged in decreasing order of relative likelihood of respective feature differences representing separations between events. The records can be inclusive of images.