摘要:
A system and method is provided for synthesizing audio-visual content in a video image processor. A content synthesis application processor extracts audio features and video features from audio-visual input signals that represent a speaker who is speaking. The processor uses the extracted visual features to create a computer generated animated version of the face of the speaker. The processor synchronizes facial movements of the animated version of the face of the speaker with a plurality of audio logical units such as phonemes that represent the speaker's speech. In this manner the processor synthesizes an audio-visual representation of the speaker's face that is properly synchronized with the speaker's speech.
摘要:
One provides (101) a media source bundle (200) as pertains to a given subject matter of interest to at least one end user. This media source bundle can comprise, for example and at least in part, content source locations for each of a plurality of independent content sources that each offer content regarding the given subject matter and wherein at least some of these independent content sources are associated with mutually non-compatible electronic content-delivery modalities. (In such an application, the media source bundle will be understood to not comprise the content itself.) These teachings will then provide for transmitting (102) a message that comprises, at least in part, this media source bundle to one or more corresponding end user recipient platforms (303).
摘要:
An object detection algorithm that generates a two-layer Gaussian Mixture Model (GMM) during a training session, and subsequent to the training session, uses the two-layer GMM to perform face detection. No labeling of local features is needed. The only input that is provided by a user is the setting of a few global parameters for the image being captured during the training session, such as, for example, the person's facial pose.
摘要:
A method for matching signatures based on motion signature information including acquiring a first signature and at least one second signature that are to be matched, wherein the first signature and the second signature are generated based on a motion object's motion signature information; and matching the first signature and the second signature based on the motion signature information to obtain a corresponding match result.
摘要:
An object detection algorithm that generates a two-layer Gaussian Mixture Model (GMM) during a training session, and subsequent to the training session, uses the two-layer GMM to perform face detection. No labeling of local features is needed. The only input that is provided by a user is the setting of a few global parameters for the image being captured during the training session, such as, for example, the person's facial pose.
摘要:
A processor (10) utilizes information regarding one or more physical dimensions of an individual (14) to better inform a personal identification process. In one embodiment, the measured physical dimensions are utilized to influence the conduct of a face recognition process. In one embodiment, a Bayesian Belief Network can be utilized to facilitate such processes.
摘要:
A network element of choice receives (101) information and uses (102) that information to develop a content search query. That network element then instigates (103) a content search using the content search query and receives (104), in turn, content search results comprising a plurality of content items. Profile information for a plurality of playback platforms is then accessed (105) and used (106) to identify which of the content items are best played back on particular ones of the playback platforms.
摘要:
Meta-data retrieved externally is determined to be relevant to the creation of an electronic programming guide (EPG) or relevant to a user's preferred programs. The meta-data is stored locally if it is determined that it is relevant to the creation of the EPG or if it is relevant to the user's preferred programs. All other meta-data is discarded. When a user requests meta-data, an attempt is made to retrieve the data from a local database, and if the attempt fails, then the meta-data is obtained from an external source.