Abstract:
A method and apparatus for creating a personalized channel capable of providing a selected commercial to display on a display device includes a viewer reference profile defined with respect to at least one attribute descriptive of a commercial is acquired and used to select a commercial from a plurality of commercials stored in a database. A virtual channel is allocated from available channels in a channel list having a plurality of program channels. The selected commercial is then associated with the virtual channel so as to allow display of the selected channel on the display device when the virtual channel is accessed by a user.
Abstract:
Stories are detected in multimedia data composed of concurrent streams for different modes such as audio, video and text and linked to related stories. First, time periods of uniformity in attributes of the streams serve as nullbuilding blocksnull that are consolidated according to rules that are characteristic of the story to be detected. The attributes are then ranked by their respective reliabilities for detecting the story to be detected. An inter-attribute union of the time periods is cumulated attribute-to-attribute in an order that is based on the ranking. A buffered portion of the multimedia data that is delimited by the starting and ending times is retained in mass storage. The starting and ending times are indexed by characteristics of content of the portion to form a story segment which is maintained in a data structure with links to related story segments.
Abstract:
A method for optimizing the performance of an algorithm for detecting predetermined content in a media information stream, and a program and apparatus that operate in accordance with the method. The algorithm is a function of a set of parameters. The method comprises the steps of performing the algorithm at least once to detect the predetermined content in the media information stream, while employing a respective set of parameters in the algorithm for each performance thereof, and automatically evolving at least one respective set of parameters employed in the algorithm to maximize the degree of accuracy at which the algorithm detects the predetermined content in the media information stream.
Abstract:
Advertisers want to deliver their message in a relatively short period of time. This leads to the product name, company name and other identifying features being repeated frequently during a commercial broadcast. Transcript information can be used to detect commercials by detecting frequently occurring words in the commercials. This can also be used to identify an individual commercial from other commercials. Once the individual commercials have been identified, the transcript information corresponding to each commercial can be stored in a database to identify the commercial in subsequent broadcasts, or to provide a search mechanism for searching a particular commercial in the database.
Abstract:
A parental control system provides the ability to automatically filter a multimedia program content in real time based on stock and user specified criteria. The criteria are used to teach a learning module in the system what types of video program segments are to be considered sensitive or objectionable so that the module's understanding of what is sensitive and what is not can be applied to other video programs to provide real-time filtering. The multimedia program is broken down into audio, video, and transcript components so that sound effects, visual components and objects, and language can all be analyzed collectively to make a determination of whether offending material is being passed along in the multimedia program. The user has the option of training the system for any type of objectionable material, not just sex and violence.
Abstract:
The present invention provides a method, system and program product for generating a content-based table of contents for a program. Specifically, under the present invention the genre of a program having sequences is determined. Once the genre has been determined, each sequence is assigned a classification. The classifications are assigned based on video content, audio content and textual content within the sequences. Based on the genre and the classifications, keyframe(s) are selected from the sequences for use in a content-based table of contents.
Abstract:
A memory storing computer readable instructions for causing a processor associated with a mega speaker identification (ID) system to instantiate functions including an audio segmentation and classification function receiving general audio data (GAD) and generating segments, a feature extraction function receiving the segments and extracting features based on mel-frequency cepstral coefficients (MFCC) therefrom, a learning and clustering function receiving the extracted features and reclassifying segments, when required, based on the extracted features, a matching and labeling function assigning a speaker ID to speech signals within the GAD, and a database function for correlating the assigned speaker ID to the respective speech signals within the GAD. The audio segmentation and classification function can assign each segment to one of N audio signal classes including silence, single speaker speech, music, environmental noise, multiple speaker's speech, simultaneous speech and music, and speech and noise. A mega speaker identification (ID) system and corresponding method are also described.
Abstract:
A method for providing name-face/voice-role association includes determining whether a closed captioned text accompanies a video sequence, providing one of text recognition and speech to text conversion to the video sequence to generate a role-name versus actor-name list from the video sequence, extracting face boxes from the video sequence and generating face models, searching a predetermined portion of text for an entry on the role-name versus actor-name list, searching video frames for face models/voice models that correspond to the text searched by using a time code so that the video frames correspond to portions of the text where role-names are detected, assigning an equal level of certainty for each of the face models found, using lip reading to eliminate face models found that pronounce a role-name corresponding to said entry on the role-name versus actor-name list, scanning a remaining portion of text provided and updating a level of certainty for said each of the face models previously found. Once a particular face model/voice model and role-name association has reached a threshold the role-name, actor name, and particular face model/voice model is stored in a database and can be displayed by a user when the threshold for the particular face model has been reached. Thus the user can query information by entry of role-name, actor name, face model, or even words spoken by the role-name as a basis for the association. A system provides hardware and software to perform these functions.
Abstract:
A system and method for collecting, analyzing, and using sensory reactions and involuntary or spontaneous movements by members of a television viewing (or listening) audience. While known programming is displayed on a television receiver, a plurality of sensors monitor the viewer or viewers for recognizable evidence of an emotional response that can be associated with a discrete program segment. Where positive (or negative) responses can be associated with a certain type of program content, the system monitors subsequent programs for the opportunity to notify the viewer or simply present (or avoid presenting) the program automatically.
Abstract:
There is disclosed an apparatus and method for conducting exclusive and inclusive metadata searches to identify and select multimedia programs. The apparatus of the invention comprises a metadata search controller that compares user specified search words with metadata words to find programs that meet user specified search criteria. The metadata search controller executes an inclusive metadata search to search for matches between a user specified search word and a metadata word that is related to the user specified search word in a word pair contained within a word pair database. The metadata search controller calculates a rank value for each program that is found by a metadata search and creates a ranked list of such programs.