Abstract:
A method for optimizing the performance of an algorithm for detecting predetermined content in a media information stream, and a program and apparatus that operate in accordance with the method. The algorithm is a function of a set of parameters. The method comprises the steps of performing the algorithm at least once to detect the predetermined content in the media information stream, while employing a respective set of parameters in the algorithm for each performance thereof, and automatically evolving at least one respective set of parameters employed in the algorithm to maximize the degree of accuracy at which the algorithm detects the predetermined content in the media information stream.
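The loop described above (run the detector with a set of parameters, score it, evolve the parameters) can be illustrated with a minimal sketch. The abstract does not say which evolutionary scheme is used, so everything here is an assumption made for illustration: the toy `detect` function, the single `threshold` parameter, the population size, and the mutate-the-survivors strategy.

```python
import random

def detect(values, threshold):
    """Toy detector (hypothetical): flag a sample when its score exceeds the threshold."""
    return [v > threshold for v in values]

def accuracy(params, values, labels):
    """Fraction of samples the detector labels correctly for one parameter set."""
    predictions = detect(values, params["threshold"])
    return sum(p == l for p, l in zip(predictions, labels)) / len(labels)

def evolve(values, labels, generations=30, pop_size=8, seed=0):
    """Evolve parameter sets toward higher detection accuracy (assumed scheme)."""
    rng = random.Random(seed)
    # Each individual is one set of parameters for the detection algorithm.
    population = [{"threshold": rng.uniform(0.0, 1.0)} for _ in range(pop_size)]
    for _ in range(generations):
        # Rank parameter sets by how accurately the detector performs with them.
        population.sort(key=lambda p: accuracy(p, values, labels), reverse=True)
        survivors = population[: pop_size // 2]
        # Mutate each survivor slightly to produce the next candidates.
        children = [{"threshold": p["threshold"] + rng.gauss(0.0, 0.1)}
                    for p in survivors]
        population = survivors + children
    return max(population, key=lambda p: accuracy(p, values, labels))
```

Because the best parameter sets are carried over unchanged into each new generation, the best accuracy in the population never decreases from one generation to the next.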
Abstract:
Advertisers want to deliver their message in a relatively short period of time. This leads to the product name, company name and other identifying features being repeated frequently during a commercial broadcast. Transcript information can be used to detect commercials by detecting frequently occurring words in the commercials. This can also be used to distinguish an individual commercial from other commercials. Once the individual commercials have been identified, the transcript information corresponding to each commercial can be stored in a database to identify the commercial in subsequent broadcasts, or to provide a search mechanism for searching for a particular commercial in the database.
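As a rough illustration of detecting frequently occurring words in a transcript, the sketch below counts repeated content words and flags a segment when repeats make up a large share of it. The stopword list, the `min_count` and `min_ratio` thresholds, and both function names are assumptions chosen for the example, not values taken from the abstract.

```python
import re
from collections import Counter

# Small illustrative stopword list (an assumption; a real system would use a fuller one).
STOPWORDS = {"a", "an", "and", "the", "is", "are", "of", "on", "to", "in", "for", "it"}

def content_words(transcript):
    """Lowercase the transcript and keep only non-stopword tokens."""
    words = re.findall(r"[a-z']+", transcript.lower())
    return [w for w in words if w not in STOPWORDS]

def frequent_words(transcript, min_count=3):
    """Return the words that occur at least min_count times, with their counts."""
    counts = Counter(content_words(transcript))
    return {w: c for w, c in counts.items() if c >= min_count}

def looks_like_commercial(transcript, min_count=3, min_ratio=0.3):
    """Flag a segment when repeated words account for a large share of it."""
    words = content_words(transcript)
    if not words:
        return False
    repeated = sum(frequent_words(transcript, min_count).values())
    return repeated / len(words) >= min_ratio
```

The dictionary returned by `frequent_words` could also serve as a simple signature for telling one commercial from another, or as the text stored for later lookup.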
Abstract:
The present invention provides a method, system and program product for generating a content-based table of contents for a program. Specifically, under the present invention the genre of a program having sequences is determined. Once the genre has been determined, each sequence is assigned a classification. The classifications are assigned based on video content, audio content and textual content within the sequences. Based on the genre and the classifications, keyframe(s) are selected from the sequences for use in a content-based table of contents.
Abstract:
A method for providing name-face/voice-role association includes determining whether a closed captioned text accompanies a video sequence, applying one of text recognition and speech to text conversion to the video sequence to generate a role-name versus actor-name list from the video sequence, extracting face boxes from the video sequence and generating face models, searching a predetermined portion of text for an entry on the role-name versus actor-name list, searching video frames for face models/voice models that correspond to the text searched by using a time code so that the video frames correspond to portions of the text where role-names are detected, assigning an equal level of certainty to each of the face models found, using lip reading to eliminate face models found that pronounce a role-name corresponding to said entry on the role-name versus actor-name list, scanning a remaining portion of text provided and updating a level of certainty for each of said face models previously found. Once a particular face model/voice model and role-name association has reached a threshold, the role-name, actor name, and particular face model/voice model are stored in a database and can be displayed to a user. Thus the user can query information by entry of role-name, actor name, face model, or even words spoken by the role-name as a basis for the association. A system provides hardware and software to perform these functions.
Abstract:
A system and method for collecting, analyzing, and using sensory reactions and involuntary or spontaneous movements by members of a television viewing (or listening) audience. While known programming is displayed on a television receiver, a plurality of sensors monitor the viewer or viewers for recognizable evidence of an emotional response that can be associated with a discrete program segment. Where positive (or negative) responses can be associated with a certain type of program content, the system monitors subsequent programs for the opportunity to notify the viewer or simply present (or avoid presenting) the program automatically.
Abstract:
A method, apparatus and systems for bookmarking an area of interest of stored video content are provided. As a viewer is watching a video and finds an area of interest, they can bookmark the particular segment of the video and then return to that segment with relative simplicity. This can be accomplished by pressing a button, clicking with a mouse or otherwise sending a signal to a device for marking a particular location of the video that is of interest. Frame identifiers can also be used to select a desired video from an index and to then retrieve the video from a medium containing multiple videos.
Abstract:
A system and method for distinguishing commercials from other programs in stored content. The system comprises an image detection module that detects and extracts faces in a specific time window. The extracted faces are matched against the faces detected in the subsequent time window. If none of the faces match, a flag is set, indicating the beginning of a commercial portion. A sound or speech analysis module verifies the beginning of the commercial portion by analyzing the sound signatures in the same time windows used for detecting faces.
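The window-to-window face matching step can be sketched as follows. Faces are represented here as plain numeric feature tuples and compared by Euclidean distance with a tolerance `tol`; that representation, the tolerance, and the function names are all assumptions for illustration. The audio verification step is left out.

```python
def faces_match(a, b, tol=0.5):
    """Treat two face descriptors as the same face when they are close (assumed metric)."""
    return sum((x - y) ** 2 for x, y in zip(a, b)) ** 0.5 <= tol

def commercial_start_flags(windows, tol=0.5):
    """For each pair of consecutive time windows, set a flag when no face carries over.

    windows is a list of lists of face descriptors, one inner list per time window.
    flags[i] is True when nothing in windows[i] matches anything in windows[i + 1],
    which is taken as a candidate start of a commercial portion.
    """
    flags = []
    for previous, current in zip(windows, windows[1:]):
        any_match = any(faces_match(p, c, tol) for p in previous for c in current)
        flags.append(not any_match)
    return flags
```

Note that a window with no detected faces also produces a flag under this literal reading, which is one reason a second check such as the sound analysis is useful.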
Abstract:
Techniques are disclosed for detecting commercials or other particular types of video content in a video signal. In an illustrative embodiment, color histograms are extracted from frames of the video signal. For each of at least a subset of the extracted color histograms, the extracted color histogram is compared to a family histogram. If the extracted color histogram falls within a specified range of the family histogram, the family histogram is updated to include the extracted color histogram as a new member. If the extracted color histogram does not fall within the specified range of the family histogram, the family histogram is considered complete and the extracted color histogram is utilized to generate a new family histogram for use in processing subsequent extracted color histograms. The resulting family histograms are utilized to detect commercials or other particular types of video content in the video signal.
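The family-histogram grouping can be sketched as a single pass over the per-frame histograms. The abstract does not specify the distance measure or how a family is updated, so the L1 distance and the running-mean update below are assumptions, as are the function names. How the finished families are then used to detect commercials is not covered here.

```python
def histogram_distance(h1, h2):
    """L1 distance between two histograms of equal length (assumed measure)."""
    return sum(abs(a - b) for a, b in zip(h1, h2))

def build_families(histograms, threshold):
    """Group consecutive frame histograms into families.

    Returns a list of (family_histogram, member_count) pairs, where each
    family histogram is the running mean of its members.
    """
    families = []
    current, count = None, 0
    for h in histograms:
        if current is None:
            # The first histogram starts the first family.
            current, count = list(h), 1
        elif histogram_distance(h, current) <= threshold:
            # Within range: fold the new member into the family's running mean.
            count += 1
            current = [f + (x - f) / count for f, x in zip(current, h)]
        else:
            # Out of range: the family is complete; start a new one from this histogram.
            families.append((current, count))
            current, count = list(h), 1
    if current is not None:
        families.append((current, count))
    return families
```

Under this sketch, a run of many short families would indicate rapid visual change, though whether that is the cue the disclosed techniques rely on is not stated in the abstract.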
Abstract:
A multiple viewing program recommendation system employing a commercial detection module and a multiple viewing module is disclosed. In response to a generation of viewing recommendations of two or more programs during the same time slot, the multiple viewing module controls a display of a recommended program having the highest viewing priority on a television screen while the recommended program is being aired on one of the television channels until the commercial detection module detects a commercial being aired on the television channel. In response to the detection of the commercial by the commercial detection module, the multiple viewing module controls a display of an additional recommended program having the next highest viewing priority on the television screen while the additional recommended program is being aired on another one of the television channels.
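The switching rule can be sketched as a small selection function: show the highest-priority recommended program unless its channel is currently in a commercial, in which case fall to the next highest. The data shapes, the function name `choose_display`, and the fallback when every channel is in a commercial are assumptions for illustration.

```python
def choose_display(recommendations, in_commercial):
    """Pick the channel to display for the current time slot.

    recommendations: list of (priority, channel) pairs; a higher priority is preferred.
    in_commercial: dict mapping channel to True while a commercial is detected on it.
    """
    ranked = sorted(recommendations, key=lambda r: r[0], reverse=True)
    for _priority, channel in ranked:
        # Take the best-ranked program whose channel is not airing a commercial.
        if not in_commercial.get(channel, False):
            return channel
    # Assumed fallback: if every channel is in a commercial, stay on the top choice.
    return ranked[0][1] if ranked else None
```

Calling the function again each time the commercial detector changes state returns the display to the top-priority program once its commercial break ends.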
Abstract:
An entertainment receiver is tuned in response to at least one stored signal indicative of preferred program content type for a user of the receiver. The program content type of plural program sources received by the receiver is detected and determined. The program content type of the plural received program sources is compared with the stored signal indicative of preferred program content type for a user of the receiver. The receiver is activated so a received program source with the preferred program content type is presented to the user.