摘要:
A video signal is processed to generate one or more signatures associated with a broadcast program to be recorded by a recording device. The signatures are then processed to determine an actual start time and end time of the desired broadcast program, such that the program can be properly recorded despite delays or other changes in a pre-scheduled broadcast time of the program. One or more of the extracted signatures may be based at least in part on, e.g., a keyframe similarity measure, a histogram, one or more detected commercials, a transcript, a program logo or other detected object, detected text, and a sign-on or sign-off of the desired program. Other types of signatures can also be used.
摘要:
A method for optimizing the performance of an algorithm for detecting predetermined content in a media information stream, and a program and apparatus that operate in accordance with the method. The algorithm is a function of a set of parameters. The method comprises the steps of performing the algorithm at least once to detect the predetermined content in the media information stream, while employing a respective set of parameters in the algorithm for each performance thereof, and automatically evolving at least one respective set of parameters employed in the algorithm to maximize the degree of accuracy at which the algorithm detects the predetermined content in the media information stream.
摘要:
A method of selecting, storing and delivering desired audio/data/visual information includes the steps of determining viewing preferences of a viewer and receiving a first group of audio/data/visual signals, for example, broadcast and cable television signals or internet-based signals. Based on the first group of audio/data/visual signals, a second group of audio/data/visual signals, which is a subset of the first group of audio/data/visual signals, is identified. The second group of audio/data/visual signals is selected based on the association of EPG data for each signal with the viewing preferences of the viewer. Content data is then extracted from the second group of audio/data/visual signals and compared with the viewing preferences. The content data may include, for example, closed-captioned text, EPG data, audio information, visual information and transcript information. Based on the comparison of the content data extracted from the second group of audio/data/visual signals with the viewing preferences, audio/data/visual information contained in the second group of audio/data/visual signals which is of interest to the viewer is identified and stored for review at the viewers convenience.
摘要:
A method, apparatus and systems for bookmarking an area of interest of stored video content is provided. As a viewer is watching a video and finds an area of interest, they can bookmark the particular segment of the video and then return to that segment with relative simplicity. This can be accomplished by pressing a button, clicking with a mouse or otherwise sending a signal to a device for marking a particular location of the video that is of interest. Frame identifiers can also be used to select a desired video from an index and to then retrieve the video from a medium containing multiple videos.
摘要:
A video signal is processed to identify segments that are likely to be associated with a commercial or other particular type of video content. A signature is extracted from each of the segments so identified, and the extracted signatures are used, possibly in conjunction with additional temporal and contextual information, to determine which of the identified segments are in fact associated with the particular video content. One or more of the extracted signatures may be, e.g., a visual frame signature based at least in part on a visual characteristic of a frame of the video segment, as determined using information based on DC and motion coefficients of the frame, or DC and AC coefficients of the frame. A given extracted signature may alternatively be an audio signature based at least in part on a characteristic of an audio signal associated with a portion of the video segment. Other types of signatures can also be used. Advantageously, the invention allows the identification and extraction of particular video content to be implemented with significantly reduced amounts of memory and computational resources.
摘要:
Techniques are disclosed for detecting commercials or other particular types of video content in a video signal. In an illustrative embodiment, color histograms are extracted from frames of the video signal. For each of at least a subset of the extracted color histograms, the extracted color histogram is compared to a family histogram. If the extracted color histogram falls within a specified range of the family histogram, the family histogram is updated to include the extracted color histogram as a new member. If the extracted color histogram does not fall within the specified range of the family histogram, the family histogram is considered complete and the extracted color histogram is utilized to generate a new family histogram for use in processing subsequent extracted color histograms. The resulting family histograms are utilized to detect commercials or other particular type of video content in the video signal.
摘要:
A method of automatically identifying the microarray chip corners and probes, even if there are no probes at the corners, in a high density and high resolution microarray scanned image having an image space, wherein the method minimizes the error distortions in the image arising in the scanning process by applying to the image a multipass corner finding algorithm comprising: (a) applying a Radon transform to an input microarray image to project the image into an angle and distance space where it is possible to find the orientation of the straight lines; (b) applying a fast Fourier transform to the projected image of (a) to find the optimal tilting angle of the projected image; (c) determining the optimal first and last local maxima for the optimal tilting angle; (d) back projecting the determined first and last local maxima to the image space to find the first approximation of the first and last column lines of the image; (e) rotating the image and repeating steps (a) through (d) to find the first approximation of the top and bottom row lines of the image; (f) determining the first approximation of the four corners of the image from the intersection of the column and row lines; (g) applying a heuristic for determining if the first approximation of step (f) is sufficient; and (h) optionally trimming the scanned image around the first approximation of the four corners and repeating steps (a) through (f).
摘要:
A content maintenance system uses a time-dependent precipitation function for iteratively augmenting or removing content over time, after an initial demonstration of user interest. A plurality of parallel precipitation processes can be launched simultaneously in response to different facets of a user expression of interest. Precipitation is dependent on highlighting or extracting segment descriptors from content of interest to the user. Then segments are filtered, rated, annotated and/or prioritized from that content. The remaining segments are matched against stored search structures. When the segments match, they are precipitated out for storage and can generate new search structures.
摘要:
A content-based classification system is provided that detects the presence of object images within a frame and determines the path, or trajectory, of each object image through multiple frames of a video segment. In a preferred embodiment, face objects and text objects are used for identifying distinguishing object trajectories. A combination of face, text, and other trajectory information is used in a preferred embodiment of this invention to classify each segment of a video sequence. In one embodiment, a hierarchical information structure is utilized to enhance the classification process. At the upper, video, information layer, the parameters used for the classification process include, for example, the number of object trajectories of each type within the segment, an average duration for each object type trajectory, and so on. At the lowest, model, information layer, the parameters include, for example, the type, color, and size of the object image corresponding to each object trajectory. In an alternative embodiment, a Hidden Markov Model (HMM) technique is used to classify each segment into one of a predefined set of classifications, based on the observed characterization of the object trajectories contained within the segment.
摘要:
An image processing device and method for classifying symbols, such as text, in a video stream employs a back propagation neural network (BPNN) whose feature space is derived from size, translation, and rotation invariant shape-dependent features. Various example feature spaces are discussed such as regular and invariant moments and an angle histogram derived from a Delaunay triangulation of a thinned, thresholded, symbol. Such feature spaces provide a good match to BPNN as a classifier because of the poor resolution of characters in video streams.