Abstract:
A surveillance and control system includes a feature extraction unit to dynamically extract low-level features from a compressed digital video signal, and a description encoder, coupled to the feature extraction unit, to encode the low-level features as content descriptors. An event detector is coupled to the description encoder to detect security events from the content descriptors, and a control signal processor, coupled to the event detector, generates control signals in response to detecting the security events.
Abstract:
A multimedia delivery system for delivering a compressed bitstream through a network to a user device includes a transcoder and a manager. The transcoder is configured to operate on the bitstream in any one of a plurality of conversion modes. The manager is configured to select a particular one of the plurality of conversion modes dependent on the semantic content of the bitstream and on network characteristics. The system also includes a content classifier to determine the content characteristics, and a model predictor to determine the network characteristics and user device characteristics. An integrator of the manager generates an optimal rate-quality function to be used for selecting the particular conversion mode for a given available bit rate of the network.
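The selection step described above can be illustrated with a minimal sketch. It assumes each conversion mode comes with a rate-quality curve and picks the mode with the highest predicted quality at the available bit rate. The mode names (`requantize`, `drop_frames`), the curve shapes, and the units are hypothetical placeholders, not taken from the abstract.

```python
def select_mode(rate_quality, available_rate):
    """Return the mode whose rate-quality curve is highest at available_rate.

    rate_quality maps a mode name to a callable rate -> predicted quality.
    """
    return max(rate_quality, key=lambda mode: rate_quality[mode](available_rate))


# Hypothetical rate-quality curves (rate in kbit/s, quality on an arbitrary scale).
curves = {
    "requantize": lambda rate: 100.0 * rate / (rate + 300.0),
    "drop_frames": lambda rate: 80.0 * rate / (rate + 100.0),
}

low_rate_choice = select_mode(curves, 100.0)    # drop_frames scores 40 vs 25
high_rate_choice = select_mode(curves, 1000.0)  # requantize scores ~76.9 vs ~72.7
print(low_rate_choice, high_rate_choice)
```

Taking the maximum over all modes at every rate traces the upper envelope of the individual curves, which is one plausible reading of an "optimal rate-quality function".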
Abstract:
An apparatus for decoding a digital signal includes a composite picture forming unit for forming a composite picture from a first digital video signal and a second digital video signal. The first digital video signal includes inter-coded picture data. A down-converter receives a third digital video signal. In a first mode, the down-converter outputs the third digital video signal to the composite picture forming unit as the first digital video signal. In a second mode, the down-converter down-converts the third digital video signal into a fourth digital video signal and outputs the fourth digital video signal to the composite picture forming unit. A memory stores anchor pictures output from the composite picture forming unit, and a motion compensator generates the second digital video signal based on the stored anchor pictures.
Abstract:
A compressed bitstream is scaled down to a reduced-rate bitstream by first demultiplexing a compressed input bitstream to extract video objects as elementary input bitstreams having a first bit rate. A transcoder converts each elementary input bitstream to an elementary output bitstream having a second bit rate. The second bit rate is less than the first bit rate. A transcoding control unit, coupled to the transcoder, supplies control information for the transcoder. A multiplexer composes the elementary output bitstreams into a compressed output bitstream having the second bit rate.
Abstract:
In an apparatus for transcoding a compressed video, a generator simulates constraints of a network and constraints of a user device. A classifier is coupled to receive an input compressed video and the constraints. The classifier generates content information from features of the input compressed video. A manager produces a plurality of conversion modes dependent on the constraints and the content information, and a transcoder produces output compressed videos, one for each of the plurality of conversion modes.
Abstract:
A method determines true motion vectors associated with a sequence of images. The images include fields made up of blocks of pixels. The method selects candidate feature blocks from the blocks of pixels. The candidate feature blocks have intensity variances above a threshold indicative of texture features. Candidate feature blocks in similarly numbered adjacent field intervals are compared to determine sets of displaced frame difference parameters for each candidate feature block. The true motion vectors for each candidate feature block are determined from a minimum weighted score derived from the difference parameters.
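The candidate selection step can be sketched as follows. This is a minimal illustration, assuming a field is a 2-D list of pixel intensities tiled into non-overlapping square blocks. The function name, block size, and threshold value are assumptions for the example, and the later comparison and scoring steps are not covered.

```python
from statistics import pvariance


def candidate_feature_blocks(field, block_size, threshold):
    """Return (row, col) origins of blocks whose intensity variance exceeds threshold."""
    rows, cols = len(field), len(field[0])
    candidates = []
    for r in range(0, rows - block_size + 1, block_size):
        for c in range(0, cols - block_size + 1, block_size):
            pixels = [
                field[r + i][c + j]
                for i in range(block_size)
                for j in range(block_size)
            ]
            # High variance is taken here as a sign of texture.
            if pvariance(pixels) > threshold:
                candidates.append((r, c))
    return candidates


# Toy 4x4 field: only the top-right 2x2 block is textured.
field = [
    [10, 10, 0, 255],
    [10, 10, 255, 0],
    [50, 50, 7, 7],
    [50, 50, 7, 7],
]
print(candidate_feature_blocks(field, block_size=2, threshold=100.0))  # [(0, 2)]
```

Flat blocks have zero variance and are skipped, so only textured blocks go forward to motion estimation.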
Abstract:
This invention relates to methods of feature extraction from MPEG-2 and MPEG-4 compressed video sequences. The spatio-temporal compression complexity of a video sequence is evaluated by inspecting the compressed bitstream, and this complexity is used as a descriptor of the spatio-temporal characteristics of the video sequence. The spatio-temporal compression complexity measure is used as a matching criterion and can also be used for absolute indexing. Feature extraction can be performed in conjunction with scene change detection techniques. The combination has reasonable accuracy and is simple, since it relies on entropy decoding of signals in compressed form and does not require the computationally expensive inverse Discrete Cosine Transform (DCT).
Abstract:
A method describes motion activity in a video sequence. A motion activity matrix is determined for the video sequence. A threshold for the motion activity matrix is determined. Connected regions of motion vectors at least equal to the threshold are identified and measured for size. A histogram of the distribution of the sizes of the connected regions is constructed for the entire video sequence. The histogram is normalized to characterize the spatial distribution of the video sequence in a motion activity descriptor.
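The thresholding, region measurement, and histogram steps can be sketched for a single motion activity matrix. This is a minimal illustration that assumes 4-connectivity, a caller-supplied threshold, and hand-picked histogram bin edges. None of these choices is specified in the abstract, and a full descriptor would accumulate sizes over the whole sequence.

```python
from collections import deque


def region_sizes(activity, threshold):
    """Sizes of 4-connected regions whose values are at least `threshold`."""
    rows, cols = len(activity), len(activity[0])
    seen = [[False] * cols for _ in range(rows)]
    sizes = []
    for r in range(rows):
        for c in range(cols):
            if seen[r][c] or activity[r][c] < threshold:
                continue
            # Breadth-first flood fill from an unvisited active cell.
            seen[r][c] = True
            queue = deque([(r, c)])
            size = 0
            while queue:
                y, x = queue.popleft()
                size += 1
                for ny, nx in ((y - 1, x), (y + 1, x), (y, x - 1), (y, x + 1)):
                    if (0 <= ny < rows and 0 <= nx < cols
                            and not seen[ny][nx]
                            and activity[ny][nx] >= threshold):
                        seen[ny][nx] = True
                        queue.append((ny, nx))
            sizes.append(size)
    return sizes


def normalized_histogram(sizes, bin_edges):
    """Histogram of sizes over bins split at bin_edges, normalized to sum to 1."""
    counts = [0] * (len(bin_edges) + 1)
    for s in sizes:
        counts[sum(1 for edge in bin_edges if s >= edge)] += 1
    total = sum(counts)
    return [n / total for n in counts] if total else [0.0] * len(counts)


# Toy matrix of motion vector magnitudes (values are illustrative).
activity = [
    [5, 5, 0, 0],
    [5, 0, 0, 9],
    [0, 0, 0, 0],
    [7, 7, 7, 7],
]
sizes = region_sizes(activity, threshold=4)          # [3, 1, 4]
descriptor = normalized_histogram(sizes, bin_edges=[2, 4])
print(sizes, descriptor)
```

In this toy case the three regions fall into three different size bins, so the normalized histogram is uniform.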
Abstract:
A dynamically configurable video signal processing system partitions and encodes data using a variable number of data segments and variable data resolution. The system partitions data into a variable number of data segments by predicting, as a function of the data rate, first and second distortion factors for the data partitioned into first and second numbers of data segments. The first and second distortion factors are mutually compared and the data is partitioned into the number of data segments which exhibits the lower distortion factor value. First and second distortion factors for the data encoded with first and second data resolutions are also predicted. The first and second distortion factors are similarly compared and the data is encoded with the resolution exhibiting the lower distortion factor value.
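The comparison logic can be sketched as choosing, between two candidate configurations, the one with the lower predicted distortion at the current data rate. The distortion predictor below is a purely hypothetical toy model (an assumed per-segment overhead and an assumed coding gain), included only so the example runs. The abstract does not say how distortion is predicted.

```python
def pick_lower_distortion(option_a, option_b, predict_distortion, data_rate):
    """Return whichever option has the lower predicted distortion at data_rate."""
    d_a = predict_distortion(option_a, data_rate)
    d_b = predict_distortion(option_b, data_rate)
    return option_a if d_a <= d_b else option_b


def toy_segment_distortion(num_segments, data_rate):
    """Hypothetical model: extra segments cost overhead but add coding gain."""
    overhead = 0.3 * (num_segments - 1)      # assumed rate lost to segment headers
    useful_rate = max(data_rate - overhead, 1e-9)
    gain = 1.0 + 0.2 * (num_segments - 1)    # assumed benefit of more segments
    return 1.0 / (useful_rate * gain)


# At a low rate the overhead dominates; at a high rate the extra segment pays off.
print(pick_lower_distortion(1, 2, toy_segment_distortion, data_rate=1.0))  # 1
print(pick_lower_distortion(1, 2, toy_segment_distortion, data_rate=4.0))  # 2
```

The same `pick_lower_distortion` helper could be reused for the resolution decision by passing two candidate resolutions and a resolution-specific predictor.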
Abstract:
A method analyzes a high-level syntax and structure of a continuous compressed video according to a plurality of states. First, a set of hidden Markov models for each of the states is trained with a training video segmented into known states. Then, a set of domain-specific features is extracted from a fixed-length sliding window of the continuous compressed video, and a set of maximum likelihoods is determined for each set of domain-specific features using the sets of trained hidden Markov models. Finally, dynamic programming is applied to each set of maximum likelihoods to determine a specific state for each fixed-length sliding window of frames of the compressed video.
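The final dynamic programming step can be sketched as a Viterbi-style pass over per-window scores. This sketch assumes the per-window log-likelihoods have already been computed by the trained models, and it adds state transition log-probabilities as an assumption of the example. The state names `play` and `break` and all numeric values are hypothetical.

```python
def decode_states(window_loglik, log_trans):
    """Most likely state sequence given per-window log-likelihoods.

    window_loglik: list of dicts, state -> log-likelihood for that window.
    log_trans: dict, (previous_state, current_state) -> log transition probability.
    """
    states = list(window_loglik[0])
    best = {s: window_loglik[0][s] for s in states}
    back = []
    for scores in window_loglik[1:]:
        new_best, pointers = {}, {}
        for cur in states:
            # Best predecessor for `cur` under the accumulated scores.
            prev = max(states, key=lambda p: best[p] + log_trans[(p, cur)])
            new_best[cur] = best[prev] + log_trans[(prev, cur)] + scores[cur]
            pointers[cur] = prev
        best = new_best
        back.append(pointers)
    # Trace back from the best final state.
    path = [max(states, key=lambda s: best[s])]
    for pointers in reversed(back):
        path.append(pointers[path[-1]])
    path.reverse()
    return path


# Hypothetical scores: the middle window weakly favours "break" on its own.
window_loglik = [
    {"play": -1.0, "break": -5.0},
    {"play": -3.0, "break": -2.5},
    {"play": -1.0, "break": -6.0},
]
log_trans = {
    ("play", "play"): -0.1, ("play", "break"): -2.3,
    ("break", "break"): -0.1, ("break", "play"): -2.3,
}
print(decode_states(window_loglik, log_trans))  # ['play', 'play', 'play']
```

Picking the best state per window independently would label the middle window `break`. The transition cost smooths that isolated decision away, which is the usual motivation for decoding the sequence jointly.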