摘要:
A method and apparatus for scaling the bitstream of a compressed video signal includes partial decoding hardware (38, 41) to permit excising of higher frequency AC DCT coefficients or re-quantizing quantized data with a coarser quantization factor. The scaling is performed on a block (macroblock) basis in a manner which linearly scales the amount of compressed data per block. An analyzer (40) generates a profile of cumulative partially decompressed data over a video frame, and bitstream scaling (42) is performed in a manner which insures that a profile of the scaled signal substantially comports with the profile of the original data.
摘要:
A dynamically configurable video signal processing system partitions and encodes data using a variable number of data segments and variable data resolution. The system partitions data into a variable number of data segments by predicting, as a function of the data rate, first and second distortion factors for the data partitioned into first and second numbers of data segments. The first and second distortion factors are mutually compared and the data is partitioned into the number of data segments which exhibits the lower distortion factor value. First and second distortion factors for the data encoded with first and second data resolutions are also predicted. The first and second distortion factors are similarly compared and the data is encoded with the resolution exhibiting the lower distortion factor value.
摘要:
A dynamically configurable video signal processing system including an encoder and decoder processes data in the form of hierarchical layers. The system partitions data between hierarchical layers and allows variation in the number of layers employed. Data is automatically partitioned into one or more hierarchical layers as a function of one or more parameters selected from available system bandwidth, input data rate, and output signal quality. In addition, the image resolution and corresponding number of pixels per image of the data may be varied as a function of system parameters.
摘要:
Error concealment apparatus for correcting errors in signals representing video images includes means for detecting image gradients in an area surrounding a lost block of image data. Circuitry responsive to these image gradients generates a plurality of blocks of directionally interpolated pixel values. The pixel values in the respective blocks of directionally interpolated pixel values are sorted according to amplitude, and then pixel values from mutually exclusive positions in the respective blocks are selected to form a block of pixel values for error concealment.
摘要:
An apparatus and concomitant method for selecting a macroblock coding mode based upon the quantization scale selected for the macroblock. The total number of bits needed to code each macroblock consists of two parts, bits needed for coding motion vectors and bits for coding the predictive residual. The number of bits for coding the motion vectors is generally obtained from a look-up table. The number of bits for coding the predictive residual is obtained by an estimation which assumes that the number of bits for encoding the predictive residuals is directly proportional to the value of its variance and inversely proportional to the value of quantizer steps (quantizer scale). Using this estimation, the total number of bits necessary to code a macroblock is calculated and compared for each coding mode. By selecting the coding mode with the least number of bits, a near-optimal solution of low complexity for practical implementation is acquired.
摘要:
A method and apparatus for selecting a quantizer scale for each macroblock to maintain the overall quality of the video image while optimizing the coding rate. A quantizer scale is selected for each macroblock such that target bit rate for the picture is achieved while an optimal quantization scale ratio is maintained for successive macroblocks to produce a uniform visual quality over the entire picture. One embodiment applies the method to the frame level while another embodiment applies the method in conjunction with a wavelet transform.
摘要:
Multiview videos are acquired of a scene with corresponding cameras arranged at poses, such that there is view overlap between any pair of cameras. V-frames are generated from the multiview videos. The V-frames are encoded using only spatial prediction. Then, the V-frames are inserted periodically in an encoded bit stream to provide random temporal access to the multiview videos. Additional view dependency information enables the decoding of a reduced number of frames prior to accessing randomly a target frame for a specified view and time, and decoding the target frame. The method also decodes multiview videos by maintaining a reference picture list for a current frame of a plurality of multiview videos, and predicting each current frame of the plurality of multiview videos according to reference pictures indexed by the associated reference picture list.
摘要:
A method randomly accesses multiview videos. Multiview videos are acquired of a scene with corresponding cameras arranged at poses, such that there is view overlap between any pair of cameras. V-frames are generated from the multiview videos. The V-frames are encoded using only spatial prediction. Then, the V-frames are inserted periodically in an encoded bitstream to provide random temporal access to the multiview videos.
摘要:
A model stored in a memory accessible by a video transcoder includes a first rate-distortion function modeling a requantization of an input video. A second-rate distortion function models a resynchronization marker insertion rate for the transcoded video, and a third rate-distortion function models an intra-block insertion rate for the transcoded video.
摘要:
A method classifies pixels in an image by first partitioning the image into blocks. A variance of an intensity is determined for each pixel, and for each block the pixel with the maximum variance is identified. Then, the blocks are classified into classes according to the maximum variance.