摘要:
An apparatus and concomitant method for selecting a macroblock coding mode based upon the quantization scale selected for the macroblock. The total number of bits needed to code each macroblock consists of two parts, bits needed for coding motion vectors and bits for coding the predictive residual. The number of bits for coding the motion vectors is generally obtained from a look-up table. The number of bits for coding the predictive residual is obtained by an estimation which assumes that the number of bits for encoding the predictive residuals is directly proportional to the value of its variance and inversely proportional to the value of quantizer steps (quantizer scale). Using this estimation, the total number of bits necessary to code a macroblock is calculated and compared for each coding mode. By selecting the coding mode with the least number of bits, a near-optimal solution of low complexity for practical implementation is acquired.
摘要:
A method and apparatus for selecting a quantizer scale for each macroblock to maintain the overall quality of the video image while optimizing the coding rate. A quantizer scale is selected for each macroblock such that target bit rate for the picture is achieved while an optimal quantization scale ratio is maintained for successive macroblocks to produce a uniform visual quality over the entire picture. One embodiment applies the method to the frame level while another embodiment applies the method in conjunction with a wavelet transform.
摘要:
A dynamically configurable video signal processing system partitions and encodes data using a variable number of data segments and variable data resolution. The system partitions data into a variable number of data segments by predicting, as a function of the data rate, first and second distortion factors for the data partitioned into first and second numbers of data segments. The first and second distortion factors are mutually compared and the data is partitioned into the number of data segments which exhibits the lower distortion factor value. First and second distortion factors for the data encoded with first and second data resolutions are also predicted. The first and second distortion factors are similarly compared and the data is encoded with the resolution exhibiting the lower distortion factor value.
摘要:
A method transcodes a compressed multi-layer video bitstream that includes a base layer bitstream and an enhancement layer bitstream. The base and enhancement layers are first partially decoded, and then the partially decoded signals are combined with a motion compensated signal yielding a combined signal. The combined signal is quantized into an output signal according to a quantization parameter, and the output signal is variable length encoded as a single layer bitstream. In a preprocessing step, the enhancement layer can be truncated according to rate control constraint, and the same constraints can also be used during the quantization.
摘要:
A dynamically configurable video signal processing system including an encoder and decoder processes data in the form of hierarchical layers. The system partitions data between hierarchical layers and allows variation in the number of layers employed. Data is automatically partitioned into one or more hierarchical layers as a function of one or more parameters selected from available system bandwidth, input data rate, and output signal quality. In addition, the image resolution and corresponding number of pixels per image of the data may be varied as a function of system parameters.
摘要:
A method of motion estimation for video encoding constructs a binary pyramid structure having three binary layers. A state update module registers and updates repeat occurrence of final motion vectors and a static-state checking module determines if the method is in a static mode or a normal mode based on the repeat occurrence. In a normal mode, the first binary layer is searched within a ±3 pixel refinement window to determine a first level motion vector. In the second binary layer, a search range is computed based on six motion vector candidates. By checking every point within in the search range, a second binary layer search generates a second level motion vector. Finally, a third binary layer search within a ±2 pixel refinement window generates a final motion vector according to the second level motion vector. In a static mode, a fine tuning module performs search within a ±1 pixel refinement window and generates a final motion vector.
摘要:
An apparatus and a concomitant method for using M-ary pyramid decomposition to reduce the computational complexity in determining motion vectors for block-based motion estimation is disclosed.
摘要:
A method and apparatus for recursively optimizing the rate control of a hierarchical subband coding system that offers spatial, quality and/or complexity scalabilities. The rate control method recursively adjusts the quantizer scale for each layer of a subband tree, i.e., a subband decomposed image.
摘要:
An apparatus and a concomitant method for performing hierarchial block-based motion estimation with a high degree of scalability is disclosed. The present invention decomposes each of the image frames within an image sequence into an M-ary pyramid. Different dynamic ranges for representing the pixel values are used for different levels of the M-ary pyramid, thereby generating a plurality of different “P-bit” levels, i.e., a plurality of different M-ary pyramid architectures. The present scalable hierarchical motion estimation provides the flexibility of switching from one M-ary pyramid architecture to another M-ary pyramid architecture according to the available platform resources and/or user's choice.