摘要:
Region of Interest (ROI) scalability with SHVC is able to be implemented where scalability is used for part of a picture but not the whole picture. Applications of ROI scalability include traffic monitoring, security monitoring and tiled streaming.
摘要:
A context reduced last transform (CRLT) coding technique which enhances parallel context processing, such as utilized in JCTVC-D262, to reduce complexity by reducing the number of context models using for coding the position of the last significant transform coefficient. Selected context models are removed and additional bins are shared which reduce the number of contexts required. In one benchmark test for YUV 4:2:0 video, the number of context models were reduced from 120 for the proposed entropy encoding of JCTVC-D262 test model HM 2.0, versus 82 context models required for CRLT coding.
摘要:
Rate-QP estimation for an I picture is disclosed which involves the steps of: providing an input group of pictures (GOP); selecting an input I picture within the GOP; and outputting, to a computer readable medium, a bit rate corrected Rate-QP, R(QP), for the input I picture. The outputting step may involve calculating intra luma and chroma Rate-QP estimates from corresponding intra luma and chroma histograms; offsetting the intra chroma Rate-QP estimate to form an offset intra chroma estimate; and setting a bit rate corrected Rate-QP for the input I picture to a corrected sum of the previous estimates. The histograms are formed with estimates of intra prediction coefficients, where an intra/non-intra mode is selected that results in a lowest SATD for each macroblock in the GOP. The methods may be implemented into a computer program, possibly resident in an advanced video encoder.
摘要:
Picture level rate control systems, apparatuses and methods are described which indirectly control bit rate through peak signal to noise ratio (PSNR), thereby providing a stable PSNR while maintaining a given target average bit rate. In a preferred implementation, encoding is performed in conjunction with macroblock QP modulation. In this way, the picture level rate control provides stable temporal quality and the macroblock QP modulation provides stable spatial quality. By way of example, the number of bits generated in a group of pictures (GOP) is controlled by modulating the target bit allocation for a group of pictures (GOP) based on coded picture buffer (CPB) fullness. Macroblocks are encoded for each picture based on a rate-distortion model, which is modified in response to the relative fluctuation of PSNR of pictures within a group of pictures.
摘要:
A method of generating a value for a missing pixel “x” by determining a “least harmful” local edge direction between pixels, or sub-pixels, on substantially opposing sides of the missing pixel, and interpolating the difference to arrive at a value for pixel “x”. The method involves generating sub-pixel values for locations within neighboring pixels, the sub-pixels may comprise half-pixels, quarter-pixels, three-quarter pixels, and so forth, wherein any fractional pixel quantity may be created. Absolute difference values are calculated between neighboring pixels, or sub-pixel values, to determine a least harmful local edge direction along which a value is generated for pixel “x” by interpolation.
摘要:
A context reduced last transform (CRLT) coding technique which enhances parallel context processing, such as utilized in JCTVC-D262, to reduce complexity by reducing the number of context models using for coding the position of the last significant transform coefficient. Selected context models are removed and additional bins are shared which reduce the number of contexts required. In one benchmark test for YUV 4:2:0 video, the number of context models were reduced from 120 for the proposed entropy encoding of JCTVC-D262 test model HM 2.0, versus 82 context models required for CRLT coding.
摘要:
A method and apparatus for performing quantization during video compression, such as within an MPEG-2 encoder. Inter-quantization is particularly described, although the teachings may be applied to intra-quantization as well. Computationally intensive portions of the quantization process are split into multiple components in response to differing execution probability. A first component is executed within each quantization iteration, and preferably is performed in parallel, such as using SIMD processing. At least one additional component, such as a second component, is serially processed in response to detecting a condition that does not arise with each loop iteration. One configuration replaces division operations with a combination of multiply and shift operations, while making extended use of table lookup procedures to reduce processing overhead.
摘要:
A fast implementation context selection of a significance map includes caching the significance of each coefficient in a buffer, storing an extended significance map in an extended buffer and updating the extended significance map after the significance of a transform coefficient is determined by the encoder or the decoder.
摘要:
An apparatus and method are taught for estimating an optimized sub-pixel interpolation filter using iterative and non-iterative estimations as needed for sub-pixel motion compensation and motion estimation in a video codec for improving coding efficiency. Motion vector information and mode decisions are passed from the first encoding stage which uses predetermined interpolation to at least a second encoding stage which uses an estimated adaptive interpolation filter determined during the first encoding stage. Processing overhead is reduced within the subsequent stages. Embodiments are described in which additional stages perform iterative encoding and estimation of interpolation filter in an n-th iteration.
摘要:
The invention is an apparatus and method for estimating an optimized sub-pixel interpolation filter using iterative estimations as needed for sub-pixel motion compensation and motion estimation in a video codec for improving coding efficiency. Multiple iterations of adaptive interpolation filter estimation are performed including more than one iteration based on sub-pixel motion vectors. During testing of the inventive apparatus and method on various video segments, average bit rate reductions were exhibited of approximately 5%.