Abstract:
A method, system, and apparatus for a linear temporal reference scheme having non-reference predictive frames are disclosed. In one embodiment, a method of a temporal reference scheme includes creating a non-reference predictive frame and a reference predictive frame from a reference frame, increasing a quantization factor of the non-reference predictive frame by a threshold value, and decreasing the quantization factor of the reference predictive frame by the threshold value. The method may include toggling, based on a scenario, between the temporal reference scheme and another temporal reference scheme that has only reference predictive frames and no non-reference predictive frames. The method may also include calculating a target bit rate across a communication channel through a bit rate calculation algorithm that considers the bandwidth capacity of the communication channel.
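The quantization adjustment described above can be sketched as follows. This is a minimal illustration, not the disclosed implementation: the alternating reference/non-reference pattern, the names `PFrame` and `assign_qp`, and the 0 to 51 clamping range (borrowed from H.264) are all assumptions.

```python
from dataclasses import dataclass


@dataclass
class PFrame:
    index: int
    is_reference: bool
    qp: int


def assign_qp(num_frames, base_qp, threshold, qp_min=0, qp_max=51):
    """Offset the quantization factor of each predictive frame.

    Assumption: every second P-frame is kept as a reference frame and the
    others are non-reference frames. The abstract does not fix this pattern.
    """
    frames = []
    for i in range(num_frames):
        is_ref = (i % 2 == 1)
        # Non-reference frames are quantized more coarsely (+threshold),
        # reference frames more finely (-threshold).
        delta = -threshold if is_ref else threshold
        qp = max(qp_min, min(qp_max, base_qp + delta))
        frames.append(PFrame(i, is_ref, qp))
    return frames


if __name__ == "__main__":
    for frame in assign_qp(num_frames=4, base_qp=30, threshold=2):
        print(frame)
```

Spending fewer bits on frames that nothing else predicts from, and more on frames that are reused as references, is the trade-off this sketch is meant to show.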
Abstract:
Image frames of a video stream are encoded with the aim of reducing flicker in the video stream when displayed. In one embodiment, the quantization parameter used to quantize an image frame is capped to be not greater than a quantization parameter used to quantize an immediately previous image frame. In another embodiment, the quantization step size used to quantize a macro-block of an image frame is computed based on the value of a quantization error of a co-located macro-block in an immediately previous image frame. In yet another embodiment, macro-block transform coefficients corresponding to high-frequency components are quantized using relatively higher quantization parameter values if the image frame is deemed to contain high activity. In yet another embodiment, flicker-prone macro-blocks of a source frame are replaced by corresponding best-match macro-blocks of a previous reconstructed frame, prior to being encoded.
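The first embodiment, capping a frame's quantization parameter at the value used for the immediately previous frame, might look like the sketch below. The function name `cap_frame_qp` is hypothetical, and the abstract does not say over which span of frames the cap is applied, so applying it across the whole sequence is an assumption.

```python
def cap_frame_qp(proposed_qps):
    """Cap each frame's QP so it never exceeds the previous frame's QP.

    `proposed_qps` holds the values a rate controller would otherwise use.
    Assumption: the cap is applied to every frame after the first.
    """
    capped = []
    prev = None
    for qp in proposed_qps:
        if prev is not None and qp > prev:
            qp = prev  # a sudden jump to coarser quantization can show up as flicker
        capped.append(qp)
        prev = qp
    return capped


if __name__ == "__main__":
    print(cap_frame_qp([30, 32, 28, 29]))
```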
Abstract:
A video encoder receives a macro-block of an image frame, and determines whether the macro-block contains text. The video encoder computes a quantization parameter for quantizing the macro-block, with the quantization parameter computed to be smaller if the macro-block is determined to contain text. The video encoder encodes the macro-block using the quantization parameter. Text quality in the encoded macro-block is preserved.
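As a rough illustration of the flow above, the sketch below lowers the quantization parameter for macro-blocks that look like text. The abstract does not describe the text detector, so `looks_like_text` is a stand-in heuristic (density of sharp horizontal transitions), and the thresholds and the `text_qp_offset` value are hypothetical.

```python
def looks_like_text(mb, edge_thresh=64, density_thresh=0.2):
    """Stand-in text detector: fraction of sharp horizontal pixel transitions.

    This heuristic is an assumption, not the detector from the abstract.
    `mb` is a list of rows of luma values.
    """
    sharp = 0
    total = 0
    for row in mb:
        for a, b in zip(row, row[1:]):
            total += 1
            if abs(a - b) >= edge_thresh:
                sharp += 1
    return total > 0 and sharp / total >= density_thresh


def macroblock_qp(mb, base_qp, text_qp_offset=4, qp_min=0):
    """Use a smaller QP when the macro-block is judged to contain text."""
    if looks_like_text(mb):
        return max(qp_min, base_qp - text_qp_offset)
    return base_qp


if __name__ == "__main__":
    flat = [[128] * 16 for _ in range(16)]
    stripes = [[0 if (x // 2) % 2 == 0 else 255 for x in range(16)] for _ in range(16)]
    print(macroblock_qp(flat, 30), macroblock_qp(stripes, 30))
```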
Abstract:
A method and apparatus for a predictive reference data transfer scheme for motion estimation are disclosed. The method includes computing, via a processor, a hypothetical rectangular region in the reference frame that contains all the predictor and reference data needed for a motion estimation search around the region; if the macroblock is not the first in a row, computing the overlap with previously fetched reference data and transferring the needed data; invalidating any predictor that is not part of the fetched data; and regulating the motion estimation and setting the motion vector to an effective value based on the fetched and computed data.
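The overlap computation can be sketched as below, under simplifying assumptions: the region is a fixed-size search window around the macroblock rather than a bounding box around the predictors, rectangles are half-open, and only horizontal reuse within a macroblock row is handled. `Rect`, `search_region`, `columns_to_fetch`, and `predictor_is_valid` are hypothetical names.

```python
from typing import NamedTuple, Optional


class Rect(NamedTuple):
    x0: int
    y0: int
    x1: int  # exclusive
    y1: int  # exclusive


def search_region(mb_x, mb_y, search_range, frame_w, frame_h, mb_size=16):
    """Rectangle of reference pixels needed to search around one macroblock."""
    return Rect(max(0, mb_x - search_range),
                max(0, mb_y - search_range),
                min(frame_w, mb_x + mb_size + search_range),
                min(frame_h, mb_y + mb_size + search_range))


def columns_to_fetch(current: Rect, previous: Optional[Rect]) -> Rect:
    """Part of `current` that the previous fetch in the same row did not cover."""
    if previous is None or (previous.y0, previous.y1) != (current.y0, current.y1):
        return current  # first macroblock in the row: fetch everything
    if previous.x1 <= current.x0 or previous.x0 > current.x0:
        return current  # no usable overlap on the left side
    return Rect(min(previous.x1, current.x1), current.y0, current.x1, current.y1)


def predictor_is_valid(pred_x, pred_y, fetched: Rect, mb_size=16):
    """A predictor is kept only if its block lies fully inside the fetched data."""
    return (fetched.x0 <= pred_x and pred_x + mb_size <= fetched.x1 and
            fetched.y0 <= pred_y and pred_y + mb_size <= fetched.y1)


if __name__ == "__main__":
    prev = search_region(0, 16, 8, 64, 64)
    cur = search_region(16, 16, 8, 64, 64)
    print(columns_to_fetch(cur, prev))
```

In this example only the 16 new columns on the right of the window need to be transferred for the second macroblock; the rest is reused from the previous fetch.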
Abstract:
In a video encoder, pixel values of a macro-block are processed to determine an activity measure indicative of the type of content in the macro-block. Several techniques are employed for determining the activity measure of a macro-block. In an embodiment, a default quantization scale for quantizing a macro-block is modified based on the activity measure of the macro-block. In another embodiment, the macro-block is classified into one of multiple classes based on its activity measure. The default quantization scale for quantizing the macro-block is modified based on the classification of the macro-block. In yet another embodiment, an encoding mode to be used for encoding a macro-block is also determined on the basis of the class of the macro-block. Several of the techniques exploit the fact that the human visual system (HVS) has different sensitivities in perceiving a (rendered) macro-block or video frame, depending on the type of macro-block content.
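One way to picture the classification-based embodiment is the sketch below. The abstract does not say which activity measure is used, so pixel variance is an assumption here, as are the class names, the thresholds, and the QP offsets. The direction of the offsets follows the common observation that distortion is more visible in smooth regions than in busy ones.

```python
from statistics import pvariance

# Hypothetical per-class offsets applied to the default quantization scale.
QP_OFFSET = {"smooth": -2, "textured": 0, "busy": 2}


def activity(mb):
    """Assumed activity measure: population variance of the pixel values."""
    pixels = [p for row in mb for p in row]
    return pvariance(pixels)


def classify(act, low=25.0, high=400.0):
    """Map an activity value to one of three illustrative classes."""
    if act < low:
        return "smooth"
    if act < high:
        return "textured"
    return "busy"


def adjusted_qp(mb, default_qp, qp_min=0, qp_max=51):
    """Modify the default QP according to the macro-block's class."""
    cls = classify(activity(mb))
    return max(qp_min, min(qp_max, default_qp + QP_OFFSET[cls]))


if __name__ == "__main__":
    flat = [[128] * 16 for _ in range(16)]
    checker = [[0 if (x + y) % 2 == 0 else 255 for x in range(16)] for y in range(16)]
    print(adjusted_qp(flat, 30), adjusted_qp(checker, 30))
```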
Abstract:
Several techniques aimed at reducing computational complexity when encoding uses bi-predictively encoded frames (B-frames) are implemented in a video encoder. In an embodiment, B-frames are not used as reference frames for encoding P-frames and other B-frames. Not using B-frames as references allows the de-blocking filter in the video encoder to be switched off when reconstructing encoded B-frames, and allows a lower-complexity filter to be used for fractional-resolution motion search for B-frames. In another embodiment, cost functions used in motion estimation for B-frames are simplified to reduce computational complexity. In one more embodiment, fractional pixel refinement in motion search for B-frames is simplified. In yet another embodiment, predictors used in motion estimation for a macro-block in a P-frame are selected from a B-frame that uses the same reference frame as the P-frame.
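The first embodiment amounts to choosing cheaper tools for frames that nothing else predicts from. A minimal sketch of such a selection follows; the names `ReconTools` and `tools_for` and the filter labels `"bilinear"` and `"6-tap"` are assumptions used only for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ReconTools:
    deblocking: bool      # run the de-blocking filter on the reconstructed frame
    subpel_filter: str    # interpolation filter for fractional-resolution search


def tools_for(frame_type, used_as_reference):
    """Pick reconstruction and search tools for one frame.

    Assumption: "bilinear" stands for the lower-complexity filter and
    "6-tap" for the regular one; the abstract names neither.
    """
    if frame_type == "B" and not used_as_reference:
        return ReconTools(deblocking=False, subpel_filter="bilinear")
    return ReconTools(deblocking=True, subpel_filter="6-tap")


if __name__ == "__main__":
    print(tools_for("B", used_as_reference=False))
    print(tools_for("P", used_as_reference=True))
```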
Abstract:
This invention predicts that intra mode prediction is more effective for macroblocks where motion estimation in inter mode prediction fails. This failure is indicated by a large value of the inter mode sum of absolute differences (SAD). This invention performs intra mode prediction only for macroblocks that have large inter mode SADs. The definition of a large inter mode SAD differs for different content. This invention therefore compares the inter mode SAD of the current macroblock with an adaptive threshold. This adaptive threshold depends on the average and variance of the SADs of the previous predicted frame. An adaptive threshold is calculated for each new predictive frame.
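The thresholding step can be sketched as follows. The abstract states only that the threshold depends on the average and variance of the previous frame's SADs; the specific form used here, mean plus `k` standard deviations, is an assumption, as are the names `adaptive_threshold` and `needs_intra_check`.

```python
from statistics import mean, pvariance


def adaptive_threshold(prev_sads, k=1.0):
    """Threshold from the previous predicted frame's inter mode SADs.

    Assumed form: mean + k * sqrt(variance). The source gives no formula.
    """
    return mean(prev_sads) + k * pvariance(prev_sads) ** 0.5


def needs_intra_check(cur_sads, prev_sads, k=1.0):
    """Flag macroblocks whose inter mode SAD exceeds the adaptive threshold."""
    threshold = adaptive_threshold(prev_sads, k)
    return [sad > threshold for sad in cur_sads]


if __name__ == "__main__":
    previous_frame_sads = [100, 100, 300, 300]
    current_frame_sads = [150, 301, 300]
    print(adaptive_threshold(previous_frame_sads))
    print(needs_intra_check(current_frame_sads, previous_frame_sads))
```

Only the flagged macroblocks would go on to intra mode prediction; the rest keep their inter mode result, which is where the saving comes from.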
Abstract:
A method, system, and apparatus for determining an intra prediction type based on source statistics are disclosed. In one embodiment, a method includes classifying a four-pixel square block into an edge class (e.g., a DC edge class, a vertical edge class, a horizontal edge class, a diagonal edge class, and/or a planar edge class) based on an edge classifier; classifying an eight-pixel square block containing the four-pixel square block and other four-pixel square blocks as a homogeneous class if all of its four-pixel square blocks belong to the same edge class; assigning a direction to the edge class of the eight-pixel square block; and determining an optimal intra-prediction type through the classification, such that empirical testing of all possible edge classes and directions is avoided when the homogeneous class is identified.
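The two-level classification can be sketched as below. The edge classifier itself is not described in the abstract, so the gradient comparison in `classify_4x4` is an assumption, and only three of the listed classes (DC, vertical, horizontal) are modeled to keep the example short.

```python
def classify_4x4(block, flat_thresh=8):
    """Assumed edge classifier based on summed absolute pixel differences."""
    gx = sum(abs(block[y][x + 1] - block[y][x]) for y in range(4) for x in range(3))
    gy = sum(abs(block[y + 1][x] - block[y][x]) for y in range(3) for x in range(4))
    if gx < flat_thresh and gy < flat_thresh:
        return "DC"
    # Large left-to-right changes with little top-to-bottom change mean the
    # columns are nearly constant, i.e. a vertical edge.
    return "vertical" if gx >= gy else "horizontal"


def classify_8x8(block8):
    """Return the shared edge class if all four 4x4 sub-blocks agree, else None."""
    classes = set()
    for by in (0, 4):
        for bx in (0, 4):
            sub = [row[bx:bx + 4] for row in block8[by:by + 4]]
            classes.add(classify_4x4(sub))
    return classes.pop() if len(classes) == 1 else None


if __name__ == "__main__":
    stripes = [[0, 0, 255, 255, 0, 0, 255, 255] for _ in range(8)]
    mixed = stripes[:4] + [[10] * 8 for _ in range(4)]
    print(classify_8x8(stripes), classify_8x8(mixed))
```

When `classify_8x8` returns a class, an encoder following this idea could pick the matching intra-prediction type directly; a `None` result would fall back to testing the candidate types.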