摘要:
Selecting a coding mode for coding video data by measuring a distortion sensitivity threshold for a pixel block, calculating a distortion threshold representative of the maximum distortion that may be effectively masked by the brightness and texture of the pixel block, estimating the distortion induced by coding the pixel block according to skip mode and coding the source pixel block with a predictive coding technique if the estimated distortion value exceeds the distortion threshold. The distortion sensitivity threshold may include, for example, a brightness value or a texture value. The contrast between the pixel block and the surrounding pixel blocks may also be considered such that if the contrast exceeds a contrast threshold calculated based on the measurement of brightness and texture, the source pixel block may be coded with a predictive coding technique even if the estimated distortion value does not exceed the distortion threshold.
摘要:
A video coder employs techniques for applying frame rate adaptation and variable resolution adaptation in response to environmental coding factors present at the coding terminal. According to such techniques, a coder may estimate a coding quality level to be applied based on the environmental coding factors. The coder may retrieve from a controller table, settings for resolution and frame rate based on the estimated quality level. Optionally, the coder further may retrieve settings identifying a range of quantization parameters that may be used during coding. Prior to coding, the coder may configure input video data to match the resolution and frame rate settings retrieved from the controller table. Thereafter, the coder may code the reconfigured input video data by motion-compensation prediction constrained, as applicable, by the retrieved quantization parameter range.
摘要:
A system and method for coding video data wherein a pixel block may be coded for refresh with reference to an LTR frame that was successfully transmitted, or has a high probability of having been successfully transmitted from the encoder to the decoder. Not all pixel blocks in the frame may be refreshed at the same rate. Pixel blocks containing edge details, containing a significant object, or containing foreground image data may be refreshed more often than pixel blocks containing smooth, background, or relatively less significant image data.
摘要:
A scene-aware auto-exposure control process stabilizes changes in a camera's auto-exposure settings so as to reduce lighting and color flicker during image capture operations. A metric, referred to as the Modified Adjusted Luminance (MAL) metric, is defined to remain relatively constant as long as the lighting of the scene being captured remains relatively constant. Thus, scene changes such as an object moving into, out of, or around in a scene do not significantly affect the MAL metric's value and do not, therefore, trigger an exposure adjustment. Once the MAL metric indicates a scene's lighting is stable, the camera's auto-exposure operation may be suppressed. As long as incoming frames indicate a stable lighting condition (based on the MAL metric), auto-exposure operation may remain suppressed. When incoming frames result in a substantially different MAL over a specified number of frames, auto-exposure operation may be restored.
摘要:
A coding technique is disclosed in which frames of a video sequence are assigned to one of a plurality of sub-channels to be transmitted to a decoder. The frames are coded according to predictive coding techniques such that ordinarily prediction references of the frames in each sub-channel only reach the reference frames that occur within the same sub-channel. Thus, if transmission errors arise with respect to one sub-channel, decoding may occur for another sub-channel until the transmission error is detected and corrected.
摘要:
A rate control system is disclosed for video coding applications. The rate controller assigns a quantization parameter for video data in a picture in response to complexity indicators indicative of spatial complexity, motion complexity and/or bits per pel of the picture. A virtual buffer based quantizer parameter is proposed based on a virtual buffer fullness analysis and a target rate estimate, which is derived from the complexity indicators. A second quantizer parameter is proposed from a linear regression analysis of quantizer parameters used to code previously coded pictures of similar type (e.g., I pictures, P pictures or B pictures). A coding policy decision unit defines a final quantizer parameter from a comparison of the two proposed quantizer parameters.
摘要:
Methods, systems, and apparatus are presented for reducing distortion in an image, such as a video image. A video image can be captured by an image capture device, e.g. during a video conferencing session. Distortion correction processing, such as the application of one or more warping techniques, can be applied to the captured image to produce a distortion corrected image, which can be transmitted to one or more participants. The warping techniques can be performed in accordance with one or more warp parameters specifying a transformation of the captured image. Further, the warp parameters can be generated in accordance with an orientation of the image capture device, which can be determined based on sensor data or can be a fixed value. Additionally or alternatively, the warp parameters can be determined in accordance with a reference image or model to which the captured image should be warped.
摘要:
Method For Implementing A Quantizer In A Multimedia Compression And Encoding System is disclosed. In the Quantizer system of the present invention, several new quantization ideas are disclosed. In one embodiment, adjacent macroblocks are grouped together into macroblock groups. The macroblock groups are then assigned a common quantizer value. The common quantizer value may be selected based upon how the macroblocks are encoded, the type of macroblocks within the macroblock group (intra-blocks or inter-blocks), the history of the motion vectors associated with the macroblocks in the macroblock group, the residuals of the macroblocks in the macroblock group, and the energy of the macroblocks in the macroblock group. The quantizer value may be adjusted in a manner that is dependent on the current quantizer value. Specifically, if the quantizer value is at the low end of the quantizer scale, then only small adjustments are made. If the quantizer value is at the high end then larger adjustments may be made to the quantizer. Finally, in one embodiment, the quantizer is implemented along with an inverse quantizer for efficient operation.
摘要:
Some embodiments of the invention provide a method of performing a Discrete Cosine Transform (“DCT”) encoding or decoding coefficients of a data array by (1) multiplying the coefficients by a scalar value before the encoding or decoding, and then (2) dividing the encoded or decoded coefficients by the scalar value. When used in conjunction with fixed-point arithmetic, this method increases the precision of the encoded and decoded results. In addition, some embodiments provide a method of performing a two-dimensional (2D) Inverse Discrete Cosine Transform (“iDCT”). This method splits a pre-multiplication operation of the iDCT into two or more separate stages. When used in conjunction with fixed-point arithmetic, this splitting increases the precision of the decoded results of the iDCT.
摘要:
In a video coder/decoder system using variable resolution adaptation, decoder techniques provide a mechanism to changing resolution of coded lower-resolution video to a higher resolution for rendering. Coded video data of a low resolution frame may be decoded. A motion estimation search may be performed between the decoded low resolution frame and a cache of previously-stored high resolution video frames. If the motion estimation search generates one or more matches, high resolution video data of the decoded frame may generated as a derivation of matching data from the cached video frames.