摘要:
A video codec provides for encoding and decoding pictures of a video sequence at various coded resolutions, such that pictures can be encoded at lower coded resolutions based on bit rate or other constraints while maintaining a consistent display resolution. The video codec employs a coding syntax where a maximum coded resolution is signaled at the sequence level of the syntax hierarchy, whereas a lower coded resolution is signaled at the entry point level for a segment of one or more intra-coded frames and frames predictively encoded based thereon. This allows the use of a separate out-of-loop resampler after the decoder to up-sample the pictures to the display resolution.
摘要:
A video codec provides for adaptive vertical macroblock alignment of mixed interlaced and progressive video sequences. With adaptive vertical macroblock alignment, a video codec enforces a macroblock alignment height restriction on per picture basis, rather than requiring that all frames in a sequence adhere to a uniform height restriction. The video codec can then apply less padding to progressive and like type pictures that have smaller macroblock alignment increments, than to interlaced type pictures with larger alignment increments, which can save significant compression overhead.
摘要:
At high bit rates, the reconstruction error of compressed video is generally proportional to the squared value of quantization step size, such that full quantization step size increments at high bit rates can lead to significant change in the reconstruction error and/or bit rate of the compressed video. A video codec uses fractional increments of quantization step size at high bit rates to permit a more continuous variation of quality and/or bit rate as the quantization scale changes. For high bit rate scenarios, the bit stream syntax includes an additional syntax element to specify fractional step increments (e.g., half step) of the normal quantizer scale step sizes.
摘要:
In one aspect, an encoder/decoder selects a bitplane mode from a group of plural available bitplane modes, and processes a bitplane according to the selected bitplane mode, wherein the bitplane indicates AC prediction status information for plural macroblocks of a video picture. In another aspect, an encoder encodes a bitplane that indicates AC prediction status information for plural macroblocks of a video picture and signals the encoded bitplane. In another aspect, a decoder receives an encoded bitplane and decodes the bitplane, wherein the bitplane indicates AC prediction status information for plural macroblocks of a video picture.
摘要:
Techniques and tools for escape mode code resizing are described. For example, a video decoder receives encoded information (e.g., runs, levels) for transform coefficients of blocks. For at least some of the encoded information, the decoder decodes in an escape mode for which codes have sizes signaled on a sub-frame basis (e.g., on a per-interlaced field basis in a video frame, or on a per-slice basis in a video frame). A video encoder performs corresponding encoding and signaling.
摘要:
Various techniques and tools for chrominance motion vector rounding are described. For example, during motion compensation, a video encoder or decoder converts luminance motion vectors into chrominance motion vector having quarter-pixel accuracy. Or, during motion compensation, a video encoder or decoder converts luminance motion vectors into chrominance motion vectors using one of multiple available chrominance motion vector rounding modes.
摘要:
Techniques and tools for coding/decoding of video images, and in particular, B-frames, are described. In one aspect, a video encoder/decoder determines a fraction for a current image in a sequence. The fraction represents an estimated temporal distance position for the current image relative to an interval between a reference images for the current image. The video encoder/decoder processes the fraction along with a motion vector for a first reference image, resulting in a representation of motion (e.g., constant or variable velocity motion) in the current image. Other aspects are also described, including intra B-frames, forward and backward buffers for motion vector prediction, bitplane encoding of direct mode prediction information, multiple motion vector resolutions/interpolation filters for B-frames, proactive dropping of B-frames, and signaling of dropped predicted frames.
摘要:
At high bit rates, the reconstruction error of compressed video is generally proportional to the squared value of quantization step size, such that full quantization step size increments at high bit rates can lead to significant change in the reconstruction error and/or bit rate of the compressed video. A video codec uses fractional increments of quantization step size at high bit rates to permit a more continuous variation of quality and/or bit rate as the quantization scale changes. For high bit rate scenarios, the bit stream syntax includes an additional syntax element to specify fractional step increments (e.g., half step) of the normal quantizer scale step sizes.
摘要:
A video codec provides efficient repeat padding of hybrid video sequences having arbitrary video resolution. The video codec repeat pads to expand the active content of pictures in the video sequence out to meet an adaptive vertical macroblock alignment restriction that varies by picture type. For progressive type pictures, the video codec repeats the last row or horizontal boundary edge of the active content. For interlaced type pictures, the video coded repeats the last two rows (last row of each interlaced field) of the active content. This repeat padding differing by picture type provides a better prediction (lower prediction error residual) for macroblocks in following predicted frames whose motion vector points into the padded region.
摘要:
Techniques and tools for escape mode code resizing are described. For example, a video decoder receives encoded information (e.g., runs, levels) for transform coefficients of blocks. For at least some of the encoded information, the decoder decodes in an escape mode for which codes have sizes signaled on a sub-frame basis (e.g., on a per-interlaced field basis in a video frame, or on a per-slice basis in a video frame). A video encoder performs corresponding encoding and signaling.