Abstract:
Methods of encoding and decoding video in a low-fidelity mode are described. A coding unit level low-fidelity flag is present in the bitstream to signal whether low-fidelity mode is enabled for a particular coding unit or not. If enabled, then, for that coding unit, the chroma quantization parameter is determined using the luma quantization parameter adjusted by a low-fidelity-mode offset. If not enabled, then, for that coding unit, the chroma quantization parameter is determined using the luma quantization parameter without adjustment by the low-fidelity-mode offset. The chroma quantization parameter is then used in the scaling of quantized chroma transform domain coefficients. Use with luma or other video components is also proposed.
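The chroma QP derivation described above can be sketched as follows. This is a minimal illustration, not the claimed method: the function names, the offset value of 3, the 0..51 clipping range, and the step-size relation 2^((QP-4)/6) borrowed from H.264/HEVC-style codecs are all assumptions, and any luma-to-chroma QP mapping table a real codec applies is omitted.

```python
def chroma_qp(luma_qp: int, low_fidelity_flag: bool, lf_offset: int = 3,
              qp_min: int = 0, qp_max: int = 51) -> int:
    """Derive the chroma QP for one coding unit (illustrative only).

    lf_offset, qp_min and qp_max are assumed values, not taken from the source.
    """
    # The low-fidelity-mode offset is applied only when the CU-level flag is set.
    qp = luma_qp + (lf_offset if low_fidelity_flag else 0)
    return max(qp_min, min(qp_max, qp))


def scale_chroma_coefficients(levels, qp: int):
    """Scale quantized chroma coefficients by an assumed step size."""
    step = 2 ** ((qp - 4) / 6)  # assumed H.264/HEVC-style QP-to-step relation
    return [level * step for level in levels]
```

For example, `chroma_qp(30, True)` gives 33 under these assumptions, while `chroma_qp(30, False)` leaves the luma QP of 30 unchanged.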
Abstract:
Methods of encoding and decoding video are described. The methods for encoding and decoding a picture partitioned into blocks include determining an activity rank for a block, based on a block size of the block and an intra-coding mode for the block; calculating a normalization value based on the activity rank divided by an average activity rank determined over a plurality of previously-encoded/decoded blocks; and quantizing/dequantizing a set of transform domain coefficients for the block using a quantization step size adjusted by the normalization value.
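A rough sketch of the normalization step follows. The rank table, the treatment of intra modes 0 and 1 as "smooth", the class name, and the choice to multiply the step size by the normalization value are hypothetical; the abstract states only that the rank depends on block size and intra-coding mode and that it is divided by a running average over earlier blocks.

```python
class ActivityNormalizer:
    """Tracks the average activity rank over previously coded blocks."""

    # Hypothetical base ranks: smaller blocks are assumed to be more active.
    _BASE_RANK = {4: 4.0, 8: 3.0, 16: 2.0, 32: 1.0}

    def __init__(self):
        self._rank_sum = 0.0
        self._count = 0

    def rank(self, block_size: int, intra_mode: int) -> float:
        # Assumption: modes 0 and 1 (planar/DC-like) indicate smooth content.
        smooth = intra_mode in (0, 1)
        return self._BASE_RANK[block_size] * (0.5 if smooth else 1.0)

    def normalization(self, block_size: int, intra_mode: int) -> float:
        r = self.rank(block_size, intra_mode)
        # With no history yet, fall back to the block's own rank (value 1.0).
        avg = self._rank_sum / self._count if self._count else r
        self._rank_sum += r
        self._count += 1
        return r / avg


def adjusted_step(base_step: float, norm: float) -> float:
    # Assumption: the adjustment is a simple multiplication.
    return base_step * norm
```

Because the encoder and decoder both see the block size and intra mode, both can maintain the same running average without extra signaling.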
Abstract:
Methods of encoding and decoding a sequence of greater-than-one flags for a coefficient group in video data are provided. Context-based encoding and decoding selects a context for each greater-than-one flag based upon the position of that flag in the sequence of greater-than-one flags. Selection of the context may also be based upon the position of the last non-zero coefficient in the coefficient group.
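One possible shape for such a position-based context selection is sketched below. The number of contexts, the 16-coefficient group size, and the rule that a late last-non-zero position switches to a second context set are illustrative assumptions only.

```python
def gt1_context(flag_index: int, last_nz_pos=None,
                num_pos_ctx: int = 4, group_size: int = 16) -> int:
    """Pick a context index for the flag at position flag_index (hypothetical)."""
    # Position in the flag sequence selects the context, capped at the last one.
    ctx = min(flag_index, num_pos_ctx - 1)
    # Optional refinement: a last non-zero coefficient in the second half of
    # the group selects a second, separate set of contexts (assumed rule).
    if last_nz_pos is not None and last_nz_pos >= group_size // 2:
        ctx += num_pos_ctx
    return ctx
```

Note that the context here depends only on positions, not on the values of earlier flags, so it can be computed before those flags are decoded.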
Abstract:
Methods of encoding and decoding motion vector difference components for inter-coded video are described. The sign of one of the components is hidden within the parity of the sum of the magnitudes of the horizontal and vertical difference components. The sign of the other component is explicitly signaled in the bitstream. The hidden sign may be assigned to whichever of the two components is larger in magnitude. In other cases, the hidden sign may always be assigned to the horizontal component or always to the vertical component. In another case, the hidden sign may always be assigned to one component unless that component is zero, in which case the hidden sign is assigned to the other component. In another case, both components may have their signs hidden, in which case the sign hiding is based on their respective parities rather than on the parity of their sum.
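The larger-magnitude variant can be sketched from the decoder's side as follows. The convention that an odd sum means a negative sign, the tie-break toward the horizontal component, and the function names are assumptions. How the encoder adjusts a magnitude when the parity does not match the desired sign is outside this sketch, so `encode_mvd` simply returns `None` in that case.

```python
def _hide_in_x(abs_x: int, abs_y: int) -> bool:
    # Assumption: the hidden sign goes to the larger magnitude, ties go to x.
    return abs_x >= abs_y


def decode_mvd(abs_x: int, abs_y: int, explicit_sign_negative: bool):
    """Recover signed (dx, dy) from magnitudes plus one explicit sign bit."""
    # Assumed convention: odd sum of magnitudes means the hidden sign is negative.
    hidden_negative = (abs_x + abs_y) % 2 == 1
    if _hide_in_x(abs_x, abs_y):
        dx = -abs_x if hidden_negative else abs_x
        dy = -abs_y if explicit_sign_negative else abs_y
    else:
        dx = -abs_x if explicit_sign_negative else abs_x
        dy = -abs_y if hidden_negative else abs_y
    return dx, dy


def encode_mvd(dx: int, dy: int):
    """Return (abs_x, abs_y, explicit_sign_negative), or None on parity mismatch."""
    abs_x, abs_y = abs(dx), abs(dy)
    hide_x = _hide_in_x(abs_x, abs_y)
    hidden_value, explicit_value = (dx, dy) if hide_x else (dy, dx)
    parity_negative = (abs_x + abs_y) % 2 == 1
    if hidden_value != 0 and (hidden_value < 0) != parity_negative:
        return None  # a real encoder would have to modify a magnitude here
    return abs_x, abs_y, explicit_value < 0
```

For instance, `decode_mvd(2, 5, False)` yields `(2, -5)`: the sum 7 is odd, and the vertical component is the larger one, so it takes the hidden negative sign.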
Abstract:
Methods and devices for decoding, in a video decoder, a block of enhancement-layer transform domain data for an enhancement-layer encoded video and corresponding to a reconstructed block of reference-layer transform domain data, are provided. The method includes determining a context for an enhancement-layer element based, at least in part, upon elements in the corresponding reconstructed block of reference-layer transform domain data, wherein the elements are identified by a template and wherein the position of the template in the reference-layer reconstructed block is based upon the position of that enhancement-layer element in the block of enhancement-layer transform domain data.
Abstract:
Methods and devices for encoding and decoding scalable video are described. In one aspect, a method of reconstructing, in a video decoder, an enhancement-layer image based upon a reconstructed reference-layer image using inter-layer prediction is described. The method includes reconstructing a reference-layer residual and a reference-layer prediction, wherein the reference-layer residual and the reference-layer prediction, when combined, form the reconstructed reference-layer image; up-sampling the reference-layer residual using a first up-sampling operation; up-sampling the reference-layer prediction using a second up-sampling operation different from the first up-sampling operation; generating an inter-layer prediction using the up-sampled reference-layer residual and the up-sampled reference-layer prediction; and reconstructing the enhancement-layer image based upon the inter-layer prediction.
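To make the two-path up-sampling concrete, here is a one-dimensional sketch at a 2x ratio. The specific filters (sample repetition for the residual, linear interpolation for the prediction) and the plain sum used to combine them are assumptions chosen for brevity; the abstract requires only that the two up-sampling operations differ.

```python
def upsample_nearest(samples):
    """2x up-sampling by sample repetition (assumed filter for the residual)."""
    return [v for v in samples for _ in range(2)]


def upsample_linear(samples):
    """2x up-sampling by linear interpolation (assumed filter for the prediction)."""
    out = []
    last = len(samples) - 1
    for i, v in enumerate(samples):
        nxt = samples[min(i + 1, last)]  # repeat the edge sample at the end
        out.extend([v, (v + nxt) / 2])
    return out


def inter_layer_prediction(ref_residual, ref_prediction):
    """Combine the separately up-sampled residual and prediction (assumed sum)."""
    up_res = upsample_nearest(ref_residual)
    up_pred = upsample_linear(ref_prediction)
    return [r + p for r, p in zip(up_res, up_pred)]
```

With `ref_residual = [1, -1]` and `ref_prediction = [10, 20]`, the sketch produces `[11, 16.0, 19, 19.0]`.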
Abstract:
Methods of encoding and decoding coefficients for a transform unit in video data are described. In particular, the significant-coefficient flags for a coefficient group are encoded and decoded based upon a context determination, and the context is determined based upon the values of neighboring flags. The neighborhood used to determine the context varies depending on whether or not the significant-coefficient flag to be encoded or decoded is in the right column or bottom row of the coefficient group. If it is in the right column or bottom row, one of the alternative context neighborhoods is used to avoid relying on significant-coefficient flags in other coefficient groups, except for the flags immediately adjacent to the right border and bottom border of the coefficient group and the flag diagonally to the lower-right.
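A hypothetical sketch of switching neighborhoods is given below. The 4x4 coefficient group, the five-position default template, and the use of a simple count as the context are assumptions. The alternative neighborhood is modeled by dropping any template position that lies more than one sample beyond the group's right or bottom border.

```python
CG_SIZE = 4  # assumed 4x4 coefficient group

# Assumed default template: (dx, dy) offsets to the right of and below the flag.
DEFAULT_NEIGHBORHOOD = [(1, 0), (2, 0), (0, 1), (0, 2), (1, 1)]


def neighborhood(x: int, y: int):
    """Return the template offsets for the flag at (x, y) within its group."""
    if x == CG_SIZE - 1 or y == CG_SIZE - 1:
        # Right column or bottom row: keep only offsets that reach at most the
        # row or column immediately adjacent to the group border.
        return [(dx, dy) for dx, dy in DEFAULT_NEIGHBORHOOD
                if x + dx <= CG_SIZE and y + dy <= CG_SIZE]
    return list(DEFAULT_NEIGHBORHOOD)


def sig_context(x: int, y: int, is_significant) -> int:
    """Context = number of significant neighbors (assumed derivation)."""
    return sum(1 for dx, dy in neighborhood(x, y)
               if is_significant(x + dx, y + dy))
```

At the corner position `(3, 3)` this leaves only the right, bottom, and lower-right diagonal neighbors.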
Abstract:
Methods of encoding and decoding media, for example video, using conditional reconstruction refinement are described. The encoding and decoding rely upon predictions that are based on previously encoded and decoded samples. Prediction information identifies the previously reconstructed samples upon which a prediction is to be based and is available to both the encoder and decoder. On the basis that a previously reconstructed sample is going to be used as source data for a prediction, the encoder refines that reconstructed sample, uses the refined reconstructed sample for the prediction, and encodes refinement data to enable the decoder to also refine the previously reconstructed sample during decoding. In some cases, a reconstructed sample may be refined more than once. Various flags or bits may be used to signal when refinement is enabled or disabled, and they may be signaled at various hierarchical points in the coding structure.
Abstract:
Methods of encoding and decoding video, in particular chroma components, using filtered predictions, are described. An intra or inter predicted block is first filtered before being used to determine residual data at the encoder or to reconstruct chroma data at the decoder. The filtering tends to be low-pass filtering, which removes high-frequency components from the prediction block. The encoder signals the use of the filter to the decoder by including a filtered-prediction flag for each coding unit, or other region within a picture, that has been subject to prediction filtering. The syntax may provide that only coding units above a threshold size have an associated filtered-prediction flag; smaller coding units are governed by a filtered-prediction flag sent in connection with a higher node in the coding tree block.
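The decoder-side use of the filtered-prediction flag might look like the sketch below, shown on a single row of samples. The [1, 2, 1]/4 smoothing kernel, the edge handling, and the function names are assumptions; the abstract says only that the filter tends to be low-pass.

```python
def low_pass(row):
    """Smooth one row with an assumed [1, 2, 1] / 4 kernel and rounding."""
    last = len(row) - 1
    return [
        (row[max(i - 1, 0)] + 2 * row[i] + row[min(i + 1, last)] + 2) // 4
        for i in range(len(row))
    ]


def reconstruct_chroma(prediction, residual, filtered_prediction_flag: bool):
    """Add the residual to the (optionally filtered) prediction."""
    # The prediction is filtered only when the flag for this region is set.
    pred = low_pass(prediction) if filtered_prediction_flag else prediction
    return [p + r for p, r in zip(pred, residual)]
```

For the row `[0, 8, 0, 8]` the filter yields `[2, 4, 4, 6]`, which shows the high-frequency alternation being attenuated before the residual is added.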
Abstract:
Methods and devices for reconstructing coefficient levels from a bitstream of encoded video data for a coefficient group in a transform unit, and corresponding methods and devices for encoding are provided. The method of reconstructing includes, for each of the non-zero coefficients in the coefficient group, in scan order, decoding a greater-than-one flag for that non-zero coefficient if fewer than two previously-decoded greater-than-one flags for that coefficient group are equal to one. The coefficient levels for the non-zero coefficients are reconstructed based, at least in part, upon the decoded greater-than-one flags.
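The decoding condition can be illustrated with a short loop. The `read_flag` callback stands in for the entropy decoder and is a hypothetical interface, as is the use of `None` to mark coefficients for which no flag is decoded.

```python
def decode_gt1_flags(num_nonzero: int, read_flag):
    """Decode greater-than-one flags for one coefficient group, in scan order.

    read_flag is a hypothetical callable returning the next decoded flag (0 or 1).
    Entries are None where no flag is present in the bitstream.
    """
    flags = []
    ones_seen = 0
    for _ in range(num_nonzero):
        if ones_seen < 2:
            # Fewer than two earlier flags equal one: a flag is decoded.
            flag = read_flag()
            ones_seen += flag
            flags.append(flag)
        else:
            # Two flags equal to one already seen: no flag is decoded here.
            flags.append(None)
    return flags
```

Given six non-zero coefficients and the decoded bits 0, 1, 0, 1, the result is `[0, 1, 0, 1, None, None]`: once the second flag equal to one has been read, no further flags are consumed for the group.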