Abstract:
A video encoder sends at least some information regarding boundary strength to the decoder along with the bitstream of encoded video. The decoder is configured to use the received boundary strength information from the encoder to reduce the number of computations necessary for the decoder to determine the boundary strength details required for performing deblocking when decoding the bitstream.
Abstract:
A method for entropy coding data using parallel entropy coders to encode an input sequence as a plurality of encoded subsequences, which are then combined to form an output bitstream. The input sequence is separated into subsequences on the basis of a context model. A method for entropy decoding a bitstream of encoded data by extracting a plurality of encoded subsequences from the bitstream and entropy decoding the encoded subsequences in parallel to generate a plurality of decoded subsequences, which are then interleaved based on a context model to produce a reconstructed sequence.
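The separation and interleaving steps can be illustrated with a minimal sketch. It assumes a hypothetical context model in which the context of each bit is simply the previously coded bit; the names `context_of`, `split_by_context`, and `interleave_by_context` are illustrative and do not come from the abstract. The per-subsequence entropy coders are left out, so each subsequence is passed through unchanged.

```python
from collections import defaultdict


def context_of(prev_bit):
    # Hypothetical context model: the context is the previously coded bit.
    return prev_bit


def split_by_context(bits):
    """Separate an input sequence into one subsequence per context."""
    subsequences = defaultdict(list)
    prev = 0  # assumed initial context
    for b in bits:
        subsequences[context_of(prev)].append(b)
        prev = b
    return dict(subsequences)


def interleave_by_context(subsequences):
    """Rebuild the sequence by pulling the next symbol from whichever
    subsequence the context model selects at each step."""
    cursors = {ctx: 0 for ctx in subsequences}
    total = sum(len(s) for s in subsequences.values())
    out = []
    prev = 0  # must match the encoder's initial context
    for _ in range(total):
        ctx = context_of(prev)
        b = subsequences[ctx][cursors[ctx]]
        cursors[ctx] += 1
        out.append(b)
        prev = b
    return out
```

Because the decoder derives each context from symbols it has already reconstructed, the subsequences themselves can be entropy coded and decoded independently, which is what permits the parallelism.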
Abstract:
A method of encoding a video using constrained soft-decision quantization. The soft-decision quantization includes first performing hard-decision quantization to obtain hard quantized coefficients and, then, obtaining a soft quantized coefficient using a rate-distortion calculation over a search range of quantization levels for a transform domain coefficient, wherein the search range of quantization levels for that transform domain coefficient is constrained within a number of quantization levels of a corresponding hard quantized coefficient. The search range may be based upon a fixed threshold, the coefficient position, the hard quantized coefficient magnitude, a threshold value less accumulated distortion, or other factors, including combinations of these factors. The accumulated distortion may be measured by an L1 norm.
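The constrained search can be sketched per coefficient as follows. This is a simplification under stated assumptions: the rate model `rate_bits` is hypothetical, the search range is a fixed threshold `max_delta`, and coefficients are treated independently, whereas a practical encoder would account for dependencies between coefficients introduced by the entropy coder.

```python
def hard_quantize(coeff, step):
    """Hard-decision quantization: round to the nearest level."""
    sign = -1 if coeff < 0 else 1
    return sign * int(abs(coeff) / step + 0.5)


def rate_bits(level):
    # Hypothetical rate model: larger magnitudes cost more bits.
    return 1 + 2 * abs(level).bit_length()


def soft_quantize(coeff, step, lam, max_delta=1):
    """Pick the level minimizing distortion + lam * rate, searching only
    within max_delta levels of the hard-quantized coefficient."""
    hard = hard_quantize(coeff, step)
    best_level, best_cost = hard, None
    for level in range(hard - max_delta, hard + max_delta + 1):
        distortion = (coeff - level * step) ** 2
        cost = distortion + lam * rate_bits(level)
        if best_cost is None or cost < best_cost:
            best_level, best_cost = level, cost
    return best_level
```

With `lam = 0` the result equals the hard-quantized level; as `lam` grows, the search may move to a cheaper neighbouring level, but never further than `max_delta` levels away.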
Abstract:
Methods, systems, and computer programs for encoding and decoding images are described. In some aspects, an input data block and a prediction data block are accessed. A projection factor is generated based on a projection of the input data block onto the prediction data block. A scaled prediction data block is generated by multiplying the projection factor by the prediction data block. A residual data block is generated based on a difference between the input data block and the scaled prediction data block. In some aspects, a prediction data block, a residual data block, and a projection factor associated with the residual data block are accessed. A scaled prediction data block is generated by multiplying the projection factor by the prediction data block. An output data block is generated by summing the residual data block and the scaled prediction data block.
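A minimal sketch of both directions, assuming the projection factor is the ordinary least-squares coefficient (inner product of input and prediction divided by the squared norm of the prediction). The abstract only says the factor is "based on a projection", so that formula and the function names are assumptions. Blocks are flattened to lists of numbers.

```python
def projection_factor(x, p):
    """Assumed form: rho = <x, p> / <p, p>; 0.0 if the prediction is all zero."""
    num = sum(a * b for a, b in zip(x, p))
    den = sum(b * b for b in p)
    return num / den if den else 0.0


def encode_block(x, p):
    """Encoder side: residual = input - rho * prediction."""
    rho = projection_factor(x, p)
    residual = [a - rho * b for a, b in zip(x, p)]
    return rho, residual


def decode_block(p, residual, rho):
    """Decoder side: output = residual + rho * prediction."""
    return [r + rho * b for r, b in zip(residual, p)]
```

Under this assumed form the residual is orthogonal to the prediction block, so scaling the prediction removes as much energy from the residual as any single scalar factor can.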
Abstract:
A transcoder and methods of encoding inter-prediction frames of a downsampled video wherein the downsampled video is a spatially downsampled version of a full-resolution video. Full-resolution motion vectors are downscaled and each downscaled motion vector pinpoints a search area within a reference frame. The union or combination of search areas defines the search field for candidate motion vectors. A motion vector is selected from the candidates based on realizing a minimum rate-distortion cost.
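The search-field construction can be sketched as below. The downsampling ratio, the square search window of `radius` pixels, and the caller-supplied `cost_fn` (standing in for a rate-distortion cost) are all assumptions for illustration.

```python
def downscale_mv(mv, ratio):
    """Scale a full-resolution motion vector to the downsampled grid."""
    return (round(mv[0] / ratio), round(mv[1] / ratio))


def search_field(full_res_mvs, ratio=2, radius=1):
    """Union of the square search areas centred on each downscaled vector."""
    field = set()
    for mv in full_res_mvs:
        cx, cy = downscale_mv(mv, ratio)
        for dx in range(-radius, radius + 1):
            for dy in range(-radius, radius + 1):
                field.add((cx + dx, cy + dy))
    return field


def select_mv(field, cost_fn):
    """Pick the candidate with minimum cost; cost_fn is a hypothetical
    stand-in for a rate-distortion cost such as D + lambda * R."""
    return min(sorted(field), key=cost_fn)
```

Using a set means overlapping search areas contribute each candidate only once, so the cost function is never evaluated twice for the same vector.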
Abstract:
A transcoder and methods of encoding inter-prediction frames of a downsampled video wherein the downsampled video is a spatially downsampled version of a full-resolution video. Full-resolution motion vectors are downscaled and a weighting factor is calculated for each downscaled motion vector based upon the transform domain residual coefficients associated with that full-resolution motion vector. A motion vector prediction is made based on the weighted average using the downscaled motion vectors and their weighting factors.
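A sketch of the weighted-average prediction follows. The abstract does not state how the weight is derived from the residual coefficients; the rule used here, in which a vector with less residual energy receives a larger weight, is purely an assumption, as are the function names.

```python
def weight_from_residual(coeffs):
    # Assumption: smaller residual magnitude implies a more reliable
    # motion vector, hence a larger weight.
    return 1.0 / (1.0 + sum(abs(c) for c in coeffs))


def predict_mv(full_res_mvs, residuals, ratio=2):
    """Weighted average of the downscaled motion vectors.

    full_res_mvs: list of (x, y) full-resolution vectors.
    residuals: one list of transform-domain coefficients per vector.
    """
    weights = [weight_from_residual(c) for c in residuals]
    total = sum(weights)
    x = sum(w * mv[0] / ratio for w, mv in zip(weights, full_res_mvs)) / total
    y = sum(w * mv[1] / ratio for w, mv in zip(weights, full_res_mvs)) / total
    return (x, y)
```

With equal residuals the prediction reduces to the plain mean of the downscaled vectors; unequal residuals pull it toward the vector whose block was predicted more accurately at full resolution.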
Abstract:
An encoder for encoding a downsampled video in which macroblock partitioning is based upon downsampled partitioning of full-resolution macroblocks. The refinement of the downsampled partitioning involves storing the downsampled partitioning in a quad-tree data structure and recursively identifying and evaluating possible mergers of leaf nodes having a common parent node. Using rate-distortion analysis, possible mergers are evaluated and advantageous mergers are performed, thereby creating larger partitions and new leaf nodes. Traversal of the quad-tree to identify possible leaf node mergers stops when no further mergers can be identified that would result in a rate-distortion advantage.
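The recursive merge step can be sketched with a small quad-tree. The `Node` class and the caller-supplied `rd_cost` function are hypothetical; a real encoder would compute the cost from the actual prediction and residual of each partition.

```python
class Node:
    """Quad-tree node: a leaf is a partition, an inner node has four children."""

    def __init__(self, size, children=None):
        self.size = size
        self.children = children or []

    def is_leaf(self):
        return not self.children


def refine(node, rd_cost):
    """Bottom-up merge of sibling leaves when merging lowers the cost.

    rd_cost(leaf) is a hypothetical rate-distortion cost of coding that
    leaf as one partition. Returns the cost of the refined subtree.
    """
    if node.is_leaf():
        return rd_cost(node)
    split_cost = sum(refine(child, rd_cost) for child in node.children)
    if all(child.is_leaf() for child in node.children):
        merged_cost = rd_cost(Node(node.size))
        if merged_cost < split_cost:
            node.children = []  # the parent becomes a new, larger leaf
            return merged_cost
    return split_cost
```

A merger is only considered once all four siblings are leaves, so a merge low in the tree can make a further merge possible at the parent, and the recursion ends on its own when no remaining merger reduces the cost.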
Abstract:
A transcoder and method of transcoding in which inter-prediction frames of a downsampled video are encoded using downsampled full-resolution residuals in the rate-distortion analysis used to locate a motion vector for a given partition in the downsampled video.
Abstract:
Methods and devices for reconstructing coefficient levels from a bitstream of encoded video data for a coefficient group in a transform unit. Sign bits are hidden in the parity of partially overlapping subsets of a set of coefficients. This enables the hiding of multiple sign bits per coefficient group. Other information bits may be hidden instead of sign bits in some cases.
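A decoder-side sketch of parity-based sign recovery with overlapping subsets is given below. The choice of which coefficient receives each hidden sign, the subset boundaries, and the convention that even parity means positive are assumptions made for illustration; the abstract does not fix them.

```python
def hidden_sign(abs_levels, subset):
    """Infer a sign from the parity of the summed magnitudes in a subset.
    Assumed convention: even sum -> positive, odd sum -> negative."""
    parity = sum(abs_levels[i] for i in subset) % 2
    return -1 if parity else 1


def reconstruct(abs_levels, explicit_signs, hidden):
    """Rebuild signed coefficient levels for one coefficient group.

    explicit_signs: dict index -> +1/-1 for signs sent in the bitstream.
    hidden: dict index -> iterable of indices whose parity carries that
            coefficient's sign; the subsets may partially overlap.
    """
    levels = []
    for i, mag in enumerate(abs_levels):
        if i in hidden:
            sign = hidden_sign(abs_levels, hidden[i])
        else:
            sign = explicit_signs[i]
        levels.append(sign * mag)
    return levels
```

For example, with eight coefficients, subsets covering indices 0 to 5 and 3 to 7 overlap on indices 3 to 5 and carry two hidden signs in a single group.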