Abstract:
Methods of encoding and decoding for video data are described in which multi-level significance maps are used in the encoding and decoding processes. The significant-coefficient flags that form the significance map are grouped into contiguous groups, and a significant-coefficient-group flag signifies for each group whether that group contains no non-zero significant-coefficient flags. If there are no non-zero significant-coefficient flags in the group, then the significant-coefficient-group flag is set to zero. The set of significant-coefficient-group flags is encoded in the bitstream. Any significant-coefficient flags that fall within a group that has a significant-coefficient-group flag that is non-zero are encoded in the bitstream, whereas significant-coefficient flags that fall within a group that has a significant-coefficient-group flag that is zero are not encoded in the bitstream.
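The two-level scheme can be sketched as follows. This is a minimal illustration under stated assumptions, not the claimed method: the square `group_size` blocks, the raster traversal order, and the function names `encode_significance_map` and `decode_significance_map` are all hypothetical, and the entropy coding of the flags is omitted.

```python
def encode_significance_map(sig_map, group_size=4):
    """Split a 2-D 0/1 significance map into square groups (an assumption).

    Returns (group_flags, coded_flags). Flags belonging to a group whose
    group flag is 0 are not emitted at all.
    """
    rows, cols = len(sig_map), len(sig_map[0])
    group_flags, coded_flags = [], []
    for gr in range(0, rows, group_size):
        for gc in range(0, cols, group_size):
            block = [sig_map[r][c]
                     for r in range(gr, min(gr + group_size, rows))
                     for c in range(gc, min(gc + group_size, cols))]
            flag = 1 if any(block) else 0
            group_flags.append(flag)
            if flag:
                coded_flags.extend(block)
    return group_flags, coded_flags


def decode_significance_map(group_flags, coded_flags, rows, cols, group_size=4):
    """Rebuild the map; groups with a zero group flag are inferred as all zero."""
    sig_map = [[0] * cols for _ in range(rows)]
    flags = iter(coded_flags)
    group_index = 0
    for gr in range(0, rows, group_size):
        for gc in range(0, cols, group_size):
            if group_flags[group_index]:
                for r in range(gr, min(gr + group_size, rows)):
                    for c in range(gc, min(gc + group_size, cols)):
                        sig_map[r][c] = next(flags)
            group_index += 1
    return sig_map
```

For an 8x8 map whose non-zero flags all sit in one 4x4 group, only 4 group flags and 16 significant-coefficient flags are emitted instead of 64.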
Abstract:
A method for parallel context modeling through reordering the bits of an input sequence to form groups of bits in accordance with a context model-specific reordering schedule. The reordering schedule is developed such that the groups of bits are formed to satisfy two conditions: first, that the context for each bit in a group is different from the context of every other bit in that group; and second, that the context of each bit in that group is determined independently of the other bits in that group. The parallel context modeling may be used in encoding or decoding operations.
Abstract:
A transcoder and methods of encoding inter-prediction frames of a downsampled video wherein the downsampled video is a spatially downsampled version of a full-resolution video. Full-resolution motion vectors are downscaled and each downscaled motion vector pinpoints a search area within a reference frame. The union or combination of search areas defines the search field for candidate motion vectors. A motion vector is selected from the candidates based on realizing a minimum rate-distortion cost.
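The search-field construction might look like the sketch below. It is only an illustration: the downsampling `ratio`, the square window of half-width `radius`, and the caller-supplied `cost` function (standing in for a real rate-distortion cost) are assumptions that the abstract does not specify.

```python
def candidate_search_field(full_res_mvs, ratio=2, radius=1):
    """Union of the search areas centred on each downscaled motion vector."""
    field = set()
    for mvx, mvy in full_res_mvs:
        # Downscale the full-resolution vector (rounding rule is an assumption).
        cx, cy = round(mvx / ratio), round(mvy / ratio)
        for dx in range(-radius, radius + 1):
            for dy in range(-radius, radius + 1):
                field.add((cx + dx, cy + dy))
    return field


def select_motion_vector(field, cost):
    """Pick the candidate with the minimum cost; ties resolve to the smallest tuple."""
    return min(sorted(field), key=cost)
```

With full-resolution vectors `(4, 2)` and `(-6, 0)` and the defaults above, the field is the union of two disjoint 3x3 windows centred on `(2, 1)` and `(-3, 0)`.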
Abstract:
A transcoder and methods of encoding inter-prediction frames of a downsampled video wherein the downsampled video is a spatially downsampled version of a full-resolution video. Full-resolution motion vectors are downscaled and a weighting factor is calculated for each downscaled motion vector based upon the transform domain residual coefficients associated with that full-resolution motion vector. A motion vector prediction is made based on the weighted average using the downscaled motion vectors and their weighting factors.
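A weighted-average prediction of this kind can be sketched as follows. The abstract does not say how the weighting factor is derived from the transform domain residual coefficients, so the inverse-energy weight used here is purely an assumption for illustration, as are the function name and the `ratio` parameter.

```python
def weighted_mv_prediction(full_res_mvs, residual_coeffs, ratio=2):
    """Weighted average of downscaled motion vectors.

    Assumption: a vector whose residual has less energy is trusted more,
    so its weight is 1 / (1 + sum of squared coefficients).
    """
    weights = [1.0 / (1.0 + sum(c * c for c in coeffs))
               for coeffs in residual_coeffs]
    total = sum(weights)
    pred_x = sum(w * mvx / ratio
                 for w, (mvx, _) in zip(weights, full_res_mvs)) / total
    pred_y = sum(w * mvy / ratio
                 for w, (_, mvy) in zip(weights, full_res_mvs)) / total
    return pred_x, pred_y
```

With equal residual energies the result is the plain average of the downscaled vectors; a vector with larger residual energy pulls the prediction less.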
Abstract:
A method and system for a low-complexity Slepian-Wolf rate estimator in a hybrid Wyner-Ziv video encoder that determines the minimum Slepian-Wolf code rate required to allow correct decoding. The Slepian-Wolf estimator does not assume ideal source and side-information statistics and does not require a feedback channel from the decoder to the encoder in order to determine the correct Slepian-Wolf coding rate. Instead, it adapts to the statistical properties of the video stream. The Slepian-Wolf estimator provides efficient compression performance while avoiding Slepian-Wolf decoding failures.
Abstract:
A method for encoding an input sequence of symbols. The method includes, sequentially, for each symbol in the input sequence, determining an estimated probability for that symbol based on a context model, identifying a codeword associated with a sequence of symbols resulting from appending that symbol to a previous sequence of symbols associated with that estimated probability, using an encoding tree associated with that estimated probability, and storing the codeword in a buffer element of a first-in-first-out buffer, wherein the buffer element is associated with the estimated probability. Stored codewords are output from the buffer in a first-in-first-out order.
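The buffering behaviour can be sketched as below. This is a simplified reading, not the claimed implementation: it assumes a buffer element is reserved when the first symbol of a new codeword arrives for a given probability, that each encoding tree is supplied as a dictionary from complete symbol sequences (leaves) to codewords, and that the per-symbol probability labels are given in advance. The names `encode`, `trees`, and `probs` are hypothetical.

```python
from collections import deque


def encode(symbols, probs, trees):
    """Encode symbols with per-probability trees and a first-in-first-out buffer.

    symbols: iterable of "0"/"1" strings.
    probs:   probability label for each symbol (from a context model).
    trees:   {label: {leaf symbol sequence: codeword}}.
    Returns (codewords output so far, number of elements still buffered).
    """
    fifo = deque()   # buffer elements in order of allocation
    open_elem = {}   # label -> [buffer element, partial symbol sequence]
    out = []
    for sym, prob in zip(symbols, probs):
        if prob not in open_elem:
            elem = {"prob": prob, "codeword": None}
            fifo.append(elem)
            open_elem[prob] = [elem, ""]
        entry = open_elem[prob]
        entry[1] += sym
        if entry[1] in trees[prob]:      # reached a leaf: codeword complete
            entry[0]["codeword"] = trees[prob][entry[1]]
            del open_elem[prob]
        # Output only from the head, so codewords leave in allocation order.
        while fifo and fifo[0]["codeword"] is not None:
            out.append(fifo.popleft()["codeword"])
    return out, len(fifo)
```

Note that a codeword completed later in time can still be output first if its buffer element was allocated earlier, which is what keeps the output order predictable for a decoder.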
Abstract:
Methods and systems are described for combining sources in a data compression system. In a system in which a context model results in the production of multiple sources or symbols, each source being associated with a probability estimate, sources may be combined by defining another, reduced-size set of probabilities. The new set of probabilities may be a subset of the predefined set of probabilities provided by the context model. Minimizing relative entropy may be a basis upon which to define a mapping of predefined probabilities to the new set of probabilities. An input sequence that was modeled using the context model may then be entropy encoded and entropy decoded using the new set of probabilities based on a mapping between the new probabilities and the predefined probabilities.
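One way to build such a mapping is sketched below, assuming binary sources and a per-probability choice that minimizes the relative entropy D(p || q) of using q in place of p. The abstract does not fix the direction of the divergence or any weighting by source frequency, so both are assumptions here, and the function names are hypothetical.

```python
import math


def binary_relative_entropy(p, q):
    """D(p || q) in bits for two binary sources; requires 0 < p, q < 1."""
    return p * math.log2(p / q) + (1 - p) * math.log2((1 - p) / (1 - q))


def map_probabilities(predefined, reduced):
    """Map each predefined probability to its closest reduced-set probability."""
    return {p: min(reduced, key=lambda q: binary_relative_entropy(p, q))
            for p in predefined}
```

For example, with predefined probabilities `[0.5, 0.4, 0.2, 0.1]` and the reduced subset `[0.5, 0.2]`, the sources at 0.4 and 0.1 are folded into 0.5 and 0.2 respectively.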
Abstract:
A method for entropy coding data using parallel entropy coders to encode an input sequence as a plurality of encoded subsequences, which are then combined to form an output bitstream. The input sequence is separated into subsequences on the basis of a context model. A method for entropy decoding a bitstream of encoded data by extracting a plurality of encoded subsequences from the bitstream and entropy decoding the encoded subsequences in parallel to generate a plurality of decoded subsequences, which are then interleaved based on a context model to produce a reconstructed sequence.
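The separation and interleaving steps can be sketched as follows. The sketch leaves out the entropy coders themselves and assumes the context of every symbol is available as a list; in a real decoder the context would typically be derived from previously decoded symbols. The function names are hypothetical.

```python
def demultiplex(symbols, contexts):
    """Split the input into one subsequence per context, preserving order."""
    subsequences = {}
    for sym, ctx in zip(symbols, contexts):
        subsequences.setdefault(ctx, []).append(sym)
    return subsequences


def interleave(subsequences, contexts):
    """Rebuild the original order by drawing from the subsequence of each context."""
    readers = {ctx: iter(seq) for ctx, seq in subsequences.items()}
    return [next(readers[ctx]) for ctx in contexts]
```

Because each subsequence depends only on its own context, the subsequences can be handed to independent entropy coders that run in parallel.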
Abstract:
An encoder for encoding a downsampled video in which macroblock partitioning is based upon downsampled partitioning of full-resolution macroblocks. The refinement of the downsampled partitioning involves storing the downsampled partitioning in a quad-tree data structure and recursively identifying and evaluating possible mergers of leaf nodes having a common parent node. Possible mergers are evaluated using rate-distortion analysis, and advantageous mergers are performed, thereby creating larger partitions and new leaf nodes. The traversal of the quad-tree to identify possible leaf node mergers stops when no further mergers can be identified that would result in a rate-distortion advantage.
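The merge procedure can be sketched with a small recursive routine. This is an illustrative sketch only: it assumes each node already carries the rate-distortion cost of coding its region as a single partition (the hypothetical `cost` attribute), whereas a real encoder would compute that cost on demand.

```python
class Node:
    """Quad-tree node; `cost` is the assumed RD cost of coding it unsplit."""

    def __init__(self, cost, children=None):
        self.cost = cost
        self.children = children or []

    def is_leaf(self):
        return not self.children


def merge_leaves(node):
    """Merge sibling leaves bottom-up wherever merging lowers the RD cost.

    Returns the total cost of the subtree after merging.
    """
    if node.is_leaf():
        return node.cost
    child_total = sum(merge_leaves(child) for child in node.children)
    # Only leaves sharing this parent may merge, and only when it pays off.
    if all(child.is_leaf() for child in node.children) and node.cost < child_total:
        node.children = []   # the parent becomes a new, larger leaf
        return node.cost
    return child_total
```

A merged parent becomes a leaf itself, so it is automatically considered for a further merger one level up; the recursion ends once no merger offers a cost advantage.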
Abstract:
An encoder for encoding a video by performing motion estimation using predicted residuals in the rate-distortion cost expression. The predicted residuals are determined by finding a predicted motion vector with respect to the original reference frame and calculating the corresponding residual values. The actual motion vector is then selected based on minimization of the rate-distortion cost expression with respect to the reconstructed reference frame, where the rate-distortion cost expression includes the predicted residuals. In particular, the cost expression includes reconstructed predicted residuals within the distortion term.