Abstract:
There are provided scalable video encoders and decoders, and corresponding scalable video encoding and decoding methods. A scalable video encoder includes an encoder for encoding a block in an enhancement layer of a picture by applying a same weighting parameter to an enhancement layer reference picture as that applied to a lower layer reference picture used for encoding a block in a lower layer of the picture. The block in the enhancement layer corresponds to the block in the lower layer, and the enhancement layer reference picture corresponds to the lower layer reference picture. A scalable video decoder includes a decoder for decoding a block in an enhancement layer of a picture by applying a same weighting parameter to an enhancement layer reference picture as that applied to a lower layer reference picture used for decoding a block in a lower layer of the picture. The block in the enhancement layer corresponds to the block in the lower layer, and the enhancement layer reference picture corresponds to the lower layer reference picture.
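The reuse of a weighting parameter across layers can be illustrated with a minimal sketch. It assumes an H.264-style explicit weighted prediction formula (weight, offset, and a log2 denominator) and 8-bit samples. The function name, the parameter dictionary, and the sample values are hypothetical and are not taken from the abstract.

```python
def weighted_prediction(ref_block, weight, offset, log_wd=5):
    """Apply one weight/offset pair to every sample of a reference block.

    Assumed form (H.264-style explicit weighting, 8-bit samples):
        pred = clip(((ref * weight + rounding) >> log_wd) + offset)
    """
    rounding = 1 << (log_wd - 1) if log_wd > 0 else 0
    return [
        [min(255, max(0, ((p * weight + rounding) >> log_wd) + offset)) for p in row]
        for row in ref_block
    ]


# Hypothetical parameters chosen for the lower layer reference picture.
lower_layer_params = {"weight": 24, "offset": 3, "log_wd": 5}

lower_ref = [[100, 120], [140, 160]]        # lower layer reference samples
enhancement_ref = [[102, 118], [141, 163]]  # corresponding enhancement layer samples

# The same parameters are applied to both reference pictures.
lower_pred = weighted_prediction(lower_ref, **lower_layer_params)
enhancement_pred = weighted_prediction(enhancement_ref, **lower_layer_params)
```

In this sketch the enhancement layer takes its parameters directly from the lower layer instead of carrying its own set.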
Abstract:
There is disclosed a video encoder and corresponding method for encoding video data for an image block. The video encoder performs a mode decision by performing initial motion estimation on only a subset of possible block sizes to output motion information corresponding thereto, and determining, based upon the motion information corresponding to only the subset of possible block sizes and upon other image-related analysis data, whether other block sizes are to be evaluated.
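One way to picture such an early mode decision is sketched below. The choice of 16x16 and 8x8 as the initial subset, the use of texture variance as the image-related analysis data, and both thresholds are assumptions made for illustration only. The abstract does not specify them.

```python
def needs_other_block_sizes(mv_16x16, mvs_8x8, texture_variance,
                            mv_spread_thresh=2, variance_thresh=100.0):
    """Decide whether the remaining block sizes should be evaluated.

    mv_16x16: (dx, dy) found by motion estimation on the 16x16 block.
    mvs_8x8:  list of four (dx, dy) vectors for the 8x8 sub-blocks.
    texture_variance: hypothetical image-analysis measure for the block.
    """
    # Largest L1 distance between any 8x8 vector and the 16x16 vector.
    spread = max(abs(dx - mv_16x16[0]) + abs(dy - mv_16x16[1])
                 for dx, dy in mvs_8x8)
    # Divergent motion or busy texture suggests finer partitions may pay off.
    return spread > mv_spread_thresh or texture_variance > variance_thresh


# Uniform motion on a smooth block: the remaining sizes are skipped.
skip_case = needs_other_block_sizes((1, 1), [(1, 1)] * 4, 10.0)
# One sub-block moves differently: the remaining sizes are evaluated.
search_case = needs_other_block_sizes((1, 1), [(1, 1), (5, 1), (1, 1), (1, 1)], 10.0)
```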
Abstract:
There are provided video encoders and corresponding methods for encoding video data for an image that is divisible into super-macroblocks (super-MBs). A video encoder includes an encoder for classifying a super-MB in the image with respect to one of a frame mode or a field mode using a band-pass/high-pass filter applied vertically to the image.
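A frame/field classification of this kind might look like the following sketch. The [-1, 2, -1] filter taps, the comparison of adjacent-line energy against same-parity-line energy, and the decision rule are assumptions. The abstract states only that a band-pass/high-pass filter is applied vertically.

```python
def vertical_highpass_energy(block, step):
    """Mean absolute response of a vertical [-1, 2, -1] filter.

    step=1 filters adjacent lines (frame structure); step=2 filters lines
    of the same parity (field structure).
    """
    rows, cols = len(block), len(block[0])
    total, count = 0, 0
    for r in range(step, rows - step):
        for c in range(cols):
            total += abs(-block[r - step][c] + 2 * block[r][c] - block[r + step][c])
            count += 1
    return total / count if count else 0.0


def classify_super_mb(block):
    """Return "field" when adjacent lines differ more than same-parity lines."""
    frame_energy = vertical_highpass_energy(block, 1)
    field_energy = vertical_highpass_energy(block, 2)
    return "field" if frame_energy > field_energy else "frame"


# Interlace combing: alternating lines come from different time instants.
combed = [[0] * 4 if r % 2 == 0 else [100] * 4 for r in range(8)]
# A static horizontal edge, as found in progressive content.
edge = [[0] * 4 for _ in range(4)] + [[100] * 4 for _ in range(4)]
```

Here `classify_super_mb(combed)` yields `"field"` and `classify_super_mb(edge)` yields `"frame"`. The 8x4 test blocks are far smaller than a real super-MB and serve only to exercise the rule.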
Abstract:
There are provided methods and apparatus for multi-view video coding. A video encoder includes an encoder for encoding a block in a picture by choosing between temporal prediction and cross-view prediction to enable a prediction for the block. The picture is one of a set of pictures corresponding to multi-view video content and having different view points with respect to a same or similar scene. The picture represents one of the different view points. A high-level syntax is used to indicate the use of cross-view prediction for the block.
Abstract:
There is disclosed and described a decoder and decoding method for decoding at least one picture corresponding to at least one of at least two views of multi-view video content from a bitstream, wherein in the bitstream at least one of coding order information and output order information for the at least one picture is decoupled from the at least one view to which the at least one picture corresponds. Furthermore, there is disclosed and described an encoder and encoding method for encoding at least one picture corresponding to at least one of at least two views of multi-view video content to form a resultant bitstream, wherein in the resultant bitstream at least one of coding order information and output order information for the at least one picture is decoupled from the at least one view to which the at least one picture corresponds.
Abstract:
There are provided methods and apparatus for transform selection in video coding. An apparatus includes a video encoder for encoding at least a block in a picture by selecting a transform to apply to a residue of the block from a set of two or more available transforms. The transform is selected based on at least one of an inter prediction mode used to predict at least one reference for the block, one or more values corresponding to a motion vector, a value of a residue of one or more previously encoded blocks, a value of prediction data for the block, one or more transform selections of one or more neighboring reconstructed blocks, and a quantization step applied to transform coefficients for the residue of the block.
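A rule-based selection over some of the listed criteria could be sketched as follows. The three transform sizes, the thresholds, and the rule ordering are hypothetical. The sketch uses only the inter partition size, the motion vector, and the quantization step, which are three of the criteria named in the abstract.

```python
def select_transform(partition, mv, qstep):
    """Pick a transform size label for a block's residue.

    partition: (width, height) of the inter prediction partition.
    mv:        (dx, dy) motion vector, in arbitrary sub-pel units.
    qstep:     quantization step applied to the transform coefficients.
    All thresholds below are illustrative assumptions.
    """
    width, height = partition
    max_size = min(width, height)       # transform must fit in the partition
    mv_magnitude = abs(mv[0]) + abs(mv[1])

    # Large partition, small motion, coarse quantization: large transform.
    if max_size >= 16 and mv_magnitude <= 4 and qstep >= 20:
        return "16x16"
    if max_size >= 8:
        return "8x8"
    return "4x4"


choice_static = select_transform((16, 16), (1, 1), 26)   # "16x16"
choice_moving = select_transform((16, 16), (10, 0), 26)  # "8x8"
choice_small = select_transform((8, 4), (0, 0), 26)      # "4x4"
```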
Abstract:
There are provided methods and apparatus for stereoscopic video coding using scalable video coding. A scalable video encoder includes an encoder (100) for encoding at least two views corresponding to multi-view video content by encoding a particular view of the at least two views as a base layer, and encoding each of at least one other view of the at least two views as an enhancement layer using a prediction from a lower layer corresponding to at least one of the particular view and the at least one other view. The at least two views are encoded based on a selection from among at least two of temporal, spatial, and signal to noise ratio scalability techniques.
Abstract:
The present principles relate to a hypothetical reference decoder (HRD) for a Scalable Video Coding (SVC) extension for a compression algorithm. One implementation proposes to modify the H.264/AVC HRD for use with the SVC extension of AVC. That implementation defines HRD constraints for each interoperability point of SVC. One implementation in particular is described, but other implementations are possible and are contemplated by the present principles. The changes for spatial, temporal, and SNR scalability are shown, as are the resulting changes to the related HRD parameters. The several mentioned implementations provide rules for an HRD for SVC. At least one implementation proposes the SVC-HRD rules as modifications to the AVC-HRD rules. A user may use the proposed SVC-HRD rules to build an SVC-HRD and test a bitstream for SVC compliance.
Abstract:
There are provided methods and apparatus for video coding using prediction data refinement. An apparatus includes an encoder for encoding an image region of a picture. The encoder has a prediction refinement filter for refining at least one of an intra prediction and an inter prediction for the image region. The prediction refinement filter refines the inter prediction for the image region using at least one of previously decoded data and previously encoded data, the previously decoded data and the previously encoded data corresponding to pixel values in neighboring regions with respect to the image region.