摘要:
In one aspect of this disclosure, techniques are described for selecting among default weighted prediction, implicit weighted prediction, and explicit weighted prediction. In this context, techniques are also described for adding offset to prediction data, e.g., using the format of explicit weighted prediction to allow for offsets to predictive data that is otherwise determined by implicit or default weighted prediction.
摘要:
This disclosure describes video encoding and decoding techniques in which a first order prediction process and a second order prediction process are used in combination to generate predictive video blocks for video coding. First order prediction may be similar to conventional motion estimation and motion compensation that generates residual video blocks. The second order prediction may involve a process similar to conventional intra-prediction, but is performed on the residual video blocks. The techniques of this disclosure may pre-define the second order prediction to a specific mode, such as a mode similar to the intra-DC mode used in intra coding. In addition, the techniques of this disclosure may combine aspects of the first order and second order prediction into a single process so that the effects of second order prediction on the residuals are taken into account during the first order prediction process, which may improve compression.
摘要:
Techniques are described for encoding and decoding digital video data using macroblocks that are larger than the macroblocks prescribed by conventional video encoding and decoding standards. For example, the techniques include encoding and decoding a video stream using macroblocks comprising greater than 16×16 pixels, for example, 64×64 pixels. In one example, an apparatus includes a video encoder configured to encode a video block having a size of more than 16×16 pixels, generate block-type syntax information that indicates the size of the block, and generate a coded block pattern value for the encoded block, wherein the coded block pattern value indicates whether the encoded block includes at least one non-zero coefficient. The encoder may set the coded block pattern value to zero when the encoded block does not include at least one non-zero coefficient or set the coded block pattern value to one when the encoded block includes a non-zero coefficient.
摘要:
A motion compensated video coding method which can be applied especially in transfer of video streams using low transmission bit rate is presented. In the motion compensated coding method, the motion of picture elements between a piece of reference video information and a piece of current video information is estimated and then modeled using certain basis function and coefficients. The coefficients are quantized, and the quantizer is selected according to a certain selection criterion, for example, based on a target image quality or on a target transmission bit rate. Preferably the selection criterion is such that it automatically adjust the accuracy with which the motion of picture elements is represented to be related to the accuracy with which the prediction error information is represented. A decoding method, an encoder and a corresponding decoder are also described.
摘要:
This disclosure describes techniques for transforming residual blocks of video data. In particular, a plurality of different transforms selectively applied to the residual blocks based on the prediction mode of the video blocks. At least a portion of the plurality of transforms are separable directional transform specifically trained for a corresponding prediction mode to provide better energy compaction for the residual blocks of the given prediction mode. Using separable directional transforms offers the benefits of lower computation complexity and storage requirement than use of non-separable directional transforms. Additionally, a scan order used to scan the coefficients of the residual block may be adjusted when applying separable directional transforms. In particular, the scan order may be adjusted based on statistics associated with one or more previously coded blocks to better ensure that non-zero coefficients are grouped near the front of the one-dimensional coefficient vector to improve the effectiveness of entropy coding.
摘要:
This disclosure describes techniques for scanning coefficients of video blocks, e.g., quantized and transformed coefficients. Rather than use conventional zig-zag scanning, the techniques of this disclosure adapt the scanning order based on statistics associated with previously coded blocks that were coded in the same prediction mode. For each prediction mode, statistics of the coefficients are stored, e.g., indicating probabilities that given coefficients are zero or non-zero. Periodically, adjustments to the scanning order can be made in order to better ensure that non-zero coefficients are grouped together and zero value coefficients are grouped together, which can improve the effectiveness of entropy coding. The techniques of this disclosure provide thresholds and threshold adjustments that can reduce the frequency that the scanning order adjustments occur, yet still achieve desired improvements in compression due to such scanning order adjustments.
摘要:
This disclosure describes techniques for coding an enhancement layer in a scalable video coding (SVC) scheme. The techniques may be used in variable length coding of refinement coefficients of an enhancement layer of a SVC scheme. According to this disclosure, a method may comprise determining first statistics associated with a first type of video block. determining second statistics associated with a second type of video block, selecting a first variable length coding (VLC) table from a plurality of VLC tables to be used in coding the first type of video block based on the first statistics, selecting a second VLC table from the plurality of VLC tables to be used in coding the second type of video block based on the second statistics, coding video blocks of the first type based on the first VLC table, and coding video blocks of the second type based on the second VLC table.
摘要:
This disclosure describes techniques for coding information in a scalable video coding (SVC) scheme that supports spatial scalability. In one example, a method of coding information in a SVC scheme comprises coding first video blocks defined in a base layer of the SVC scheme, and coding second video blocks defined in an enhancement layer of the SVC scheme, wherein the second video blocks provide spatial scalability enhancement relative to the first video blocks. The coding of the second video blocks includes performing inter-layer prediction for the enhancement layer relative to the base layer for a first subset of the second video blocks that fully overlap the first video blocks, and performing inter-layer prediction for the enhancement layer relative to the base layer for a second subset of the second video blocks that partially overlap the first video blocks.
摘要:
This disclosure describes techniques for coding of header information of video blocks. In particular, the techniques of this disclosure select one of a plurality of prediction modes for use in generating a prediction block of a video block of a coding unit, the plurality of prediction modes including unidirectional prediction modes and multi-directional prediction modes that combine at least two unidirectional prediction modes. An encoding device encodes the prediction mode of the current video block based on prediction modes of one or more previously encoded video blocks of the coding unit. Likewise, a decoding unit receives encoded video data of a video block of a coding unit and decodes the encoded video data to identify one of a plurality of prediction modes for use in generating a prediction block of the video block based on prediction modes of one or more previously decoded video blocks of the coding unit.
摘要:
This disclosure describes features and techniques applicable to three-dimensional (3D) video coding. In one example, a technique may include coding a texture view video block, and coding a depth view video block, wherein the depth view video block is associated with the texture view video block. Coding the depth view video block may include coding a syntax element to indicate whether or not motion information associated with the texture view video block is adopted as motion information associated with the depth view video block.