摘要:
This disclosure describes techniques for coding of header information of video blocks. In particular, the techniques of this disclosure select one of a plurality of prediction modes for use in generating a prediction block of a video block of a coding unit, the plurality of prediction modes including unidirectional prediction modes and multi-directional prediction modes that combine at least two unidirectional prediction modes. An encoding device encodes the prediction mode of the current video block based on prediction modes of one or more previously encoded video blocks of the coding unit. Likewise, a decoding unit receives encoded video data of a video block of a coding unit and decodes the encoded video data to identify one of a plurality of prediction modes for use in generating a prediction block of the video block based on prediction modes of one or more previously decoded video blocks of the coding unit.
摘要:
This disclosure describes features and techniques applicable to three-dimensional (3D) video coding. In one example, a technique may include coding a texture view video block, and coding a depth view video block, wherein the depth view video block is associated with the texture view video block. Coding the depth view video block may include coding a syntax element to indicate whether or not motion information associated with the texture view video block is adopted as motion information associated with the depth view video block.
摘要:
In a video processing system, a method and system for applying transforms larger than 8×8 and non-rectangular transforms, and generating transform size syntax elements indicative of the transforms for video decoding are provided. The transform size syntax element may be generated by an encoder based on a prediction block size of a video block and the contents of the video block. Further, the transform size syntax element may be generated according to a set of rules to select from 4×4, 8×8, and larger transform sizes during an encoding process. A decoder may perform an inverse transform based on the transform size syntax element and the rules used by the encoder. The transform size syntax element may be transmitted to the decoder as part of the encoded video bitstream.
摘要:
Aspects of this disclosure relate to a method of coding video data. In an example, the method includes obtaining, from an encoded bitstream and for any view component of a first view, reference view information indicating one or more reference views for predicting view components of the first view. The method also includes including, for decoding a first view component in an access unit and in the first view, one or more reference candidates in a reference picture list, where the one or more reference candidates comprise view components in the access unit and in the reference views indicated by the reference view information, where the number of reference candidates is equal to the number of reference views. The method also includes decoding the first view component based on the one or more reference candidates in the reference picture list.
摘要:
A method for encoding transform coefficients in a video encoding process includes dividing a block of transform coefficients into a plurality of zones, determining a scan order for each of the plurality of zones, and performing a scan on each of the transform coefficients in each of the plurality of zones according to their respective determined scan order. In another example, a method for decoding transform coefficients in a video encoding process includes receiving a one-dimensional array of transform coefficients, determining a scan order for each of a plurality of sections of the one-dimensional array, wherein each section of the one-dimensional array corresponds to one of a plurality of zones defining a block of transform coefficients, and performing a scan on each of the transform coefficients in each of the section of the one dimensional array of zones according to their respective determined scan order.
摘要:
This disclosure describes techniques for coding video data. In particular, this disclosure describes techniques for entropy coding of residual transform coefficients generated by a video coding process. In one example, a method selects a bin 2 context for coding a bin 2 level of one or more transform coefficients in the vector according to the entropy coding process. The method further codes the bin 2 level of one or more transform coefficients in the vector according to the selected bin 2 context. Selecting the bin 2 context comprises selecting the bin 2 context for a current transform coefficient in the vector based on the bin 2 level of one or more previously coded transform coefficients in the vector.
摘要:
This disclosure describes techniques for coding a video block based on an unclipped version of a motion vector predictor candidate. The techniques include determining a motion vector predictor candidate list including motion vector predictor candidates from neighboring video blocks without clipping the motion vector predictor candidates. More specifically, if one of the motion vector predictor candidates points to a prediction block located outside of a reference picture boundary relative to the current video block, the techniques allow an unclipped version of the motion vector predictor candidate to be included in the candidate list. The current video block is then coded based on a determined unclipped motion vector predictor candidate of the candidate list. Elimination of the motion vector predictor candidate clipping process reduces complexity at both the video encoder and the video decoder.
摘要:
A multiplexer may produce an MPEG-2 (Motion Picture Experts Group) System standard bitstream comprising views with nonconsecutive view order indexes. In one example, an apparatus includes a video encoder that encodes views of a scene, and a multiplexer that constructs a data structure for signaling that a corresponding MPEG-2 (Motion Picture Experts Group) System standard bitstream includes a first view of the views of the scene associated with a first view order index and a second view of the plurality of views of the scene associated with a second view order index. The first view order index and the second view order index are non-consecutive. The apparatus may further include an output interface that outputs the data structure.
摘要:
In one example, a device for coding video data includes a video coder (such as a video decoder or a video encoder) configured to code motion information for a current block of video data using a hybrid motion information coding mode, wherein to code the motion information, the video coder is configured to code a merge index syntax element of the motion information in a manner substantially conforming to a merge mode, and code at least one additional syntax element of the motion information in a manner substantially conforming to an advanced motion vector prediction (AMVP) mode, and wherein the video coder is configured to code the current block using the motion information. The hybrid mode may comprise a partial merge mode or a partial AMVP mode.
摘要:
For each prediction unit (PU) belonging to a coding unit (CU), a video coder generates a candidate list. The video coder generates the candidate list such that each candidate in the candidate list that is generated based on motion information of at least one other PU is generated without using motion information of any of the PUs belonging to the CU. After generating the candidate list for a PU, the video coder generates a predictive video block for the PU based on one or more reference blocks indicated by motion information of the PU. The motion information of the PU is determinable based on motion information indicated by a selected candidate in the candidate list for the PU.