摘要:
During a video encoding process, rectangular prediction units (PUs) for a coding unit (CU) are generated. Furthermore, a geometric partitioning mode is used to generate a first and a second geometric PU for the CU. The first and second geometric PUs are associated with different geometric partitions of a sample block of the CU. One of the rectangular PUs is identified as overlapping the first geometric PU. The motion vector of the identified rectangular PU is used to identify a given area of a reference frame. The given area of a reference frame is then used as a starting point of a search to identify a reference sample for the first geometric PU. A motion vector for the first geometric PU indicates a position of the reference sample relative to a position of the first geometric PU. A prediction block is generated using the motion vector for first geometric PU.
摘要:
During a video encoding process, a video encoder may transform the residual data of a coding unit (CU) using a single transform if the CU only has a single prediction unit (PU). If the CU has multiple PUs, the video encoder may transform the residual data using multiple transforms. The video encoder outputs an indication of the size of a transform used to transform residual data of the CU only when the CU has more than one PU. If a video decoder receives such an indication, the video decoder may reconstruct residual data of the CU using a transform of the indicated size. Otherwise, the video decoder may reconstruct the residual data of the CU using a transform having same size as the CU.
摘要:
In one example, an apparatus includes a processor configured to receive video data for two or more views of a scene, determine horizontal locations of camera perspectives for each of the two or more views, assign view identifiers to the two or more views such that the view identifiers correspond to the relative horizontal locations of the camera perspectives, form a representation comprising a subset of the two or more views, and, in response to a request from a client device, send information indicative of a maximum view identifier and a minimum view identifier for the representation to the client device.
摘要:
In one example, an apparatus for encoding video data includes a video encoder configured to calculate a residual block for a block of video data based on a predicted block formed using an intra-prediction mode, and transform the residual block using a transform mapped from the intra-prediction mode. In another example, an apparatus includes video encoder configured to receive an indication of a first intra-prediction mode in a first set of intra-prediction modes for a block of video data, determine a second intra-prediction mode from a second set of intra-prediction modes, smaller than the first set of intra-prediction modes, to which the first intra-prediction mode is mapped, determine a directional transform to which the second intra-prediction mode is mapped, and apply the directional transform to residual data of the block.
摘要:
In one example, an apparatus includes a video encoder configured to partition a block of video data into a first partition and a second partition using a geometric motion partition line, calculate a prediction value of a pixel in a transition region of the block using a filter that applies a value for at least one neighboring pixel from the first partition and a value for at least one neighboring pixel from the second partition, calculate a residual value of the pixel in the transition region of the block based on the prediction value of the pixel in the transition region, and output the residual value of the pixel. In one example, a video decoder may use a similar filter to decode an the encoded block after receiving the residual value for the encoded block, and using a definition of the geometric motion partition line.
摘要:
In one example, an apparatus includes a video encoder configured to partition a block of video data into a first partition and a second partition using a geometric motion partition line, calculate a slope value and a y-intercept value of the geometric motion partition line, wherein the slope value and the y-intercept value comprise integer values, calculate a mask indicative of pixels of the block in the first partition and pixels of the block in the second partition, encode the first partition and the second partition based on the mask, and output the encoded first partition, the encoded second partition, the slope value, and the y-intercept value. This may allow for a fixed point implementation. A video decoder may receive the slope and y-intercept values to calculate the mask and decode the block based on the mask.
摘要:
A video coding unit may be configured to encode or decode chrominance blocks of video data by reusing motion vectors for corresponding luminance blocks. A motion vector may have greater precision for chrominance blocks than luminance blocks, due to downsampling of chrominance blocks relative to corresponding luminance blocks. The video coding unit may interpolate values for a reference chrominance block by selecting interpolation filters based on the position of the pixel position pointed to by the motion vector. For example, a luminance motion vector may have one-quarter-pixel precision and a chrominance motion vector may have one-eighth-pixel precision. There may be interpolation filters associated with the quarter-pixel precisions. The video coding unit may use interpolation filters either corresponding to the pixel position or neighboring pixel positions to interpolate a value for the pixel position pointed to by the motion vector.
摘要:
Source and destination video devices may use data structures that signal details of an operation point for an MPEG-2 (Motion Picture Experts Group) System bitstream. In one example, an apparatus includes a multiplexer that constructs a data structure corresponding to a multiview video coding (MVC) operation point of an MPEG-2 (Motion Picture Experts Group) System standard bitstream, wherein the data structure signals a rendering capability value that describes a rendering capability to be satisfied by a receiving device to use the MVC operation point, a decoding capability value that describes a decoding capability to be satisfied by the receiving device to use the MVC operation point, and a bitrate value that describes a bitrate of the MVC operation point, and that includes the data structure as part of the bitstream, and an output interface that outputs the bitstream comprising the data structure.
摘要:
This disclosure describes video encoding and decoding techniques in which a first order prediction process and a second order prediction process are used in combination to generate predictive video blocks for video coding. First order prediction may be similar to conventional motion estimation and motion compensation that generates residual video blocks. The second order prediction may involve a process similar to conventional intra-prediction, but is performed on the residual video blocks. The techniques of this disclosure may pre-define the second order prediction to a specific mode, such as a mode similar to the intra-DC mode used in intra coding. In addition, the techniques of this disclosure may combine aspects of the first order and second order prediction into a single process so that the effects of second order prediction on the residuals are taken into account during the first order prediction process, which may improve compression.
摘要:
A demultiplexer may assemble view components of sub-bitstreams. In one example, an apparatus comprises a demultiplexer that produces a multiview video coding (MVC) standard compliant bitstream from a received bitstream comprising a primary sub-bitstream and an embedded sub-bitstream. To produce the MVC standard compliant bitstream, the demultiplexer determines whether a view component of the primary sub-bitstream has a view order index that is greater than a view order index of a view component of the embedded sub-bitstream, and to add the view component from the sub-bitstream for which the view order index is lower to the produced bitstream. The received bitstream may comprise delimiter network abstraction layer (NAL) units between each view component to differentiate the view components. The apparatus may further comprise a video decoder to decode the bitstream produced by the demultiplexer.