Abstract:
Techniques and systems are provided for processing video data. For example, 360-degree video data can be obtained for processing by an encoding device or a decoding device. The 360-degree video data includes pictures divided into motion-constrained tiles. The 360-degree video data can be used to generate a media file including a plurality of tracks. Each of the plurality of tracks contain a set of at least one of the motion-constrained tiles. The set of at least one of the motion-constrained tiles corresponds to at least one of a plurality of viewports of the 360-degree video data. A first tile representation can be generated for the media file. The first tile representation encapsulates a first track of the plurality of tracks, and the first track includes a first set of at least one of the motion-constrained tiles at a first tile location in the pictures of the 360-degree video data. The first set of at least one of the motion-constrained tiles corresponds to a viewport of the 360-degree video data.
Abstract:
An apparatus for coding video information according to certain aspects includes a memory unit configured to store video information associated with a plurality of layers and a processor. The processor is configured to obtain information associated with a current access unit (AU) to be coded, the current AU containing pictures from one or more layers of the plurality of layers. The processor is further configured to determine whether the current AU includes a first layer containing an intra random access point (IRAP) picture. The process is additionally configured to reset a picture order count (POC) of the second layer at the current AU, in response to determining that the current AU includes (1) a first layer that contains an IRAP picture and (2) a second layer containing no picture or containing a discardable picture.
Abstract:
An apparatus for coding video information according to certain aspects includes a memory unit configured to store video information associated with a plurality of layers and a processor. The processor is configured to obtain information associated with a current access unit (AU) to be coded, the current AU containing pictures from one or more layers of the plurality of layers. The processor is also configured to reset a picture order count (POC) of a layer included in the current AU via (1) resetting only a most significant bit (MSB) of the POC or (2) resetting both the MSB of the POC and a least significant (LSB) of the POC. The processor is further configured to, for pictures in one or more Ails subsequent to the current AU in decoding order: set a value of a first flag indicative whether a reset of the POC is a full reset.
Abstract:
An apparatus configured to code video information in a bitstream includes a memory and a processor in communication with the memory. The memory is configured to store video information associated with a video layer having a current picture. The processor is configured to: determine whether a slice segment header extension associated with the current picture is present in the bitstream; and determine that one or more most significant bits (MSBs) of a picture order count (POC) value associated with the current picture are not present in the bitstream in response to a determination that the slice segment header extension associated with the current picture is not present in the bitstream. The processor may encode or decode the video information in the bitstream.
Abstract:
An apparatus configured to code video information includes a memory unit and a processor in communication with the memory unit. The memory unit is configured to store video information associated with a first video layer having a current picture. The processor is configured to process a first offset associated with the current picture, the first offset indicating a difference between (a) most significant bits (MSBs) of a first picture order count (POC) of a previous picture in the first video layer that precedes the current picture in decoding order and (b) MSBs of a second POC of the current picture.
Abstract:
According to certain aspects, an apparatus for coding video information includes a memory and a processor configured to determine whether a first syntax element is present in a bitstream, the first syntax element associated with a sequence parameter set (SPS) and a first flag indicative of whether a temporal identifier (ID) of a reference picture for pictures that refer to the SPS can be nested; and in response to determining that the first syntax element is not present in the bitstream: obtain a second syntax element indicative of a maximum number of temporal sub-layers in a particular layer of the plurality of layers; and determine whether to set the first flag equal to a second flag indicative of whether a temporal ID of a reference picture for any pictures can be nested based at least in part on a value of the second syntax element.
Abstract:
An apparatus for coding video information according to certain aspects includes a memory and a processor. The memory is configured to store video information associated with one or more layers. The processor is configured to code a current access unit (AU) in a bitstream including a plurality of layers, the plurality of layers including a reference layer and at least one corresponding enhancement layer. The processor is further configured to code a first end of sequence (EOS) network abstraction layer (NAL) unit associated with the reference layer in the current AU, the first EOS NAL unit having the same layer identifier (ID) as the reference layer. The processor is also configured to code a second EOS NAL unit associated with the enhancement layer in the current AU, the second EOS NAL unit having the same layer ID as the enhancement layer.
Abstract:
An apparatus according to certain aspects includes a memory and a processor configured to: in response to determining that a reference layer intra random access point (IRAP) picture starts a new coded video sequence (CVS) at the current AU: determine whether the reference layer IRAP picture activates a new video parameter set (VPS) or defines a prediction boundary at the current AU; in response to determining that the reference layer IRAP picture activates a new VPS or defines a prediction boundary at the current AU, remove pictures in the decoded picture buffer (DPB) associated with each of a plurality of layers; and in response to determining that the reference layer IRAP picture does not activate a new VPS and does not define a prediction boundary at the current AU, remove only pictures in the DPB associated with the reference layer.
Abstract:
An apparatus configured to code video information includes a memory unit and a processor in communication with the memory unit. The memory unit is configured to store video information associated with a video layer having a picture. The processor is configured to determine whether the picture is a non-picture-order-count (POC)-anchor picture, and based on the determination of whether the picture is a non-POC-anchor picture, perform one of (1) refraining from indicating a POC reset in connection with the picture, or (2) indicating the POC reset in connection with the picture. The processor may encode or decode the video information.
Abstract:
Video data bitstreams may contain bitstream conformance parameters, such as hypothetical reference decoder (HRD) parameters, which may be used to allow a decoder to test the conformance of a received bitstream. In multi-layer codecs transmitted using partitions, the video data may be associated with one or more layer sets. Each layer set may be associated with one or more output layer sets. Each output layer set may be further associated with one or more partitioning schemes. Conformance parameters are mapped to partitions of a partitioning scheme, based upon the output layer set that the partitioning scheme is associated with. This allows for a partition to be associated with different conformance parameters, depending upon the output layer set that is being used.