Abstract:
In various implementations, techniques are described herein to resolve issues related to the location of an EOB NAL unit in an ISO base media file. In various implementations, these techniques include relaxing the requirement that a file include only one EOB NAL unit. These techniques further include providing the EOB NAL unit from the highest layer when reconstructing the bitstream. This EOB NAL unit can be added to a last access unit in the reconstructed bitstream. When the EOB NAL unit from the highest layer is not present, an EOB NAL unit from a lower layer can be provided instead.
Abstract:
Methods for defining decoder capability for decoding multi-layer bitstreams containing video information, in which the decoder is implemented based on multiple single-layer decoder cores, are disclosed. In one aspect, the method may include identifying at least one allocation of layers of the bitstream into at least one set of layers. The method may further include detecting whether each set of layers is capable of being exclusively assigned to one of the decoder cores for the decoding of the bitstream. The method may also include determining whether the decoder is capable of decoding the bitstream based at least in part on detecting whether each set of layers is capable of being exclusively assigned to one of the decoder cores.
Abstract:
A computing device obtains a Network Abstraction Layer (NAL) unit header of a NAL unit of the multi-layer video data. The NAL unit header comprises a layer identifier syntax element having a value that specifies an identifier of a layer of the NAL unit. The layer identifier syntax element comprises a plurality of bits that represent the value within a defined range of values. A requirement of the bitstream conforming to a video coding standard is that the value of the layer identifier syntax element is less than the maximum value of the range of values.
Abstract:
In an example, a method of decoding video data includes decoding, by a video decoder, a first picture of multi-layer video data, where the first picture has a first picture type. The method also includes, based on the first picture having the first picture type, decoding, by the video decoder, data representative of picture order count (POC) most significant bits (MSB) value for the first picture.
Abstract:
A computing device generates a file that comprises a track box that contains metadata for a track in the file. Media data for the track comprises a sequence of samples. Each of the samples is a video access unit of multi-layer video data. As part of generating the file, the computing device generates, in the file, an additional box that documents all of the samples containing at least one Intra Random Access Point (IRAP) picture.
Abstract:
A device generates, in a file storing a multi-layer bitstream, a track box that contains metadata for a track. The device generates, in the track box, a sample description box containing a sample group description entry. Additionally, the device generates, in the track box, a sample-to-group box for the track. The sample-to-group box mapping samples of the track into a sample group. The sample-to-group box specifies target layers among layers present in the track. Each of the target layers contains at least one picture belonging to a particular picture type. The sample group is one of a temporal sub-layer access sample group and the particular picture type is a temporal sub-layer access picture type, or a stepwise temporal sub-layer access sample group and the particular picture type is a step-wise temporal sub-layer access picture type.
Abstract:
A video processing device includes a memory storing video data and one or more processors configured to: receive a first network abstraction layer (NAL) unit comprising a first picture of an access unit; in response to determining the first NAL unit comprises an intra random access point (IRAP) picture and in response to a NAL unit type for the first NAL unit indicating the presence of an instantaneous decoding refresh (IDR) picture without any associated leading pictures for a second NAL unit of the access unit comprising another IRAP picture, determine a NAL unit type for the second NAL unit to be a NAL unit type indicating the presence of an IDR picture without any associated leading pictures; and, process the first NAL unit and the second NAL unit based on the NAL unit type for the second NAL unit.
Abstract:
A device for processing a bitstream including video data, such as a demultiplexer, extracts a descriptor from the bitstream, wherein the bitstream includes layers of video data for operation points, separate from the descriptor, such that each operation point includes one or more of the layers of video data, and wherein the descriptor includes a set of profile, tier, and level (PTL) structures and data that associates each of the layers of each of the operation points with a corresponding one of the PTL structures, extracts video data for one of the operation points from the bitstream based at least in part on the PTL structures to which the layers of the one of the operation points correspond, and provides the extracted video data to a video decoder.
Abstract:
A device for encoding video data includes a memory configured to store at least a portion of a multi-layer bitstream of video data and one or more processors configured to determine that a supplemental enhancement information (SEI) message applies to more than one layer of the multi-layer video data; determine that the SEI message references a syntax element that has multiple instances associated with the more than one layer of the multi-layer video data; and set all of the multiple instances to a same value.
Abstract:
Techniques and systems are provided for decoding and processing video data. For example, a method of decoding video data includes accessing an encoded video bitstream comprising multiple layers. The encoded video bitstream includes multiple access units, with a first access unit including an end of sequence network abstraction layer unit. A second access unit includes an intra random access point picture and is a next subsequent access unit in decoding order following the first access unit. The method further includes reinitializing decoding at the second access unit based on the second access unit being the next subsequent access unit in decoding order following the end of sequence network abstraction layer unit of the first access unit.