Abstract:
The exemplary embodiments of this invention provide in one aspect thereof an ability to signal multiple decoding times for each sample in a file format level in order to allow, for example, different decoding times for each sample (or sample subset) between decoding an entire stream and decoding a subset of the stream. An alternate decoding time box is specified to allow for the signaling of multiple decoding times for each sample. Such a box can contain a compact version of a table that allows indexing from an alternate decoding time to a sample number, where an alternate decoding time is a decoding time to be used with a sample when only a subset of an elementary stream stored in a track is to be decoded. Furthermore, each entry in the table provides the number of consecutive samples with the same time delta, and the delta between those consecutive samples. By adding the deltas a complete time-to-sample map can be constructed.
Abstract:
A system and method provides a first integrity check code that can be calculated at an encoder and then sent to a decoder as a supplemental enhancement information message. The decoder can then calculate a second integrity check code over the actual received network abstraction layer units. This second integrity check code can be compared with the encoder-generated first integrity check code sent via the supplemental enhancement information message to indicate if in fact all of the transmitted NAL units from which the integrity check code was generated have been received without changes in their content. In addition, an error tracking algorithm is provided that can be run at either the encoder or the decoder in order to determine if the network abstraction layer units are correct in content at the decoder level. Therefore, pictures that are sent as just intra coded frames and pictures that are sent as just inter coded frames can both be checked for errors. Hence, error checking can be provided in those situations where the decoder fails to provide adequate information to the encoder as to what has actually been decoded.
Abstract:
An improved system and method for implementing efficient decoding of scalable video bitstreams is provided. A virtual decoded picture buffer is provided for each lower layer of the scalable video bitstream. The virtual decoded picture buffer stores decoded lower layer pictures for reference. The decoded lower layer pictures used for reference are compiled to create a reference picture list for each layer. The reference picture list generated by the virtual decoded picture buffer is used during a direct prediction process instead of a target reference list to correctly decode a current macroblock.
Abstract:
A system and method of modifying error resiliency features by conveying temporal level 0 picture indices, such as tl0_pic_idx, within an SEI message instead of optionally including them in the NAL unit header is provided. In addition, a mechanism is provided for enabling repetition of any SEI messages in Real-Time Transport Protocol (RTP) packets. Enabling such repetition of any SEI messages facilitates detection of lost temporal level 0 pictures on the basis of any received packet.
Abstract:
A system and method for implementing low complexity multi-view video coding. According to various embodiments, single-loop decoding is applied to multi-view video coding. For N coded views, where only M of the N views are to be displayed, only those M views are required to be fully decoded and stored to a decoded picture buffer (DPB) when needed. Pictures of other views are only partially decoded or simply parsed and do not have to be stored into the DPB. Various embodiments also provide for an encoder that encodes multi-view video bitstreams in accordance with the single-loop decoding concept, as well as a decoder that utilizes single-loop decoding to decode and output on a subset of the encoded views from a multi-view bitstream.
Abstract:
A video coding and decoding method, wherein a picture is first divided into sub-pictures corresponding to one or more subjectively important picture regions and to a background region sub-picture, which remains after the other sub-pictures are removed from the picture. The sub-pictures are formed to conform to predetermined allowable groups of video coding macroblocks (MBs). The allowable groups of MBs can be, for example, of rectangular shape. The picture is then divided into slices so that each sub-picture is encoded independent of other sub-pictures except for the background region sub-picture, which may be coded using another sub-pictures. The slices of the background sub-picture are formed in a scan-order with skipping over MBs that belong to another sub-picture. The background sub-picture is only decoded if all the positions and sizes of all other sub-pictures can be reconstructed on decoding the picture.
Abstract:
A system and method for providing improved FGS identification in scalable video coding. According to the present invention, each FGS enhancement layer is assigned a unique dependency identifier and contains only FGS enhancement information. For subsequent enhancement layers, the base dependency identifier will point to either a base-quality layer or an FGS enhancement layer. Alternatively, two base dependency identifiers can be used. One identifier is used to identify the base quality layer, which does not contain FGS information, that is used for the prediction of coding mode and motion information for a subsequent enhancement layer. The other identifier is used to identify the FGS enhancement layer, which contains only FGS information, that is used for the prediction of sample and/or residual data for a subsequent enhancement layer.
Abstract:
A file format structure for use with scalable video coding systems and methods, where a rate share operation box indicates which adaptation operation points for coded media are available in a file. The rate share operation box can include a characterization of the adaptation result for each adaptation operation point. In addition, the rate share operation box can include an identifier that indicates which algorithm was used in generating each adaptation operation point.
Abstract:
A system and method for providing media streaming are provided. A media description file having a range string indicator is used to specify location offsets of a plurality of media fragments within a media file. The range string may be stored directly in the media description file or the media description file may contain an indicator specifying a file containing the range string. The media description file may also include a range unit specifying the size of the offsets in the range string. In an embodiment, offsets within the individual media fragments may also be provided, either in the media description file or in a different file.
Abstract:
A method of encoding scalable video data having multiple layers where each layer in the multiple layers is associated with at least one other layer includes identifying one or more layers using a first identifier where the first identifier indicates decoding dependency, and identifying reference pictures within the identified one or more layers using a second identifier. The coding of the second identifier for pictures in a first layer is independent of pictures in a second enhancement layer. As such, for all pictures with a certain value of DependencyID, the syntax element frame_num is coded independently of other pictures with different values of DependencyID. Within all pictures with a pre-determined value of DependencyID, a default frame_num coding method is used.