摘要:
A system and method for signaling and indicating track relationship information in media files provides a mechanism for signaling the information of layered track groups and the information of multiple description coding (MDC) track groups, as well as a mechanism for signaling track relationship information in an efficient way. In local playback or unicast streaming applications, to select an independently decodable track for a certain media type, the information of alternate track groups is first found via the track relation box, and one track is selected from an alternate track group for the media type. If stream switching is desired, the switching track group information is found via the track relation box. In multicast applications with scalable or MDC streams, the tracks in a layered or MDC group are found via the track relation box and selected among all the layered or MDC groups.
摘要:
A system and method for signaling and indicating track relationship information in media files. The present invention provides a mechanism for signaling the information of layered track groups and the information of MDC track groups, as well as a mechanism for signaling track relationship information in an efficient way. In local playback or unicast streaming applications, to select an independently decodable track for a certain media type, the information of alternate track groups is first found via the track relation box, and one track is selected from an alternate track group for the media type. If stream switching is desired, the switching track group information is found via the track relation box. In multicast applications with scalable or MDC streams, the tracks in a layered or MDC group are found via the track relation box and selected among all the layered or MDC groups.
摘要:
A system and method for identifying when an indicated or predetermined media transcoding process results in a media stream that is compliant with an indicated interoperability point. Various embodiments allow for the encoding, storage, transcoding, and transmission of a media stream, for which a transcoding process is identified and the resulting media stream of the transcoding process is associated with at least one property. The signaling of the property or properties, and an identification of the transcoding process in one embodiment, may be included in the media bitstream, in a file format container containing the media bitstream, or in a transmission or control protocol for transmission or control of the media bitstream.
摘要:
The use of a picture delimiter that is contained in a NAL unit type value that is reserved in the current AVC or SVC specification. The present invention provides the scalability information for the H.264/AVC base layer in such a manner that bitstreams remain decodable with H.264/AVC decoders. In addition, the picture delimiter of the present invention may contain many other syntax elements that can help in easier processing of bitstreams compared to the plain H.264/AVC bitstream syntax.
摘要:
A system and method for encoding a first picture sequence and a second picture sequence into coded pictures, with the first picture sequence and the second picture sequence being different, and with at least one coded picture of the second picture sequence being predicted from at least one picture in the first picture sequence. According to various embodiments of the present invention, signal element is encoded into a coded picture of the second picture sequence. The signal element indicates whether a picture in the first picture sequence is used for prediction of the coded picture of the second picture sequence.
摘要:
A system and method for improving both coding efficiency and decoding accuracy in scalable video coding involving discardable lower layer adaptations, where applications can omit transmission of unused data for inter-layer prediction for all layers below an operation point of receivers. Indicator flags are utilized to avoid transmitting syntax elements that must be set to fixed values. In addition, padding algorithms or additional encoding constraints are applied to the encoding process.
摘要:
The exemplary embodiments of this invention provide in one aspect thereof an ability to signal multiple decoding times for each sample in a file format level in order to allow, for example, different decoding times for each sample (or sample subset) between decoding an entire stream and decoding a subset of the stream. An alternate decoding time box is specified to allow for the signaling of multiple decoding times for each sample. Such a box can contain a compact version of a table that allows indexing from an alternate decoding time to a sample number, where an alternate decoding time is a decoding time to be used with a sample when only a subset of an elementary stream stored in a track is to be decoded. Furthermore, each entry in the table provides the number of consecutive samples with the same time delta, and the delta between those consecutive samples. By adding the deltas a complete time-to-sample map can be constructed.
摘要:
An improved system and method for implementing efficient decoding of scalable video bitstreams is provided. A virtual decoded picture buffer is provided for each lower layer of the scalable video bitstream. The virtual decoded picture buffer stores decoded lower layer pictures for reference. The decoded lower layer pictures used for reference are compiled to create a reference picture list for each layer. The reference picture list generated by the virtual decoded picture buffer is used during a direct prediction process instead of a target reference list to correctly decode a current macroblock.
摘要:
A system and method of modifying error resiliency features by conveying temporal level 0 picture indices, such as tl0_pic_idx, within an SEI message instead of optionally including them in the NAL unit header is provided. In addition, a mechanism is provided for enabling repetition of any SEI messages in Real-Time Transport Protocol (RTP) packets. Enabling such repetition of any SEI messages facilitates detection of lost temporal level 0 pictures on the basis of any received packet.
摘要:
A system and method for implementing low complexity multi-view video coding. According to various embodiments, single-loop decoding is applied to multi-view video coding. For N coded views, where only M of the N views are to be displayed, only those M views are required to be fully decoded and stored to a decoded picture buffer (DPB) when needed. Pictures of other views are only partially decoded or simply parsed and do not have to be stored into the DPB. Various embodiments also provide for an encoder that encodes multi-view video bitstreams in accordance with the single-loop decoding concept, as well as a decoder that utilizes single-loop decoding to decode and output on a subset of the encoded views from a multi-view bitstream.