摘要:
A system and method for identifying when an indicated or predetermined media transcoding process results in a media stream that is compliant with an indicated interoperability point. Various embodiments allow for the encoding, storage, transcoding, and transmission of a media stream, for which a transcoding process is identified and the resulting media stream of the transcoding process is associated with at least one property. The signaling of the property or properties, and an identification of the transcoding process in one embodiment, may be included in the media bitstream, in a file format container containing the media bitstream, or in a transmission or control protocol for transmission or control of the media bitstream.
摘要:
The use of a picture delimiter that is contained in a NAL unit type value that is reserved in the current AVC or SVC specification. The present invention provides the scalability information for the H.264/AVC base layer in such a manner that bitstreams remain decodable with H.264/AVC decoders. In addition, the picture delimiter of the present invention may contain many other syntax elements that can help in easier processing of bitstreams compared to the plain H.264/AVC bitstream syntax.
摘要:
A system and method for encoding a first picture sequence and a second picture sequence into coded pictures, with the first picture sequence and the second picture sequence being different, and with at least one coded picture of the second picture sequence being predicted from at least one picture in the first picture sequence. According to various embodiments of the present invention, signal element is encoded into a coded picture of the second picture sequence. The signal element indicates whether a picture in the first picture sequence is used for prediction of the coded picture of the second picture sequence.
摘要:
A system and method for improving both coding efficiency and decoding accuracy in scalable video coding involving discardable lower layer adaptations, where applications can omit transmission of unused data for inter-layer prediction for all layers below an operation point of receivers. Indicator flags are utilized to avoid transmitting syntax elements that must be set to fixed values. In addition, padding algorithms or additional encoding constraints are applied to the encoding process.
摘要:
The exemplary embodiments of this invention provide in one aspect thereof an ability to signal multiple decoding times for each sample in a file format level in order to allow, for example, different decoding times for each sample (or sample subset) between decoding an entire stream and decoding a subset of the stream. An alternate decoding time box is specified to allow for the signaling of multiple decoding times for each sample. Such a box can contain a compact version of a table that allows indexing from an alternate decoding time to a sample number, where an alternate decoding time is a decoding time to be used with a sample when only a subset of an elementary stream stored in a track is to be decoded. Furthermore, each entry in the table provides the number of consecutive samples with the same time delta, and the delta between those consecutive samples. By adding the deltas a complete time-to-sample map can be constructed.
摘要:
An improved system and method for implementing efficient decoding of scalable video bitstreams is provided. A virtual decoded picture buffer is provided for each lower layer of the scalable video bitstream. The virtual decoded picture buffer stores decoded lower layer pictures for reference. The decoded lower layer pictures used for reference are compiled to create a reference picture list for each layer. The reference picture list generated by the virtual decoded picture buffer is used during a direct prediction process instead of a target reference list to correctly decode a current macroblock.
摘要:
A system and method of modifying error resiliency features by conveying temporal level 0 picture indices, such as tl0_pic_idx, within an SEI message instead of optionally including them in the NAL unit header is provided. In addition, a mechanism is provided for enabling repetition of any SEI messages in Real-Time Transport Protocol (RTP) packets. Enabling such repetition of any SEI messages facilitates detection of lost temporal level 0 pictures on the basis of any received packet.
摘要:
A system and method for implementing low complexity multi-view video coding. According to various embodiments, single-loop decoding is applied to multi-view video coding. For N coded views, where only M of the N views are to be displayed, only those M views are required to be fully decoded and stored to a decoded picture buffer (DPB) when needed. Pictures of other views are only partially decoded or simply parsed and do not have to be stored into the DPB. Various embodiments also provide for an encoder that encodes multi-view video bitstreams in accordance with the single-loop decoding concept, as well as a decoder that utilizes single-loop decoding to decode and output on a subset of the encoded views from a multi-view bitstream.
摘要:
A video coding and decoding method, wherein a picture is first divided into sub-pictures corresponding to one or more subjectively important picture regions and to a background region sub-picture, which remains after the other sub-pictures are removed from the picture. The sub-pictures are formed to conform to predetermined allowable groups of video coding macroblocks (MBs). The allowable groups of MBs can be, for example, of rectangular shape. The picture is then divided into slices so that each sub-picture is encoded independent of other sub-pictures except for the background region sub-picture, which may be coded using another sub-pictures. The slices of the background sub-picture are formed in a scan-order with skipping over MBs that belong to another sub-picture. The background sub-picture is only decoded if all the positions and sizes of all other sub-pictures can be reconstructed on decoding the picture.
摘要:
A file format structure for use with scalable video coding systems and methods, where a rate share operation box indicates which adaptation operation points for coded media are available in a file. The rate share operation box can include a characterization of the adaptation result for each adaptation operation point. In addition, the rate share operation box can include an identifier that indicates which algorithm was used in generating each adaptation operation point.