摘要:
A system and method for improving both coding efficiency and decoding accuracy in scalable video coding involving discardable lower layer adaptations, where applications can omit transmission of unused data for inter-layer prediction for all layers below an operation point of receivers. Indicator flags are utilized to avoid transmitting syntax elements that must be set to fixed values. In addition, padding algorithms or additional encoding constraints are applied to the encoding process.
摘要:
A system and method for improving both coding efficiency and decoding accuracy in scalable video coding involving discardable lower layer adaptations, where applications can omit transmission of unused data for inter-layer prediction for all layers below an operation point of receivers. Indicator flags are utilized to avoid transmitting syntax elements that must be set to fixed values. In addition, padding algorithms or additional encoding constraints are applied to the encoding process.
摘要:
Template matching intra prediction based on a given priority is provided. Priority values of all, or a subset of the pixels on a border between a current block and a reconstructed area are calculated. A border pixel with the highest priority is used as the center of a template block. A search for the best matched candidate template is performed in the reconstructed area. Distortion metrics between known pixels in the to-match template and corresponding pixels in candidate templates are calculated and compared. The candidate template with the smallest distortion metric value is chosen as the best match. Corresponding pixels of the best-matched candidate template in the searching area are used as predictors of the unknown pixels in the template centered at the pixel with the highest priority, and the predicted pixels are marked as known. The process is repeated until all pixels in the current block are marked as known.
摘要:
An encoder for use in scalable video coding has a mechanism to perform macroblock mode selection for the enhancement layer pictures. The mechanism includes a distortion estimator for each macroblock that reacts to channel errors such as packet losses or errors in video segments affected by error propagation; a Lagrange multiple selector for selecting a weighting factor according to estimated or signaled channel error rate, and a mode decision module or algorithm to choose the optimal mode based on encoding parameters. The mode decision module is configured to select the coding mode based on a sum of the estimated coding distortion and the estimated coding rate multiplied by the weighting factor.
摘要:
A system and method for identifying when an indicated or predetermined media transcoding process results in a media stream that is compliant with an indicated interoperability point. Various embodiments allow for the encoding, storage, transcoding, and transmission of a media stream, for which a transcoding process is identified and the resulting media stream of the transcoding process is associated with at least one property. The signaling of the property or properties, and an identification of the transcoding process in one embodiment, may be included in the media bitstream, in a file format container containing the media bitstream, or in a transmission or control protocol for transmission or control of the media bitstream.
摘要:
The use of a picture delimiter that is contained in a NAL unit type value that is reserved in the current AVC or SVC specification. The present invention provides the scalability information for the H.264/AVC base layer in such a manner that bitstreams remain decodable with H.264/AVC decoders. In addition, the picture delimiter of the present invention may contain many other syntax elements that can help in easier processing of bitstreams compared to the plain H.264/AVC bitstream syntax.
摘要:
A system and method for encoding a first picture sequence and a second picture sequence into coded pictures, with the first picture sequence and the second picture sequence being different, and with at least one coded picture of the second picture sequence being predicted from at least one picture in the first picture sequence. According to various embodiments of the present invention, signal element is encoded into a coded picture of the second picture sequence. The signal element indicates whether a picture in the first picture sequence is used for prediction of the coded picture of the second picture sequence.
摘要:
The exemplary embodiments of this invention provide in one aspect thereof an ability to signal multiple decoding times for each sample in a file format level in order to allow, for example, different decoding times for each sample (or sample subset) between decoding an entire stream and decoding a subset of the stream. An alternate decoding time box is specified to allow for the signaling of multiple decoding times for each sample. Such a box can contain a compact version of a table that allows indexing from an alternate decoding time to a sample number, where an alternate decoding time is a decoding time to be used with a sample when only a subset of an elementary stream stored in a track is to be decoded. Furthermore, each entry in the table provides the number of consecutive samples with the same time delta, and the delta between those consecutive samples. By adding the deltas a complete time-to-sample map can be constructed.
摘要:
An improved system and method for implementing efficient decoding of scalable video bitstreams is provided. A virtual decoded picture buffer is provided for each lower layer of the scalable video bitstream. The virtual decoded picture buffer stores decoded lower layer pictures for reference. The decoded lower layer pictures used for reference are compiled to create a reference picture list for each layer. The reference picture list generated by the virtual decoded picture buffer is used during a direct prediction process instead of a target reference list to correctly decode a current macroblock.
摘要:
A system and method of modifying error resiliency features by conveying temporal level 0 picture indices, such as tl0_pic_idx, within an SEI message instead of optionally including them in the NAL unit header is provided. In addition, a mechanism is provided for enabling repetition of any SEI messages in Real-Time Transport Protocol (RTP) packets. Enabling such repetition of any SEI messages facilitates detection of lost temporal level 0 pictures on the basis of any received packet.