Abstract:
Techniques are described related to performing random access starting from a random access point picture that is not an instantaneous decoder refresh picture. Some techniques are also related to reducing the amount of information that is signaled for long-term reference pictures of a reference picture set. Additional techniques are also related to decoded picture buffer management, such as removing decoded pictures based on a temporal identification value.
Abstract:
Techniques are described related to output and removal of decoded pictures from a decoded picture buffer (DPB). The example techniques may remove a decoded picture from the DPB prior to coding a current picture. For instance, the example techniques may remove the decoded picture if that decoded picture is not identified in the reference picture set of the current picture.
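The removal rule in this abstract can be illustrated with a minimal sketch. It assumes a DPB modeled as a list of pictures identified by picture order count (POC); the `DecodedPicture` type, the `needed_for_output` flag, and the `prune_dpb` helper are hypothetical names introduced for illustration and do not come from the abstract.

```python
from dataclasses import dataclass


@dataclass
class DecodedPicture:
    poc: int                 # picture order count (assumed identifier)
    needed_for_output: bool  # hypothetical flag: picture has not been output yet


def prune_dpb(dpb, reference_picture_set):
    """Return the pictures kept in the DPB before coding the current picture.

    A picture is dropped when it is not identified in the current picture's
    reference picture set and (an assumption of this sketch) it is no longer
    waiting to be output.
    """
    rps = set(reference_picture_set)
    return [pic for pic in dpb if pic.poc in rps or pic.needed_for_output]


dpb = [
    DecodedPicture(poc=0, needed_for_output=False),
    DecodedPicture(poc=4, needed_for_output=False),
    DecodedPicture(poc=8, needed_for_output=True),
]
kept = prune_dpb(dpb, reference_picture_set=[4])
kept_pocs = [pic.poc for pic in kept]
```

In this sketch the picture with POC 0 is removed because it is neither in the reference picture set nor pending output, while the pictures with POC 4 and POC 8 are kept.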
Abstract:
A video encoder is configured to determine a picture size for one or more pictures included in a video sequence. The picture size associated with the video sequence may be a multiple of an aligned coding unit size for the video sequence. In one example, the aligned coding unit size for the video sequence may comprise a minimum coding unit size where the minimum coding unit size is selected from a plurality of smallest coding unit sizes corresponding to different pictures in the video sequence. A video decoder is configured to obtain syntax elements to determine the picture size and the aligned coding unit size for the video sequence. The video decoder decodes the pictures included in the video sequence with the picture size, and stores the decoded pictures in a decoded picture buffer.
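A minimal sketch of the size relationship described in this abstract follows. It takes the aligned coding unit size to be the minimum of the per-picture smallest coding unit sizes, as in the example given. Rounding a requested size up to the next multiple is an assumption of the sketch, since the abstract only states that the picture size is a multiple of the aligned size. The function names are hypothetical.

```python
def aligned_cu_size(smallest_cu_sizes):
    """Pick the aligned CU size as the minimum of the per-picture smallest CU sizes."""
    return min(smallest_cu_sizes)


def align_picture_size(width, height, aligned_cu):
    """Round width and height up to the nearest multiple of the aligned CU size.

    Rounding up (rather than down) is an assumption made for this sketch.
    """
    def round_up(value):
        # Ceiling division using integer arithmetic only.
        return -(-value // aligned_cu) * aligned_cu

    return round_up(width), round_up(height)


# Hypothetical smallest CU sizes for three pictures in one sequence.
acu = aligned_cu_size([16, 8, 16])
size_for_acu_8 = align_picture_size(1920, 1080, acu)
size_for_acu_16 = align_picture_size(1920, 1080, 16)
```

With an aligned size of 8, a 1920x1080 picture already satisfies the constraint. With an aligned size of 16, the height would have to grow to 1088.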
Abstract:
A system and method of modifying error resiliency features by conveying temporal level 0 picture indices, such as tl0_pic_idx, within an SEI message instead of optionally including them in the NAL unit header is provided. In addition, a mechanism is provided for enabling repetition of any SEI messages in Real-Time Transport Protocol (RTP) packets. Enabling such repetition of any SEI messages facilitates detection of lost temporal level 0 pictures on the basis of any received packet.
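The loss-detection idea can be sketched as follows. The sketch assumes that every received packet carries a `tl0_pic_idx` value, that the value of a temporal level 0 picture is one more than that of the previous temporal level 0 picture, that other pictures repeat the index of the most recent temporal level 0 picture, and that the index wraps modulo 256. The abstract confirms none of these details, and the function name is hypothetical. The check is meant to be applied once per received picture.

```python
TL0_IDX_MODULUS = 256  # assumption: 8-bit index that wraps around


def tl0_picture_lost(last_tl0_idx, packet_tl0_idx, packet_is_tl0):
    """Return True if the received index implies a missed temporal level 0 picture.

    last_tl0_idx   -- index of the last temporal level 0 picture received
    packet_tl0_idx -- tl0_pic_idx carried with the newly received picture
    packet_is_tl0  -- True if the newly received picture is temporal level 0
    """
    if packet_is_tl0:
        expected = (last_tl0_idx + 1) % TL0_IDX_MODULUS
    else:
        expected = last_tl0_idx
    return packet_tl0_idx != expected


no_loss_same_level = tl0_picture_lost(5, 5, packet_is_tl0=False)
no_loss_next_tl0 = tl0_picture_lost(5, 6, packet_is_tl0=True)
loss_detected = tl0_picture_lost(5, 6, packet_is_tl0=False)
no_loss_wraparound = tl0_picture_lost(255, 0, packet_is_tl0=True)
```

In the third case, a higher-level picture refers to temporal level 0 index 6 although the receiver last saw index 5, so a temporal level 0 picture must have been lost.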
Abstract:
Systems and methods are provided which allow receivers to recover the decoding order of network abstraction layer (NAL) units conveyed in different Real-time Transport Protocol (RTP) sessions. An indication of decoding order for application data units (ADUs) in each packet is included in the packet structure of a PACSI NAL unit, when the PACSI NAL unit is a single-time aggregation packet type A (STAP-A) packet and the PACSI NAL unit is the first NAL unit in an aggregation packet (e.g., when a receiver is subscribed to different RTP sessions that convey NAL units). If the receiver is subscribed to only a base layer RTP session, the CL-DON indication can be ignored.
Abstract:
A device, an encoding method, and a decoding method enable a separate marking of base representations and enhanced representations of key access units to save memory or to allow a better scalable video coding optimization. The encoding method of a sequence of original pictures to a sequence of access units includes, after encoding of one of the access units, storing a first decoded picture of the first encoded picture and a second decoded picture of the second encoded picture for inter prediction for encoding of others of the access units; and identifying the first decoded picture and the second decoded picture to be no longer used for inter prediction. The decoding method includes decoding the first access unit, where a first decoded picture is decoded from the first picture and a second decoded picture is decoded from the second picture; marking the first and second decoded pictures as used for inter prediction; decoding the second access unit; and marking one of the first and second decoded pictures as no longer used for inter prediction.
Abstract:
The present invention provides a method and a system for verifying a match between states of a first video processor and a second video processor, wherein one of said first and second video processors is a video encoder utilizing predictive video encoding and the other one of said first and second video processors is a video decoder capable of reproducing a decoded bit stream from an encoded bit stream generated by said video encoder.
Abstract:
A method is provided for indicating the size, shape and location of a region within a digital picture, the picture being divided into a set of blocks. A value is defined for at least one size parameter, which is indicative of the number of blocks within said region, and a value is selected for at least one shape evolution parameter, which is indicative of a selection order of the blocks in said region. The values for said at least one size parameter and said at least one shape evolution parameter are then preferably encoded into a bitstream of a video sequence in order to indicate the size, shape and location of the region within the picture.
Abstract:
Systems and methods are provided for signaling the decoding order of ADUs to enable efficient recovery of the decoding order of ADUs when session multiplexing is in use. A decoding order recovery process in a receiver is improved when session multiplexing is in use. For example, various embodiments improve the decoding order recovery process of SVC when no CS-DONs are utilized. Upon packetization, first information that is associated with a first media sample and identifies a second media sample is signaled to indicate, or aid in recovering, the decoding order. Upon de-packetizing, a decoding order of the first media sample and the second media sample is determined based on the received signaling of the first information.
Abstract:
An improved system and method for enabling the insertion, overlay, removal or replacement of sequential or concurrent targeted program segments and/or visual icons in a video bitstream without modifying the fidelity of the underlying visual media. The present invention provides for a wide variety of supplemental enhancement information fields which permit the use of data updates that are synchronous with delivered video content. The present invention offers a generic approach to program insertion and iconic overlay that covers a wide range of use-cases and applications, without necessarily transmitting the visual content to be inserted as part of the underlying visual media stream.