Abstract:
Disclosed are techniques for creating, coding, decoding, and using, rotation information related to one or more coded pictures in non-normative parts of a coded video bitstream.
Abstract:
In general the present invention provides a video encoder, which is arranged for adaptive encoding mode selection. The video encoder is operable with a plurality of encoding modes for encoding a current macroblock of a video sequence. The video sequence is preferably intended for being transmitted by a communication network, e.g. any circuit-switched or packet-switched communication network. A distortion estimator is arranged for estimating expected distortion values due to potential erroneous transmission of the current macroblock in dependence of the encoding modes. A decision module is arranged for selecting a final encoding mode from the plurality of encoding modes on the basis of the distortion values and encoding parameters. Further, a table is provided, which is referenced by the spatial position of the macroblock and which is updated with an accumulated distortion value. The video encoder is arranged for applying the final encoding mode for encoding the current macroblock.
Abstract:
A method and system for compressed domain video mixing for spatially combining incoming video streams into an outgoing video stream. Using H.264 as an example, each incoming stream is divided into a plurality of slices, each having a plurality of header fields including a first_mb_in_slice header field. Based on the picture format in the outgoing stream, first_mb_in_slice for each incoming stream is modified such that the modified first_mb_in_slice header field is indicative of location in the spatial representation of the outgoing stream at which the slice of the incoming stream is placed. H.264's slice group mechanism is used to map the spatial positions of the second and following macroblocks of the slices to the appropriate locations. If the incoming streams are previously mixed by upstream mixers, a decomposer can be used to separate these mixed streams into component streams before combining them with other incoming streams.
Abstract:
A method of decoding a coded image or a coded picture in a coded video sequence, into a picture representing a projection on a planar surface of a non-planar surface, is provided, and the method is performed by a decoder employing intra prediction. The method includes obtaining an intra prediction mode indicative of an intra prediction direction, for a block of the picture, determining at least one non-straight curve representative of a straight line on the non-planar surface, based on the projection and the obtained intra prediction mode, and creating, for the block, prediction samples along the determined at least one non-straight curve, from at least one boundary sample along the determined at least one non-straight curve.
Abstract:
A method and apparatus for media decoding by a decoder include decoding a first indication indicative of a first conformance point of a coded video sequence. A second indication indicative of a second conformance point of the coded video sequence is decoded. It is determined whether the coded video sequence is decodable by the decoder based on at least one of the first indication and the second indication. The coded video sequence is selectively decoded based on determining whether the decoded video sequence is decodable by the decoder.
Abstract:
A method of and an apparatus for decoding a coded picture of a coded video sequence including a first segment and a second segment, are provided. The method includes determining a first decoding process for decoding the first segment, in which a first prediction is disallowed, based on at least a first syntax element of a high level syntax structure applicable to the first segment and the second segment, the first syntax element indicating that the first prediction is disallowed, and determining a second decoding process for decoding the second segment, in which a second prediction different than the first prediction is disallowed, based on at least a second syntax element of the high level syntax structure, the second syntax element indicating that the second prediction is disallowed.
Abstract:
Transmitting content and tailoring encoding according to application requirements. To change encoding, choosing an operation point for encoding content depending on the application receiving the content, sending data representative of the chosen operation point to a sender of the content, receiving data representative of an operation point from a receiver of the content, adapting encoding of the content depending on the received data representative of the operation point, encoding the content based on the operation point, sending the encoded content to the receiver, and receiving the encoded content wherein the encoding is based on the operation point is provided.
Abstract:
An approach for reconstructing a Network Abstraction Layer (NAL) unit for video decoding using at least one processor includes decoding a first syntax element included in a NAL unit header; determining, based on the first syntax element, a NAL unit class including a plurality of NAL unit types; decoding a second syntax element included in the NAL unit header; and based on the NAL unit class being a first NAL unit class, determining a NAL unit type from among the NAL unit types using a combination of the NAL unit class and the second syntax element, and reconstructing the NAL unit based on the determined NAL unit type; and based on the NAL unit class being a second NAL unit class, determining a temporal identifier (TID) based on the second syntax element, and reconstructing the NAL unit based on the determined TID.
Abstract:
An approach for reconstructing a current Network Abstraction Layer (NAL) unit for video decoding using at least one processor includes determining the current NAL unit to be an Intra Random Access Picture (IRAP) NAL unit; determining whether a previous NAL unit decoded immediately before the current NAL unit indicates an end of a coded video sequence (CVS); based on determining that the previous NAL unit indicates the end of the CVS, decoding the current NAL unit as one from among an Instantaneous Decoder Refresh (IDR) NAL unit or a Broken Link Access (BLA) NAL unit; and based on determining that the previous NAL unit does not indicate the end of the CVS, decoding the current NAL unit as a Clean Random Access (CRA) NAL unit, and reconstructing the decoded current NAL unit.
Abstract:
A method of and a device for decoding at least one video stream, each video stream of the at least one video stream associated with a respective Decoder Parameter Set. The method includes activating, by a decoder, the Decoder Parameter Set of a first video stream of the at least one video stream, processing, by the decoder, an End of Stream NAL unit that is made available to the decoder outside of the first video stream, and deactivating, by the decoder, the Decoder Parameter Set of the first video stream in response to the decoder processing the End of Stream NAL unit.