Abstract:
A scalable video bitstream may have an H.264 /AVC compatible base layer (BL) and a scalable enhancement layer (EL), where scalability refers to color bit depth. The H.264/AVC scalability extension SVC provides also other types of scalability, e.g. spatial scalability where the number of pixels in BL and EL are different. According to the invention, BL information is upsampled in two logical steps (TUp, BDUp), one being texture upsampling and the other being bit depth upsampling. Texture upsampling is a process that increases the number of pixels, and bit depth upsampling is a process that increases the number of values that each pixel can have, corresponding to the pixels color intensity. The upsampled BL data are used to predict the collocated EL. The BL information is upsampled at the encoder side and in the same manner at the decoder side, wherein the upsampling refers to spatial and bit depth characteristics.
Abstract:
This invention presents a scalable solution to encode the whole 12-bit raw video once to generate one bitstream that contains an H.264/AVC compatible base layer and a scalable enhancement layer. If an H.264/AVC decoder is available at the client end, only the base layer sub- bitstream is decoded and the decoded 8-bit video can be viewed on a conventional 8-bit display device; if the color bit depth scalable decoder is available at the client end, both the base layer and the enhancement layer sub-bitstreams will be decoded to obtain the 12-bit video and it can be viewed on a high quality display device that supports more than eight bit.
Abstract:
Enhanced dynamic range requires more than 8 bit representation for single color components of pixels. For this purpose, normal color resolution images and high color resolution images are available. Backward compatibility can be achieved by a layered approach using a color enhancement layer, and a conventional image as color base layer. Both have same spatial and temporal resolution. Encoding of the color enhancement layer uses prediction and residual. A methods for optimized color enhancement prediction is disclosed. Color bit depth prediction is done by constructing a polynomial that approximates for all pixels of one color component of a block the color enhancement layer from the color base layer. A predicted version of the high color resolution image and a residual are generated and updated by a residual. The coefficients are compressed and added as metadata to the data stream.
Abstract:
Conventional intra-prediction uses pixels from left and upper neighbour blocks to predict a macroblock (MB). Thus, the MBs must be sequentially processed, since reconstructed left and upper MBs must be available for prediction. In an improved method for encoding Intra predicted MBs, a MB is encoded in two steps: first, a first portion of the MB is encoded independently, without references outside the MB. Pixels of the first portion can be Intra predicted using DC mode. Then, the first portion is reconstructed. The remaining pixels of the MB, being a second portion, are intra predicted from the reconstructed pixels of the first portion and then reconstructed. The first portion comprises at least one column or one row of pixels of the MB. The encoding is applied to at least two Intra predicted MBs per slice, or per picture if no slices are used.
Abstract:
A method for inter-encoding a block of a colour image in H.264 high444 profile is proposed wherein the image comprises a first, a different second and a different third colour component. Said method comprises the steps of determining among two or more reference block candidates comprised in a different colour image that reference block candidate which has a corresponding first colour component matching said first colour component of said block at least as well as any of corresponding first colour components of the remaining reference block candidates, and encoding the second colour component of said block using a corresponding second colour component of the determined reference block. The reference block having a first colour component, which matches the corresponding first colour component of the block to-be-encoded, often is a good starting point for searching a reference block for a different second colour component of the block to-be-encoded.
Abstract:
There are provided methods and apparatus for inter-layer residue prediction for scalable video. An apparatus is described for an encoder (200) for encoding a block of a picture, or a decoder (300) for decoding a block of a picture, by applying inverse tone mapping to an inter-layer residue prediction process for the block, wherein the inverse tone mapping is performed in the pixel domain. Methods for encoding (440, 460) or decoding (540, 560) a block of a picture are also described; and performed by applying inverse tone mapping to an inter-layer residue prediction process for the block, wherein the inverse tone mapping is performed in the pixel domain.
Abstract:
The invention presents a scalable solution to encode the whole 12-bit raw video once to generate one bitstream that contains an H.264 /AVC compatible base layer and a scalable enhancement layer. If a color bit depth scalable decoder is available at the client end, both the base layer and the enhancement layer sub-bitstreams will be decoded to obtain the 12-bit video and it can be viewed on a high quality display that supports more than eight bit; otherwise only the base layer sub-bitstream is decoded using an H.264 /AVC decoder and the decoded 8-bit video can be viewed on a conventional 8-bit display. The enhancement layer contains a residual based on a prediction from the base layer, which is either based on bit-shift or based on an advanced bit depth prediction is utilized, wherein the advanced bit depth prediction method is a Smoothed Histogram method or a Localized Polynomial Approximation method.
Abstract:
A scalable video bitstream may have an H.264/AVC compatible base layer (BL) and a scalable enhancement layer (EL), where scalability refers to color bit depth. The H.264/AVC scalability extension SVC provides also other types of scalability, e.g. spatial scalability where the number of pixels in BL and EL are different. According to the invention, BL information is upsampled (TUp,BDUp) in two logical steps in adaptive order, one being texture upsampling and the other being bit depth upsampling. Texture upsampling is a process that increases the number of pixels, and bit depth upsampling is a process that increases the number of values that each pixel can have, corresponding to the pixels color intensity. The upsampled BL data are used to predict the collocated EL. A prediction order indication is transferred so that the decoder can upsample BL information in the same manner as the encoder, wherein the upsampling refers to spatial and bit depth characteristics.
Abstract:
A scalable video bitstream may have an H.264 /AVC compatible base layer (BL) and a scalable enhancement layer (EL), where scalability refers to color bit depth. The H.264/AVC scalability extension SVC provides also other types of scalability, e.g. spatial scalability where the number of pixels in BL and EL are different. According to the invention, BL information is upsampled in two logical steps (TUp, BDUp), one being texture upsampling and the other being bit depth upsampling. Texture upsampling is a process that increases the number of pixels, and bit depth upsampling is a process that increases the number of values that each pixel can have, corresponding to the pixels color intensity. The upsampled BL data are used to predict the collocated EL. The BL information is upsampled at the encoder side and in the same manner at the decoder side, wherein the upsampling refers to spatial and bit depth characteristics.
Abstract:
A highly accurate reproduction of visual intensity and contrast rather than the conventional 8-bit color depth is more and more used, motivating the development of an enhanced dynamic range called high bit-depth. A method for encoding a first, low bit-depth image (IM BL ) of M bit RGB pixels and a second, high bit-depth video image (IM EL ) of N bit RGB pixels, M inter ) based on color histograms (Hist BL , Hist EL ) of the first and the second video image, generating a second transfer function (LUT final ) based on the first transfer function (LUT inter ), applying (TR) the second transfer function (LUT final ) to the first video image, wherein a transformed first video image (IM TR, BL ) is generated, calculating and encoding the residual (IM res ) f encoding the first video image (IM BL ) and transmitting the encoded first video image (IM BL ), the second transfer function (LUT final ) and the encoded difference.
Abstract translation:越来越多地使用高度精确的视觉强度和对比度的再现,而不是常规的8位色彩深度,从而激发了称为高位深度的增强动态范围的发展。 一种用于编码M位RGB像素的第一,低位深度图像(IM SUB BL)和第二高位深度视频图像(IM EL EL)的方法, 具有相同空间分辨率的N位RGB像素M EL SUB>),基于第一传递函数(LUT SUB>)生成第二传递函数(LUT SUB>), 将第二传递函数(LUT SUB>)应用(TR)到第一视频图像,其中生成经变换的第一视频图像(IM SUB TR,BL SUB> ,计算和编码编码第一视频图像(IM BL BL)的残差(IM SUB),并且发送编码的第一视频图像(IM BL BL / / SUB >),第二传递函数(LUT SUB> final)和编码的差值。