Abstract:
Methods for the chroma reshaping of high-dynamic range (HDR) signals are presented. For each input pixel, a first scaler is generated based on the pixel luminance and a luma-based chroma reshaping (LCR) function. A second scaler is generated based on a saturation-based chroma reshaping (SCR) function and the saturation value of the pixel. A cascaded scaler is generated by multiplying the first scaler with the second scaler. Reshaped chroma values are generated by multiplying the input chroma values with the cascaded scaler. The characteristics of preferred LCR and SCR functions are discussed. Inverse decoding methods based on chroma-reshaping metadata that define the LCR and SCR functions are also described.
Abstract:
Methods to improve the quality of coding high-dynamic range (HDR) signals are presented. Instead of using a single chroma quantization table for all color formats, a video encoder may adaptively use separate tables for each one, and transmit the table's ID to a decoder. Examples for chroma quantization tables for video content encoded in the YCbCr (PQ) and ICtCp (PQ) color formats under a variety of color gamut containers are provided.
Abstract:
In a method to reconstruct a high dynamic range video signal, a decoder receives a base layer standard dynamic range video signal, an enhancement layer video signal, a metadata bitstream for a reference processing unit and a CRC code related to the metadata. A decoder reconstructs a high-dynamic range video output signal based on the base layer video signal, the enhancement layer video signal, and the data syntax and metadata specified by the metadata bitstream.
Abstract:
A processor for video coding receives a full-frame rate (FFR) HDR video signal and a corresponding FFR SDR video signal. An encoder generates a scalable bitstream that allows decoders to generate half-frame-rate (HFR) SDR, FFR SDR, HFR HDR, or FFR HDR signals. Given odd and even frames of the input FFR SDR signal, the scalable bitstream combines a base layer of coded even SDR frames with an enhancement layer of coded packed frames, where each packed frame includes a downscaled odd SDR frame, a downscaled even HDR residual frame, and a downscaled odd HDR residual frame. In an alternative implementation, the scalable bitstream combines four signals layers: a base layer of even SDR frames, an enhancement layer of odd SDR frames, a base layer of even HDR residual frames and an enhancement layer of odd HDR residual frames. Corresponding decoder architectures are also presented.
Abstract:
A processor for forward reshaping to a target bit depth receives an input image with an input bit depth. The input luminance range is divided into luminance bands and each band is assigned a visual importance index. The output codewords at the target bit depth are divided into codeword bins and each bin is assigned to one of the luminance bands. Each codeword bin is assigned the visual importance index of the luminance band it belongs. Each bin is allocated a number of reshaped codewords according to its visual importance index and a codeword mapping function, where bins with the highest importance index are allocated more codewords than the other bins. A signal reshaping function is generated based on the allocation of codewords, and the reshaping function is applied to the input image to generate an output image in the target bit depth.
Abstract:
Techniques use multiple lower bit depth (e.g., 8 bits) codecs to provide higher bit depth (e.g., 12+ bits) high dynamic range images from an upstream device to a downstream device. Multiple layers comprising a base layer and one or more enhancement layers may be used to carry video signals comprising image data compressed by lower bit depth encoders to a downstream device, wherein the base layer cannot be decoded and viewed on its own. Lower bit depth input image data to base layer processing may be generated from higher bit depth high dynamic range input image data via advanced quantization to minimize the volume of image data to be carried by enhancement layer video signals. The image data in the enhancement layer video signals may comprise residual values, quantization parameters, and mapping parameters based in part on a prediction method corresponding to a specific method used in the advanced quantization. Adaptive dynamic range adaptation techniques take into consideration special transition effects, such as fade-in and fade-outs, for improved coding performance.
Abstract:
Techniques use multiple lower bit depth codecs to provide higher bit depth, high dynamic range, images from an upstream device to a downstream device. A base layer and one or more enhancement layers may be used to carry video signals, wherein the base layer cannot be decoded and viewed on its own. Lower bit depth input image data to base layer processing may be generated from higher bit depth high dynamic range input image data via advanced quantization to minimize the volume of image data to be carried by enhancement layer video signals. The image data in the enhancement layer video signals may comprise residual values, quantization parameters, and mapping parameters based in part on a prediction method corresponding to a specific method used in the advanced quantization. Adaptive dynamic range adaptation techniques take into consideration special transition effects, such as fade-in and fade-outs, for improved coding performance.
Abstract:
A sequence of enhanced dynamic range (EDR) images and a sequence of standard dynamic range images are encoded using a backwards-compatible SDR high-definition (HD) base layer and one or more enhancement layers. The EDR and SDR video signals may be of the same resolution (e.g., HD) or at different resolutions (e.g., 4K and HD) and are encoded using a dual-view-dual-layer (DVDL) encoder to generate a coded base layer (BL) and a coded enhancement layer (EL). The DVDL encoder includes a reference processing unit (RPU) which is adapted to compute a reference stream based on the coded BL stream. The RPU operations include post-processing, normalization, inverse normalization, and image registration. Decoders for decoding the coded BL and EL streams to generate a backwards compatible 2D SDR stream and additional 2D or 3D SDR or EDR streams, are also described.
Abstract:
Given an input progressive sequence, a video encoder creates a dual-layer stream that combines a backwards-compatible interlaced video stream layer with an enhancement layer to reconstruct full-resolution progressive video. Given two consecutive frames in the input progressive sequence, vertical processing generates a top field-bottom field (TFBF) frame in a base layer (BL) TFBF sequence, and horizontal processing generates a side-by-side (SBS) frame in an enhancement layer (EL) SBS video sequence. The BL TFBF and the EL SBS sequences are compressed together to create a coded, backwards compatible output stream.
Abstract:
Given an input progressive sequence, a video encoder creates a dual-layer stream that combines a backwards-compatible interlaced video stream layer with an enhancement layer to reconstruct full-resolution progressive video. Given two consecutive frames in the input progressive sequence, vertical processing generates a top field-bottom field (TFBF) frame in a base layer (BL) TFBF sequence, and horizontal processing generates a side-by-side (SBS) frame in an enhancement layer (EL) SBS video sequence. The BL TFBF and the EL SBS sequences are compressed together to create a coded, backwards compatible output stream.