Abstract:
The precision of up-sampling operations in a layered coding system is preserved when operating on video data with high bit-depth. In response to bit-depth requirements of the video coding or decoding system, scaling and rounding parameters are determined for a separable up-scaling filter. Input data are first filtered across a first spatial direction using a first rounding parameter to generate first up-sampled data. First intermediate data are generated by scaling the first up-sampled data using a first shift parameter. The intermediate data are then filtered across a second spatial direction using a second rounding parameter to generate second up-sampled data. Second intermediate data are generated by scaling the second up-sampled data using a second shift parameter. Final up-sampled data may be generated by clipping the second intermediate data.
Abstract:
Stereoscopic video data and corresponding depth map data for stereoscopic and auto-stereoscopic displays are coded using a coded base layer and one or more coded enhancement layers. Given a 3D input picture and corresponding input depth map data, a side-by-side and a top-and-bottom picture are generated based on the input picture. Using an encoder, the side-by-side picture is coded to generate a coded base layer Using the encoder and a texture reference processing unit (RPU), the top-and-bottom picture is encoded to generate a first enhancement layer, wherein the first enhancement layer is coded based on the base layer stream, and using the encoder and a depth-map RPU, depth data for the side-by-side picture are encoded to generate a second enhancement layer, wherein the second enhancement layer is coded based on to the base layer. Alternative single, dual, and multi-layer depth map delivery systems are also presented.
Abstract:
A 3D display is characterized by a quality of viewing experience (QVE) mapping which represents a display-specific input-output relationship between input depth values and output QVE values. Examples of QVE mappings based on a metric of “viewing blur” are presented. Given reference depth data generated for a reference display and a representation of an artist's mapping function, which represents an input-output relationship between original input depth data and QVE data generated using a QVE mapping for a reference display, a decoder may reconstruct the reference depth data and apply an inverse QVE mapping for a target display to generate output depth data optimized for the target display.
Abstract:
Video data are coded in a coding-standard layered bit stream. Given a base layer (BL) and one or more enhancement layer (EL) signals, the BL signal is coded into a coded BL stream using a BL encoder which is compliant to a first coding standard. In response to the BL signal and the EL signal, a reference processing unit (RPU) determines RPU processing parameters. In response to the RPU processing parameters and the BL signal, the RPU generates an inter-layer reference signal. Using an EL encoder which is compliant to a second coding standard, the EL signal is coded into a coded EL stream, where the encoding of the EL signal is based at least in part on the inter-layer reference signal. Receivers with an RPU and video decoders compliant to both the first and the second coding standards may decode both the BL and the EL coded streams.
Abstract:
Novel methods and systems for color grading are disclosed. The color grading process for a visual dynamic range image can be guided by information relating to the color grading of other images such as the standard dynamic range image.
Abstract:
Novel methods and systems for inverse tone mapping are disclosed. A scene can be analyzed to obtain highlight detection from bright light sources and specular reflections. An inverse tone mapping curve can be calculated based on the lower dynamic range and higher dynamic range displays. Multi-scale filtering can be applied to reduce noise or artifacts.
Abstract:
Video data are coded in a coding-standard layered bit stream. Given a base layer (BL) and one or more enhancement layer (EL) signals, the BL signal is coded into a coded BL stream using a BL encoder which is compliant to a first coding standard. In response to the BL signal and the EL signal, a reference processing unit (RPU) determines RPU processing parameters. In response to the RPU processing parameters and the BL signal, the RPU generates an inter-layer reference signal. Using an EL encoder which is compliant to a second coding standard, the EL signal is coded into a coded EL stream, where the encoding of the EL signal is based at least in part on the inter-layer reference signal. Receivers with an RPU and video decoders compliant to both the first and the second coding standards may decode both the BL and the EL coded streams.
Abstract:
A method for merging graphics and high dynamic range video data is disclosed. In a video receiver, a display management process uses metadata to map input video data from a first dynamic range into the dynamic range of available graphics data. The remapped video signal is blended with the graphics data to generate a video composite signal. An inverse display management process uses the metadata to map the video composite signal to an output video signal with the first dynamic range. To alleviate perceptual tone-mapping jumps during video scene changes, a metadata transformer transforms the metadata to transformed so that on a television (TV) receiver metadata values transition smoothly between consecutive scenes. The TV receiver receives the output video signal and the transformed metadata to generate video data mapped to the dynamic range of the TV's display.
Abstract:
A method for merging graphics and high dynamic range video data is disclosed. In a video receiver, a display management process uses metadata to map input video data from a first dynamic range into the dynamic range of available graphics data. The remapped video signal is blended with the graphics data to generate a video composite signal. An inverse display management process uses the metadata to map the video composite signal to an output video signal with the first dynamic range. To alleviate perceptual tone-mapping jumps during video scene changes, a metadata transformer transforms the metadata to transformed so that on a television (TV) receiver metadata values transition smoothly between consecutive scenes. The TV receiver receives the output video signal and the transformed metadata to generate video data mapped to the dynamic range of the TV's display.
Abstract:
Stereoscopic video data and corresponding depth map data for stereoscopic and auto-stereoscopic displays are coded using a coded base layer and one or more coded enhancement layers. Given a 3D input picture and corresponding input depth map data, a side-by-side and a top-and-bottom picture are generated based on the input picture. Using an encoder, the side-by-side picture is coded to generate a coded base layer Using the encoder and a texture reference processing unit (RPU), the top-and-bottom picture is encoded to generate a first enhancement layer, wherein the first enhancement layer is coded based on the base layer stream, and using the encoder and a depth-map RPU, depth data for the side-by-side picture are encoded to generate a second enhancement layer, wherein the second enhancement layer is coded based on to the base layer. Alternative single, dual, and multi-layer depth map delivery systems are also presented.