Abstract:
In a decoder, a processor extracts a control map of false contour filtering from a part of a multi-layer video signal that includes a low dynamic range image mapped from an original high-dynamic range (HDR) image. It determines one or more filter parameters for a sparse finite-impulse-response (FIR) filter, where the one or more filter parameters relate to at least in part on the control map of false contour filtering and a predicted image predicted from the low dynamic range image. It applies the sparse FIR filter to filter pixel values in a portion of the predicted image based at least in part on the control map of false contour filtering, and it reconstructs a version of the original HDR image based at least in part on the portion of the predicted image as filtered by the FIR filter.
Abstract:
The precision of up-sampling operations in a layered coding system is preserved when operating on video data with high bit-depth. In response to bit-depth requirements of the video coding or decoding system, scaling and rounding parameters are determined for a separable up-scaling filter. Input data are first filtered across a first spatial direction using a first rounding parameter to generate first up-sampled data. First intermediate data are generated by scaling the first up-sampled data using a first shift parameter. The intermediate data are then filtered across a second spatial direction using a second rounding parameter to generate second up-sampled data. Second intermediate data are generated by scaling the second up-sampled data using a second shift parameter. Final up-sampled data may be generated by clipping the second intermediate data.
Abstract:
In a decoder, a processor extracts a control map of false contour filtering from a part of a multi-layer video signal that includes a low dynamic range image mapped from an original high-dynamic range (HDR) image. It determines one or more filter parameters for a sparse finite-impulse-response (FIR) filter, where the one or more filter parameters relate to at least in part on the control map of false contour filtering and a predicted image predicted from the low dynamic range image. It applies the sparse FIR filter to filter pixel values in a portion of the predicted image based at least in part on the control map of false contour filtering, and it reconstructs a version of the original HDR image based at least in part on the portion of the predicted image as filtered by the FIR filter.
Abstract:
Implementations are provided that relate, for example, to view tiling in video encoding and decoding. A particular method includes accessing a video picture that includes multiple pictures combined into a single picture (826), accessing information indicating how the multiple pictures in the accessed video picture are combined (806, 808, 822), decoding the video picture to provide a decoded representation of at least one of the multiple pictures (824, 826), and providing the accessed information and the decoded video picture as output (824, 826). Some other implementations format or process the information that indicates how multiple pictures included in a single video picture are combined into the single video picture, and format or process an encoded representation of the combined multiple pictures.
Abstract:
Implementations are provided that relate, for example, to view tiling in video encoding and decoding. A particular method includes accessing a video picture that includes multiple pictures combined into a single picture (826), accessing information indicating how the multiple pictures in the accessed video picture are combined (806, 808, 822), decoding the video picture to provide a decoded representation of at least one of the multiple pictures (824, 826), and providing the accessed information and the decoded video picture as output (824, 826). Some other implementations format or process the information that indicates how multiple pictures included in a single video picture are combined into the single video picture, and format or process an encoded representation of the combined multiple pictures.
Abstract:
Video data are coded in a coding-standard layered bit stream. Given a base layer (BL) and one or more enhancement layer (EL) signals, the BL signal is coded into a coded BL stream using a BL encoder which is compliant to a first coding standard. In response to the BL signal and the EL signal, a reference processing unit (RPU) determines RPU processing parameters. In response to the RPU processing parameters and the BL signal, the RPU generates an inter-layer reference signal. Using an EL encoder which is compliant to a second coding standard, the EL signal is coded into a coded EL stream, where the encoding of the EL signal is based at least in part on the inter-layer reference signal. Receivers with an RPU and video decoders compliant to both the first and the second coding standards may decode both the BL and the EL coded streams.
Abstract:
Video data are coded in a coding-standard layered bit stream. Given a base layer (BL) and one or more enhancement layer (EL) signals, the BL signal is coded into a coded BL stream using a BL encoder which is compliant to a first coding standard. In response to the BL signal and the EL signal, a reference processing unit (RPU) determines RPU processing parameters. In response to the RPU processing parameters and the BL signal, the RPU generates an inter-layer reference signal. Using an EL encoder which is compliant to a second coding standard, the EL signal is coded into a coded EL stream, where the encoding of the EL signal is based at least in part on the inter-layer reference signal. Receivers with an RPU and video decoders compliant to both the first and the second coding standards may decode both the BL and the EL coded streams.
Abstract:
A multi-layer video system has a first layer encoder that encodes a first layer of video information, at least one second layer encoder that encodes at least one second layer of video information, and an encoder side reference processing unit (RPU) that estimates one or more of an optimal filter or an optimal process that applies on a reference picture that is reconstructed from the first video information layer, and processes a current picture of the second video information layer, based on a correlation between the first layer reconstructed reference picture. The correlation relates to a complexity characteristic that scaleably corresponds to the first video information layer reconstructed reference picture and the second video information layer current picture. A scalable video bitstream is outputted, which may be decoded by a compatible decoder. A decoder side RPU and the encoder side RPU function as an RPU pair.
Abstract:
A visual dynamic range (VDR) coding system creates a sequence of VDR prediction images using corresponding standard dynamic range (SDR) images and a prediction function. For each prediction image, an encoder identifies one or more areas within the prediction image suitable for post-prediction filtering. For each identified post-prediction area, a post-prediction filtering mode is selected among one or more post-prediction filtering modes. The selected post-prediction filtering mode is applied to output a filtered prediction image. Information related to the post-prediction filtering areas and the selected corresponding post-prediction filtering modes may be communicated to a receiver (e.g., as metadata) for guided post-prediction filtering. Example post-prediction filtering modes that use low-pass averaging filtering or adaptive linear interpolation are also described.
Abstract:
An inter-layer reference picture is generated either by considering an inverse mapped signal from a base layer, a temporal signal from an enhancement layer, or a combination of both.