摘要:
In a collaborative video processing method and system, a high resolution video input is optionally downscaled to a low resolution video using a down-sampling filter, followed by an end-to-end video coding system to encode the low resolution video for streaming over the Internet. The original high resolution is obtained at the client end by upscaling the low resolution video using a deep learning based high resolution scaling model, which can be trained in a pre-defined progressive order with low resolution videos having different compression parameters and downscaling factors.
摘要:
A method and apparatus for the single input multiple output based media adaptation is disclosed. In one embodiment, such adaption is performed in two steps. On step 1, content correlation between different compression schemes is used to perform the inter-format adaptation of a stream of a compression format to an intermediate output stream of another compression scheme with the same quality level. On step 2, content correlation between different quality levels is used to perform the intra-format adaptation of the intermediate output stream to multiple output streams at different quality levels with the same compression format. In one embodiment, content correlation is used to limit the search for mode candidates when performing both steps.
摘要:
Methods and apparatus are provided for motion compensation with a smooth reference frame in bit depth scalability. An apparatus includes an encoder for encoding picture data for at least a portion of a picture by generating an inter-layer residue prediction for the portion using an inverse tone mapping operation performed in the pixel domain for bit depth scalability. The inverse tone mapping operation is shifted from a residue domain to the pixel domain.
摘要:
A learned image compression system increases compression efficiency by using a novel conditional context model with embedded autoregressive neighbors and hyperpriors, which can accurately estimate the entropy rate for rate distortion optimization. Generalized Divisive Normalization (GDN) in Residual Neural Network is used in the encoder and decoder networks for fast convergence rate and efficient feature representation.
摘要:
A method and apparatus for use of an adaptive prediction resolution in video coding is disclosed. One or more adaptive flags are provided in one or more syntaxes of a prediction scheme in encoding and decoding video signals. In one embodiment, the adaptive flags are suitable to indicate whether a subset or a full set of intra prediction modes is used at a slice level or a coding block level. In one embodiment, the adaptive flags are suitable to indicate whether integer or fractional motion vector resolution is used at a slice level or a coding block level.
摘要:
A video processing system includes prediction primary transforms, quantization, entropy coding and filtering configured to receive and compress video information and output compressed video information corresponding to the received video information. The compressed video information comprising prediction mode, transform block size, quantization parameter, and filtering type. The video processing system also includes a secondary transform configured to receive and compress the compressed video information. The video processing system also includes a quantization stage configured to receive and compress the transformed coefficients. The video processing system also includes an entropy coding stage configured to convert the compressed video information into binary bits. The video processing system also includes a filtering stage configured to improve the reconstructed video information for better prediction.