TECHNIQUES FOR COMPONENT-BASED IMAGE PREPROCESSING

    公开(公告)号:US20230186435A1

    公开(公告)日:2023-06-15

    申请号:US17551087

    申请日:2021-12-14

    Applicant: NETFLIX, INC.

    Abstract: In various embodiments, an image preprocessing application preprocesses images. To preprocess an image, the image preprocessing application executes a trained machine learning model on first data corresponding to both the image and a first set of components of a luma-chroma color space to generate first preprocessed data. The image preprocessing application executes at least a different trained machine learning model or a non-machine learning algorithm on second data corresponding to both the image and a second set of components of the luma-chroma color space to generate second preprocessed data. Subsequently, the image preprocessing application aggregates at least the first preprocessed data and the second preprocessed data to generate a preprocessed image.

    TECHNIQUES FOR GENERATING A PERCEPTUAL QUALITY MODEL FOR PREDICTING VIDEO QUALITY ACROSS DIFFERENT VIEWING PARAMETERS

    公开(公告)号:US20240119575A1

    公开(公告)日:2024-04-11

    申请号:US17937024

    申请日:2022-09-30

    Applicant: NETFLIX, INC.

    CPC classification number: G06T7/0002 G06N20/10 G06T2207/10016

    Abstract: In various embodiments, a training application generates a trained perceptual quality model that estimates perceived video quality for reconstructed video. The training application computes a pixels-per-degree value based on a normalized viewing distance and a display resolution. The training application computes a set of feature values corresponding to a set of visual quality metrics based on a reconstructed video sequence, a source video sequence, and the pixels-per-degree value. The training application executes a machine learning algorithm on the first set of feature values to generate the trained perceptual quality model. The trained perceptual quality model maps a particular set of feature values corresponding to the set of visual quality metrics to a particular perceptual quality score.

    TECHNIQUES FOR RECONSTRUCTING DOWNSCALED VIDEO CONTENT

    公开(公告)号:US20230143389A1

    公开(公告)日:2023-05-11

    申请号:US17981292

    申请日:2022-11-04

    Applicant: NETFLIX, INC.

    CPC classification number: H04N19/436 H04N19/30

    Abstract: In various embodiments an endpoint application reconstructs downscaled videos. The endpoint application accesses metadata associated with a portion of a downscaled video that has a first resolution and was generated using a trained downscaling convolutional neural network (CNN). The endpoint application determines, based on the metadata, an upscaler that should be used when upscaling the portion of the downscaled video. The endpoint application executes the upscaler on the portion of the downscaled video to generate a portion of a reconstructed video that is accessible for playback and has a second resolution that is greater than the first resolution.

    MACHINE LEARNING TECHNIQUES FOR VIDEO DOWNSAMPLING

    公开(公告)号:US20240233076A1

    公开(公告)日:2024-07-11

    申请号:US18617162

    申请日:2024-03-26

    Applicant: NETFLIX, INC.

    CPC classification number: G06T3/4046 G06N3/084 G06T9/002

    Abstract: In various embodiments, a training application trains a convolutional neural network to downsample images in a video encoding pipeline. The convolution neural network includes at least two residual blocks and is associated with a downsampling factor. The training application executes the convolutional neural network on a source image to generate a downsampled image. The training application then executes an upsampling algorithm on the downsampled image to generate a reconstructed image having the same resolution as the source image. The training application computes a reconstruction error based on the reconstructed image and the source image. The training application updates at least one parameter of the convolutional neural network based on the reconstruction error to generate a trained convolutional neural network. Advantageously, the trained convolution neural network can be implemented in a video encoding pipeline to mitigate visual quality reductions typically experienced with conventional video encoding pipelines that implement conventional downsampling techniques.

    MACHINE LEARNING TECHNIQUES FOR VIDEO DOWNSAMPLING

    公开(公告)号:US20220198607A1

    公开(公告)日:2022-06-23

    申请号:US17133206

    申请日:2020-12-23

    Applicant: NETFLIX, INC.

    Abstract: In various embodiments, a training application trains a convolutional neural network to downsample images in a video encoding pipeline. The convolution neural network includes at least two residual blocks and is associated with a downsampling factor. The training application executes the convolutional neural network on a source image to generate a downsampled image. The training application then executes an upsampling algorithm on the downsampled image to generate a reconstructed image having the same resolution as the source image. The training application computes a reconstruction error based on the reconstructed image and the source image. The training application updates at least one parameter of the convolutional neural network based on the reconstruction error to generate a trained convolutional neural network. Advantageously, the trained convolution neural network can be implemented in a video encoding pipeline to mitigate visual quality reductions typically experienced with conventional video encoding pipelines that implement conventional downsampling techniques.

    TECHNIQUES FOR JOINTLY TRAINING A DOWNSCALER AND AN UPSCALER FOR VIDEO STREAMING

    公开(公告)号:US20230144735A1

    公开(公告)日:2023-05-11

    申请号:US17981281

    申请日:2022-11-04

    Applicant: NETFLIX, INC.

    CPC classification number: G06T3/4046

    Abstract: In various embodiments a training application trains convolutional neural networks (CNNs) to reduce reconstruction errors. The training application executes a first CNN on a source image having a first resolution to generate a downscaled image having a second resolution. The training application executes a second CNN on the downscaled image to generate a reconstructed image having the first resolution. The training application computes a reconstruction error based on the reconstructed image and the source image. The training application updates a first learnable parameter value included in the first CNN based on the reconstruction error to generate at least a partially trained downscaling CNN. The training application updates a second learnable parameter included in the second CNN based on the reconstruction error to generate at least a partially trained upscaling CNN.

Patent Agency Ranking