-
公开(公告)号:US12284360B2
公开(公告)日:2025-04-22
申请号:US18490613
申请日:2023-10-19
Inventor: Yang Zhang , Mingyang Song , Christopher Richard Schroers , Tunc Ozan Aydin , Yuanyi Xue , Scott Labrozzi
IPC: H04N7/12 , H04N19/146 , H04N19/159
Abstract: In some embodiments, a method trains a first parameter of a differentiable proxy codec to encode source content based on a first loss between first compressed source content and second compressed source content that is output by a target codec. A pre-processor pre-processes a source image to output a pre-processed source image, the pre-processing being based on a second parameter. The differentiable proxy codec encodes the pre-processed source image into a compressed pre-processed source image based on the first parameter. The method determines a second loss between the source image and the compressed pre-processed source image and determines an adjustment to the first parameter based on the second loss. The adjustment is used to adjust the second parameter of the pre-processor based on the second loss.
-
公开(公告)号:US12225252B2
公开(公告)日:2025-02-11
申请号:US18179281
申请日:2023-03-06
Inventor: Chen Liu , Wenhao Zhang , Scott Labrozzi , Yuanyi Xue , Xuchang Huangfu , Xiaobo Liu
IPC: H04N21/238 , H04N21/234 , H04N21/2662 , H04N21/845
Abstract: In some embodiments, a method generates a first representation of a first relationship between bitrate and quality based on first features of a first portion of a video. Also, the method generates a second representation of a second relationship between bitrate and quality based on second features of a second portion of a video. The first representation is analyzed to determine a first list of bitrates for the first portion of video and the second representation is analyzed to determine a second list of bitrates for the second portion of video. The first list of bitrates is different from the second list of bitrates. The method outputs the first list of bitrates for use encoding the first portion of video and the second list of bitrates for use encoding the second portion of video.
-
公开(公告)号:US11012718B2
公开(公告)日:2021-05-18
申请号:US16557920
申请日:2019-08-30
Applicant: Disney Enterprises, Inc.
Inventor: Christopher Schroers , Joaquim Campos , Abdelaziz Djelouah , Yuanyi Xue , Erika Varis Doggett , Jared McPhillen , Scott Labrozzi
IPC: H04N19/91 , G06K9/00 , H04N19/12 , H04N19/124 , G06N3/08
Abstract: Systems and methods are disclosed for generating a latent space residual. A computer-implemented method may use a computer system that includes non-transient electronic storage, a graphical user interface, and one or more physical computer processors. The computer-implemented method may include: obtaining a target frame, obtaining a reconstructed frame, encoding the target frame into a latent space to generate a latent space target frame, encoding the reconstructed frame into the latent space to generate a latent space reconstructed frame, and generating a latent space residual based on the latent space target frame and the latent space reconstructed frame.
-
公开(公告)号:US20210067808A1
公开(公告)日:2021-03-04
申请号:US16557920
申请日:2019-08-30
Applicant: Disney Enterprises, Inc.
Inventor: Christopher Schroers , Joaquim Campos , Abdelaziz Djelouah , Yuanyi Xue , Erika Varis Doggett , Jared McPhillen , Scott Labrozzi
IPC: H04N19/91 , G06K9/00 , H04N19/124 , G06N3/08 , H04N19/12
Abstract: Systems and methods are disclosed for generating a latent space residual. A computer-implemented method may use a computer system that includes non-transient electronic storage, a graphical user interface, and one or more physical computer processors. The computer-implemented method may include: obtaining a target frame, obtaining a reconstructed frame, encoding the target frame into a latent space to generate a latent space target frame, encoding the reconstructed frame into the latent space to generate a latent space reconstructed frame, and generating a latent space residual based on the latent space target frame and the latent space reconstructed frame.
-
公开(公告)号:US12126879B2
公开(公告)日:2024-10-22
申请号:US17861063
申请日:2022-07-08
Applicant: Disney Enterprises, Inc.
Inventor: Scott Labrozzi , William B. May, Jr.
IPC: H04N21/845 , H04N21/2383
CPC classification number: H04N21/8455 , H04N21/2383
Abstract: A system includes a computing platform having processing hardware, and a memory storing software code. The software code is executed to receive content having a sequence of content segments, and marker data identifying a location within the sequence, identify, using the content and the marker data, segment boundaries of a content segment containing the location, determine, using the location and the segment boundaries, whether the location is situated within a predetermined interval of one of the segment boundaries, and re-encode a subsection of the sequence to produce a new segment boundary at the location. When the location is not situated within the predetermined interval, the subsection of the sequence includes the content segment containing the location. When the location is situated within the predetermined interval, the subsection of the sequence includes the content segment containing the location and a content segment adjoining the content segment containing the location.
-
公开(公告)号:US20230379475A1
公开(公告)日:2023-11-23
申请号:US18230409
申请日:2023-08-04
Inventor: Christopher Richard Schroers , Roberto Gerson de Albuquerque Azevedo , Nicholas David Gregory , Yuanyi Xue , Scott Labrozzi , Abdelaziz Djelouah
IPC: H04N19/147 , H04N19/132 , H04N19/184 , G06N3/08 , G06T3/40 , G06T9/00
CPC classification number: H04N19/147 , H04N19/132 , H04N19/184 , G06N3/08 , G06T3/4046 , G06T9/002
Abstract: A system includes a machine learning (ML) model-based video downsampler configured to receive an input video sequence having a first display resolution, and to map the input video sequence to a lower resolution video sequence having a second display resolution lower than the first display resolution. The system also includes a neural network-based (NN-based) proxy video codec configured to transform the lower resolution video sequence into a decoded proxy bitstream. In addition, the system includes an upsampler configured to produce an output video sequence using the decoded proxy bitstream.
-
公开(公告)号:US20220329876A1
公开(公告)日:2022-10-13
申请号:US17704692
申请日:2022-03-25
Inventor: Abdelaziz Djelouah , Leonhard Markus Helminger , Roberto Gerson de Albuquerque Azevedo , Scott Labrozzi , Christopher Richard Schroers , Yuanyi Xue
Abstract: A system processing hard e executes a machine learning (ML) model-based video compression encoder to receive uncompressed video content and corresponding motion compensated video content, compare the uncompressed and motion compensated video content to identify an image space residual, transform the image space residual to a latent space representation of the uncompressed video content, and transform, using a trained image compression ML model, the motion compensated video content to a latent space representation of the motion compensated video content. The ML model-based video compression encoder further encodes the latent space representation of the image space residual to produce an encoded latent residual, encodes, using the trained image compression ML model, the latent space representation of the motion compensated video content to produce an encoded latent video content, and generates, using the encoded latent residual and the encoded latent video content, a compressed video content corresponding to the uncompressed video content.
-
公开(公告)号:US11335034B2
公开(公告)日:2022-05-17
申请号:US16249861
申请日:2019-01-16
Applicant: Disney Enterprises, Inc.
Inventor: Christopher Schroers , Erika Doggett , Stephan Marcel Mandt , Jared McPhillen , Scott Labrozzi , Romann Weber , Mauro Bamert
Abstract: Systems and methods for predicting a target set of pixels are disclosed. In one embodiment, a method may include obtaining target content. The target content may include a target set of pixels to be predicted. The method may also include convolving the target set of pixels to generate an estimated set of pixels. The method may include matching a second set of pixels in the target content to the target set of pixels. The second set of pixels may be within a distance from the target set of pixels. The method may include refining the estimated set of pixels to generate a refined set of pixels using a second set of pixels in the target content.
-
公开(公告)号:US20190333190A1
公开(公告)日:2019-10-31
申请号:US16167388
申请日:2018-10-22
Applicant: Disney Enterprises, Inc.
Inventor: Christopher Schroers , Mauro Bamert , Erika Doggett , Jared McPhillen , Scott Labrozzi , Romann Weber
Abstract: Systems and methods for distortion removal at multiple quality levels are disclosed. In one embodiment, a method may include receiving training content. The training content may include original content, reconstructed content, and training distortion quality levels corresponding to the reconstructed content. The reconstructed content may be derived from distorted original content. The method may also include training distortion quality levels corresponding to the reconstructed content. The method may further include receiving an initial distortion removal model. The method may include generating a conditioned distortion removal model by training the initial distortion removal model using the training content. The method may further include storing the conditioned distortion removal model.
-
公开(公告)号:US12278969B2
公开(公告)日:2025-04-15
申请号:US18230409
申请日:2023-08-04
Inventor: Christopher Richard Schroers , Roberto Gerson de Albuquerque Azevedo , Nicholas David Gregory , Yuanyi Xue , Scott Labrozzi , Abdelaziz Djelouah
IPC: H04N19/147 , G06N3/08 , G06T3/4046 , G06T9/00 , H04N19/132 , H04N19/184
Abstract: A system includes a machine learning (ML) model-based video downsampler configured to receive an input video sequence having a first display resolution, and to map the input video sequence to a lower resolution video sequence having a second display resolution lower than the first display resolution. The system also includes a neural network-based (NN-based) proxy video codec configured to transform the lower resolution video sequence into a decoded proxy bitstream. In addition, the system includes an upsampler configured to produce an output video sequence using the decoded proxy bitstream.
-
-
-
-
-
-
-
-
-