-
公开(公告)号:US20250071299A1
公开(公告)日:2025-02-27
申请号:US18455298
申请日:2023-08-24
Applicant: GOOGLE LLC
Inventor: Yao-Chung Lin , Jingning Han , Yilin Wang , Yeping Su
IPC: H04N19/196 , H04N19/105 , H04N19/176 , H04N19/80
Abstract: Encoding using media compression and processing for machine-learning-based quality metrics includes generating encoded frame data by encoding a current frame from an input video using a neural-network-based video quality model, which includes identifying optimal encoding parameters for encoding a current block, wherein the optimal encoding parameters minimize a rate-distortion optimization cost function, which includes using a gradient value for the current block obtained from a neural-network-based video quality model generated gradient map obtained from the neural-network-based video quality model for the current frame, obtaining a restoration filtered reconstructed frame by restoration filtering a reconstructed frame, obtained by decoding the encoded frame data, using the neural-network-based video quality model generated gradient map obtained for the reconstructed frame.
-
公开(公告)号:US11363262B1
公开(公告)日:2022-06-14
申请号:US17121349
申请日:2020-12-14
Applicant: GOOGLE LLC
Inventor: Ching Yin Derek Pang , Yeping Su , Yao-Chung Lin
IPC: H04N19/105 , H04N19/176 , H04N19/177 , H04N19/114 , H04N19/159
Abstract: A first aspect is a method for coding a group of pictures (GOP) that includes frames of a video. The method includes encoding, at least some of the frames of the GOP, using a first encoding pass to obtain encoding statistics; obtaining, using the encoding statistics, respective temporal dependency likelihoods (TDLs) for the at least some of the frames of the GOP, where the respective TDLs indicate contributions that the at least some of the frames make in reducing prediction errors of the GOP; obtaining a reference frame based on the respective TDLs; and using the reference frame in encoding at least some of the frames of the GOP in a second encoding pass.
-
公开(公告)号:US20230068026A1
公开(公告)日:2023-03-02
申请号:US17462591
申请日:2021-08-31
Applicant: Google LLC
Inventor: Ching Yin Derek Pang , Kyrah Felder , Akshay Gadde , Paul Wilkins , Cheng Chen , Yao-Chung Lin
Abstract: Methods and systems for encoder parameter setting optimization. A media item to be provided to one or more users of a platform is identified. The media item is associated with a media class. An indication of the identified media item is provided as input to a first machine learning model. The first machine learning model is trained to predict, for a given media item, a set of encoder parameter settings that satisfy a performance criterion in view of a respective media class associated with the given media item. One or more outputs of the first machine learning model are obtained. The one or more obtained outputs include encoder data identifying one or more sets of encoder parameter settings and, for each of the sets of encoder parameter settings, an indication of a level of confidence that a respective set of encoder parameter settings satisfies the performance criterion in view of the media class associated with the identified media item. The identified media item is encoded using the respective set of encoding parameter settings associated with the level of confidence that satisfies a confidence criterion.
-
公开(公告)号:US20220191480A1
公开(公告)日:2022-06-16
申请号:US17121349
申请日:2020-12-14
Applicant: GOOGLE LLC
Inventor: Ching Yin Derek Pang , Yeping Su , Yao-Chung Lin
IPC: H04N19/114 , H04N19/105 , H04N19/159 , H04N19/177 , H04N19/176
Abstract: A first aspect is a method for coding a group of pictures (GOP) that includes frames of a video. The method includes encoding, at least some of the frames of the GOP, using a first encoding pass to obtain encoding statistics; obtaining, using the encoding statistics, respective temporal dependency likelihoods (TDLs) for the at least some of the frames of the GOP, where the respective TDLs indicate contributions that the at least some of the frames make in reducing prediction errors of the GOP; obtaining a reference frame based on the respective TDLs; and using the reference frame in encoding at least some of the frames of the GOP in a second encoding pass.
-
公开(公告)号:US20180084254A1
公开(公告)日:2018-03-22
申请号:US15784516
申请日:2017-10-16
Applicant: GOOGLE LLC
Inventor: Yao-Chung Lin , Sam John , Thierry Foucu , Sasi Inguva
IPC: H04N19/134 , H04N19/142
CPC classification number: H04N19/134 , H04N19/00472 , H04N19/00587 , H04N19/00921 , H04N19/142 , H04N19/40 , H04N19/87
Abstract: Systems and methods for processing a video sequence are disclosed. In accordance with some implementations, The method includes determining a first set of shot-change locations associated with a cut transition in a video sequence based on variance data defined in an encoding log associated with at least one previously encoded video frame. The method further includes determining a second set of shot-change locations associated with a fading transition in the video sequence based on the variance data defined in the encoding log associated with the at least one previously encoded video frame. The method also includes processing the video sequence based on at least a portion of the first set of shot-change locations and the second set of shot-change locations.
-
公开(公告)号:US20240137400A1
公开(公告)日:2024-04-25
申请号:US18403659
申请日:2024-01-03
Applicant: Google LLC
Inventor: Ching Yin Derek Pang , Kyrah Felder , Akshay Gadde , Paul Wilkins , Cheng Chen , Yao-Chung Lin
CPC classification number: H04L65/70 , G06N20/00 , H04L65/61 , H04L65/80 , H04N21/251
Abstract: A media item to be provided to users of a platform is identified. The media item is associated with a media class of one or more media classes. An indication of the media item is provided as input to a machine learning model trained based on historical encoding data to predict, for a given media item, a set of encoder parameter settings that satisfy a performance criterion in view of a respective media class of the given media item. The historical encoding data includes a prior set of encoder parameter settings that satisfied the performance criterion with respect to a prior media item associated with the respective class. Encoder parameter settings that satisfy the performance criterion in view of the media class is determined based on an output of the model. The media item is caused to be encoded using the determined encoder parameter settings.
-
公开(公告)号:US11870833B2
公开(公告)日:2024-01-09
申请号:US17462591
申请日:2021-08-31
Applicant: Google LLC
Inventor: Ching Yin Derek Pang , Kyrah Felder , Akshay Gadde , Paul Wilkins , Cheng Chen , Yao-Chung Lin
CPC classification number: H04L65/70 , G06N20/00 , H04L65/61 , H04L65/80 , H04N21/251
Abstract: Methods and systems for encoder parameter setting optimization. A media item to be provided to one or more users of a platform is identified. The media item is associated with a media class. An indication of the identified media item is provided as input to a first machine learning model. The first machine learning model is trained to predict, for a given media item, a set of encoder parameter settings that satisfy a performance criterion in view of a respective media class associated with the given media item. One or more outputs of the first machine learning model are obtained. The one or more obtained outputs include encoder data identifying one or more sets of encoder parameter settings and, for each of the sets of encoder parameter settings, an indication of a level of confidence that a respective set of encoder parameter settings satisfies the performance criterion in view of the media class associated with the identified media item. The identified media item is encoded using the respective set of encoding parameter settings associated with the level of confidence that satisfies a confidence criterion.
-
公开(公告)号:US10291916B2
公开(公告)日:2019-05-14
申请号:US15784516
申请日:2017-10-16
Applicant: GOOGLE LLC
Inventor: Yao-Chung Lin , Sam John , Thierry Foucu , Sasi Inguva
IPC: H04N19/134 , H04N19/142 , H04N7/12 , H04N5/14 , H04B1/66 , G06K9/36 , H04N11/04 , H04N11/02 , H04N19/87 , H04N19/40 , H04N19/51
Abstract: Systems and methods for processing a video sequence are disclosed. In accordance with some implementations, The method includes determining a first set of shot-change locations associated with a cut transition in a video sequence based on variance data defined in an encoding log associated with at least one previously encoded video frame. The method further includes determining a second set of shot-change locations associated with a fading transition in the video sequence based on the variance data defined in the encoding log associated with the at least one previously encoded video frame. The method also includes processing the video sequence based on at least a portion of the first set of shot-change locations and the second set of shot-change locations.
-
-
-
-
-
-
-