-
公开(公告)号:US20240137400A1
公开(公告)日:2024-04-25
申请号:US18403659
申请日:2024-01-03
Applicant: Google LLC
Inventor: Ching Yin Derek Pang , Kyrah Felder , Akshay Gadde , Paul Wilkins , Cheng Chen , Yao-Chung Lin
CPC classification number: H04L65/70 , G06N20/00 , H04L65/61 , H04L65/80 , H04N21/251
Abstract: A media item to be provided to users of a platform is identified. The media item is associated with a media class of one or more media classes. An indication of the media item is provided as input to a machine learning model trained based on historical encoding data to predict, for a given media item, a set of encoder parameter settings that satisfy a performance criterion in view of a respective media class of the given media item. The historical encoding data includes a prior set of encoder parameter settings that satisfied the performance criterion with respect to a prior media item associated with the respective class. Encoder parameter settings that satisfy the performance criterion in view of the media class is determined based on an output of the model. The media item is caused to be encoded using the determined encoder parameter settings.
-
公开(公告)号:US11870833B2
公开(公告)日:2024-01-09
申请号:US17462591
申请日:2021-08-31
Applicant: Google LLC
Inventor: Ching Yin Derek Pang , Kyrah Felder , Akshay Gadde , Paul Wilkins , Cheng Chen , Yao-Chung Lin
CPC classification number: H04L65/70 , G06N20/00 , H04L65/61 , H04L65/80 , H04N21/251
Abstract: Methods and systems for encoder parameter setting optimization. A media item to be provided to one or more users of a platform is identified. The media item is associated with a media class. An indication of the identified media item is provided as input to a first machine learning model. The first machine learning model is trained to predict, for a given media item, a set of encoder parameter settings that satisfy a performance criterion in view of a respective media class associated with the given media item. One or more outputs of the first machine learning model are obtained. The one or more obtained outputs include encoder data identifying one or more sets of encoder parameter settings and, for each of the sets of encoder parameter settings, an indication of a level of confidence that a respective set of encoder parameter settings satisfies the performance criterion in view of the media class associated with the identified media item. The identified media item is encoded using the respective set of encoding parameter settings associated with the level of confidence that satisfies a confidence criterion.
-
公开(公告)号:US20220303583A1
公开(公告)日:2022-09-22
申请号:US17834972
申请日:2022-06-08
Applicant: GOOGLE LLC
Inventor: James Bankoski , Yaowu Xu , Paul Wilkins
IPC: H04N19/80 , H04N19/105 , H04N19/139 , H04N19/176 , H04N19/172 , H04N19/61 , H04N19/107 , H04N19/117 , H04N19/127 , H04N19/179 , H04N19/23 , H04N19/527
Abstract: Video coding using constructed reference frames may include generating, by a processor in response to instructions stored on a non-transitory computer readable medium, a reconstructed video. Generating the reconstructed video may include receiving an encoded bitstream. Video coding using constructed reference frames may include generating a reconstructed non-showable reference frame. Generating the reconstructed non-showable reference frame may include decoding a first encoded frame from the encoded bitstream. Video coding using constructed reference frames may include generating a reconstructed frame. Generating the reconstructed frame may include decoding a second encoded frame from the encoded bitstream using the reconstructed non-showable reference frame as a reference frame. Video coding using constructed reference frames may include including the reconstructed frame in the reconstructed video and outputting the reconstructed video.
-
公开(公告)号:US11240498B2
公开(公告)日:2022-02-01
申请号:US16827864
申请日:2020-03-24
Applicant: GOOGLE LLC
Inventor: Eric Ameres , James Bankoski , Scott Lavarnway , Yaowu Xu , Dan Miller , Adrian Grange , Paul Wilkins
IPC: H04N19/117 , H04N19/52 , H04N19/139 , H04N19/172 , H04N19/13 , H04N19/129 , H04N19/114 , H04N19/14 , H04N19/152 , H04N19/154 , H04N19/156 , H04N19/177 , H04N19/523
Abstract: Video coding may include identifying an input frame from an input video stream, generating, by a processor, an output bitstream by encoding the input frame, wherein encoding the input frame includes dividing the input frame into at least a first contiguous area of the input frame and a second contiguous area of the input frame, generating first encoded data by encoding the first contiguous area of the input frame, generating second encoded data by encoding the second contiguous area of the input frame, wherein encoding the second contiguous area of the input frame is performed independently of encoding the first contiguous area of the input frame, and including the first encoded data and the second encoded data in the output bitstream, and outputting the output bitstream.
-
15.
公开(公告)号:US10390026B2
公开(公告)日:2019-08-20
申请号:US15081082
申请日:2016-03-25
Applicant: GOOGLE LLC
Inventor: Yaowu Xu , James Bankoski , Paul Wilkins , Hui Su
IPC: H04N19/174 , H04N19/159 , H04N19/129 , H04N19/593 , H04N19/103 , H04N19/44 , H04N19/503 , H04N19/65 , H04N19/88 , H04N19/91 , H04N19/192
Abstract: Video data streams can be encoded and decoded using inter or intra prediction. The blocks of a frame can be processed based on depth, from the lowest level sub-blocks to the highest level large blocks, and divided into groups of blocks to be inter predicted, blocks having sub-blocks that are to be inter predicted and sub-blocks that are to be intra predicted, and blocks to be intra predicted, and the blocks to be inter predicted are encoded first, the blocks having sub-blocks to be inter predicted and intra predicted encoded second, and the blocks to be intra predicted encoded last. The availability of data from the inter predicted blocks can improve the performance of intra prediction over processing the blocks in the scan order since more pixel data is available for intra prediction of some blocks.
-
公开(公告)号:US20250080775A1
公开(公告)日:2025-03-06
申请号:US18952696
申请日:2024-11-19
Applicant: GOOGLE LLC
Inventor: James Bankoski , Yaowu Xu , Paul Wilkins
IPC: H04N19/80 , H04N19/105 , H04N19/107 , H04N19/117 , H04N19/127 , H04N19/139 , H04N19/172 , H04N19/176 , H04N19/179 , H04N19/23 , H04N19/527 , H04N19/61
Abstract: Video coding using constructed reference frames may include generating, by a processor in response to instructions stored on a non-transitory computer readable medium, a reconstructed video. Generating the reconstructed video may include receiving an encoded bitstream. Video coding using constructed reference frames may include generating a reconstructed non-showable reference frame. Generating the reconstructed non-showable reference frame may include decoding a first encoded frame from the encoded bitstream. Video coding using constructed reference frames may include generating a reconstructed frame. Generating the reconstructed frame may include decoding a second encoded frame from the encoded bitstream using the reconstructed non-showable reference frame as a reference frame. Video coding using constructed reference frames may include including the reconstructed frame in the reconstructed video and outputting the reconstructed video.
-
公开(公告)号:US10951885B2
公开(公告)日:2021-03-16
申请号:US16117563
申请日:2018-08-30
Applicant: GOOGLE LLC
Inventor: Jingning Han , Paul Wilkins , Yaowu Xu
IPC: H04N19/105 , H04N19/182 , H04N19/167 , H04N19/177 , H04N19/117 , H04N19/137
Abstract: A method for using an alternate frame reference (ARF) includes selecting an anchor frame and video frames, where the anchor frame includes an anchor block, and the anchor block includes an anchor pixel; identifying, for the anchor block of the anchor frame, respective reference blocks in the video frames; determining, for the anchor pixel and using an anchor patch, respective distances between the anchor pixel and respective co-located reference pixels of the respective reference blocks, where the anchor patch includes anchor patch pixels, and a respective distance, of the respective distances, between the anchor pixel and a respective co-located reference pixel is determined using the anchor patch pixels and co-located reference pixels; determining, using the respective distances, respective weights; determining, using the respective weights, an ARF pixel that is co-located with the anchor pixel; and encoding, in a compressed bitstream, the ARF.
-
公开(公告)号:US20200077083A1
公开(公告)日:2020-03-05
申请号:US16117563
申请日:2018-08-30
Applicant: GOOGLE LLC
Inventor: Jingning Han , Paul Wilkins , Yaowu Xu
IPC: H04N19/105 , H04N19/182 , H04N19/167 , H04N19/177 , H04N19/117 , H04N19/137
Abstract: A method for using an alternate frame reference (ARF) includes selecting an anchor frame and video frames, where the anchor frame includes an anchor block, and the anchor block includes an anchor pixel; identifying, for the anchor block of the anchor frame, respective reference blocks in the video frames; determining, for the anchor pixel and using an anchor patch, respective distances between the anchor pixel and respective co-located reference pixels of the respective reference blocks, where the anchor patch includes anchor patch pixels, and a respective distance, of the respective distances, between the anchor pixel and a respective co-located reference pixel is determined using the anchor patch pixels and co-located reference pixels; determining, using the respective distances, respective weights; determining, using the respective weights, an ARF pixel that is co-located with the anchor pixel; and encoding, in a compressed bitstream, the ARF.
-
19.
公开(公告)号:US20190342559A1
公开(公告)日:2019-11-07
申请号:US16512478
申请日:2019-07-16
Applicant: GOOGLE LLC
Inventor: Yaowu Xu , James Bankoski , Paul Wilkins , Hui Su
IPC: H04N19/174 , H04N19/91 , H04N19/593 , H04N19/129 , H04N19/159 , H04N19/65 , H04N19/88 , H04N19/103 , H04N19/44 , H04N19/503
Abstract: Video data streams can be encoded and decoded using inter or intra prediction. The blocks of a frame can be processed based on depth, from the lowest level sub-blocks to the highest level large blocks, and divided into groups of blocks that are inter predicted, blocks having sub-blocks that are inter predicted and sub-blocks that are intra predicted, and blocks that are intra predicted. The inter predicted blocks are encoded or decoded first, followed by the inter predicted sub-blocks, the intra predicted sub-blocks, and the intra predicted blocks. Data from the inter predicted blocks and sub-blocks may be used to improve the prediction performance of intra predicted blocks and sub-blocks.
-
公开(公告)号:US20190058879A1
公开(公告)日:2019-02-21
申请号:US15895006
申请日:2018-02-13
Applicant: GOOGLE LLC
Inventor: Paul Wilkins , James Bankoski , Yaowu Xu
IPC: H04N19/10 , H04N19/21 , H04N19/196 , H04N19/17 , H04N19/176 , H04N19/82 , H04N19/463 , H04N19/523 , H04N19/124 , H04N19/61 , H04N19/117
Abstract: An apparatus for decoding compressed video information may include a memory and a processor configured to execute instructions stored in the memory to receive a compressed video stream including compressed video information, read, from the compressed video information, a segment map associating at least some blocks in a first frame with at least one segment identifier, read, from the compressed video information, a current encoded block, read, from the segment map, a segment identifier that is associated with the current encoded block, read, from the compressed video information, at least one segment parameter associated with the segment identifier that is associated with the current encoded block, generate a current decoded block by decoding the current encoded block using the at least one segment parameter associated with the segment identifier that is associated with the current encoded block, and output the current decoded block for presentation or storage.
-
-
-
-
-
-
-
-
-