-
公开(公告)号:US20220207654A1
公开(公告)日:2022-06-30
申请号:US17698116
申请日:2022-03-18
Applicant: GOOGLE LLC
Inventor: Debargha Mukherjee , Urvang Joshi , Yue Chen , Sarah Parker
IPC: G06T5/00 , G06T5/50 , G06N3/08 , G06N3/04 , G06N20/20 , G06T3/40 , H04N19/70 , H04N19/176 , G06K9/62 , G06T9/00
Abstract: Guided restoration is used to restore video data degraded from a video frame. The video frame is divided into restoration units (RUs) which each correspond to one or more blocks of the video frame. Restoration schemes are selected for each RU. The restoration schemes may indicate to use one of a plurality of neural networks trained for the guided restoration. Alternatively, the restoration schemes may indicate to use a neural network and a filter-based restoration tool. The video frame is then restored by processing each RU according to the respective selected restoration scheme. During encoding, the restored video frame is encoded to an output bitstream, and the use of the selected restoration schemes may be signaled within the output bitstream. During decoding, the restored video frame is output to an output video stream.
-
公开(公告)号:US11197004B1
公开(公告)日:2021-12-07
申请号:US16919507
申请日:2020-07-02
Applicant: Google LLC
Inventor: Rohit Singh , Debargha Mukherjee , Elliott Karpilovsky , Lester Lu
IPC: H04N19/184 , H04N19/124 , H04N19/61
Abstract: Transform modes are derived for inter-predicted blocks using side information available within a bitstream. An inter-predicted encoded video block and side information are identified within a bitstream. Based on the side information, a trained transform is determined for inverse transforming transform coefficients of the inter-predicted encoded video block from amongst multiple trained transforms. The transform coefficients of the inter-predicted encoded video block are inverse transformed according to the trained transform to produce a prediction residual. A video block is reconstructed using the prediction residual and the reference frame. The video block is then output within an output video stream for storage or display. To determine the trained transforms, a learning model uses individual side information types and combinations of the individual side information types processed against a training data set.
-
公开(公告)号:US11115678B2
公开(公告)日:2021-09-07
申请号:US16861299
申请日:2020-04-29
Applicant: GOOGLE LLC
Inventor: Debargha Mukherjee , Emil Keyder , Michele Covell , Chen Wang , Sarah Parker , Ramin Zabih
IPC: H04N19/527 , H04N19/172 , H04N19/176 , H04N19/109 , G06T7/246 , H04N19/44 , H04N19/159 , H04N19/124 , H04N19/119 , H04N19/192 , H04N19/137 , H04N19/17 , H04N19/167 , H04N19/543 , H04N19/573 , H04N19/147 , H04N19/14 , H04N19/557
Abstract: An apparatus for encoding a current frame of a video. The apparatus includes a memory and a processor. The processor is configured to execute instructions stored in the memory to generate, for each reference frame of a subset of available reference frames, at least one respective candidate global motion model (GMM); partition the current frame into blocks; generate an aggregated residual frame for the current frame; and encode the respective residual blocks in a compressed bitstream. To generate the aggregated residual frame includes to select, for predicting each block of the blocks, a respective selected GMM, where the respective selected GMM corresponds to the one of the at least one respective candidate GMMs that minimizes a total error associated with the aggregated residual frame; and obtain respective residual blocks for the block.
-
公开(公告)号:US10827196B2
公开(公告)日:2020-11-03
申请号:US15807653
申请日:2017-11-09
Applicant: GOOGLE LLC
Inventor: Debargha Mukherjee , James Bankoski , Yue Chen , Yuxin Liu , Sarah Parker
IPC: H04N11/02 , H04N19/517 , H04N19/46 , H04N19/136 , H04N19/105 , H04N19/139 , H04N19/176 , H04N19/182
Abstract: Generating a compound predictor block of a current block includes generating, for the current block, predictor blocks including a first predictor block formed of first predictor pixels. Using at least a subset of the first predictor pixels, a first modulation value for modulating a first weight to be applied to a first predictor pixel of the first predictor pixels is determined. The compound predictor block is generated using the first predictor pixel, the first weight, and the first modulation value.
-
公开(公告)号:US20200068224A1
公开(公告)日:2020-02-27
申请号:US16666676
申请日:2019-10-29
Applicant: GOOGLE LLC
Inventor: Debargha Mukherjee
IPC: H04N19/86 , H04N19/176 , H04N19/136 , H04N19/117 , G06T5/00 , H04N19/192 , H04N19/85 , H04N19/174 , H04N19/80 , H04N19/46 , H04N19/182
Abstract: A method includes generating, using first restoration parameters, a first guide tile for a degraded tile of the degraded frame, the degraded tile corresponding to a source tile of the source frame; generating, using second restoration parameters, a second guide tile for the degraded tile of the degraded frame, the second restoration parameters being different from the first restoration parameters; determining a first tile difference between the source tile and the first guide tile; determining a second tile difference between the source tile and the second guide tile; calculating projection parameters that minimize a difference between a restored tile of the degraded tile and the source tile; and encoding, in an encoded bitstream, the projection parameters. The difference between the restored tile of the degraded tile and the source tile is a linear combination, using the projection parameters, of the first tile difference and the second tile difference.
-
公开(公告)号:US10469841B2
公开(公告)日:2019-11-05
申请号:US15010594
申请日:2016-01-29
Applicant: GOOGLE LLC
Inventor: James Bankoski , Debargha Mukherjee , Yaowu Xu
IPC: H04N19/132 , H04N19/186 , H04N19/182 , H04N19/593 , H04N19/117 , H04N19/44 , H04N19/176 , H04N19/105 , H04N19/119 , H04N19/137 , H04N19/543 , H04N19/124 , H04N19/52 , H04N19/61
Abstract: A mask is generated from a residual that is a difference between pixel values of at least two frames other than a current frame in a video sequence. The mask is then used to encode or decode a current block by inter-prediction. The mask forms an edge of an object within its bounds and can be used to combine different portions of a reference frame into a single block. This can increase coding efficiency.
-
公开(公告)号:US20190149841A1
公开(公告)日:2019-05-16
申请号:US16016857
申请日:2018-06-25
Applicant: GOOGLE LLC
Inventor: Debargha Mukherjee , Emil Keyder , Michele Covell , Chen Wang , Sarah Parker , Ramin Zabih
IPC: H04N19/527 , G06T7/246 , H04N19/172 , H04N19/176 , H04N19/159 , H04N19/124 , H04N19/44
Abstract: A method for encoding a current frame of a video includes jointly determining respective motion models for reference frames and encoding the current frame using the respective motion models. The reference frames are used for encoding the current frame. Jointly determining respective motion models for reference frames includes determining respective aggregated residuals for combinations of candidate motion models and selecting the combination of candidate motion models that corresponds to the smallest aggregated residual. The respective motion models correspond to the candidate motion models of the selected combination.
-
公开(公告)号:US10271062B2
公开(公告)日:2019-04-23
申请号:US15074442
申请日:2016-03-18
Applicant: Google LLC
Inventor: Debargha Mukherjee , Yaowu Xu , Yuxin Liu
IPC: H04N19/51 , H04N19/137 , H04N19/577 , H04N19/184 , H04N19/573 , H04N19/44 , H04N19/176 , H04N19/52 , H04N19/463
Abstract: Motion vectors are predicted for a current block of a current frame using motion vectors from previous frames in the video stream. Temporal distances between a current frame and the one or more reference frames used to predict a current block and temporal distances between the previous frame and its reference frames are determined. Temporal distances for current frames and previous frames can be combined to weight the motion vectors and improve motion vector prediction.
-
59.
公开(公告)号:US20180278932A1
公开(公告)日:2018-09-27
申请号:US15987547
申请日:2018-05-23
Applicant: GOOGLE LLC
Inventor: Debargha Mukherjee , Jim Bankoski
IPC: H04N19/115 , H04N19/119 , H04N19/52 , H04N19/50 , H04N19/126 , H04N19/61 , H04N19/51
CPC classification number: H04N19/115 , H04N19/105 , H04N19/119 , H04N19/126 , H04N19/50 , H04N19/51 , H04N19/52 , H04N19/573 , H04N19/61 , H04N19/91
Abstract: An apparatus (e.g., a computer device) includes a video coding system. The video coding system includes an encoder. The encoder is configured to partition a block of video data into a first partition and a second partition, the first partition and the second partition being divided by a border that is at an angle to an orientation of a row or column of pixels, determine a first motion vector for the first partition and a second motion vector for the second partition, determine a first set of predictor values based on the first motion vector and a second set of predictor values based on the second motion vector, and combine the first set of predictor values and the second set of predictor values into an overall set of predictor values based on a weighting algorithm.
-
公开(公告)号:US20180270497A1
公开(公告)日:2018-09-20
申请号:US15838748
申请日:2017-12-12
Applicant: GOOGLE LLC
Inventor: Debargha Mukherjee , Yuxin Liu , Sarah Parker
IPC: H04N19/517 , H04N19/513 , H04N19/80 , H04N19/557
CPC classification number: H04N19/517 , H04N19/17 , H04N19/20 , H04N19/521 , H04N19/54 , H04N19/543 , H04N19/547 , H04N19/557 , H04N19/80
Abstract: Encoding and decoding using parametrized motion models are disclosed. A method includes segmenting the video frame with respect to a reference frame resulting in a segmentation, determining a first motion vector for the current block based on the segmentation, determining a second motion vector for the current block using translational motion compensation, and encoding, for the current block, the one of the first motion vector and the second motion vector corresponding to a smaller error. The segmentation includes a segment containing the current block and a parameterized motion model for the segment. Another method includes identifying a parameterized motion model corresponding to a motion model type, associating the parameterized motion model with a segment of a reference frame, and, in response to determining that the current block is encoded using the parameterized motion model, decoding the current block using the parameterized motion model.
-
-
-
-
-
-
-
-
-