-
Publication number: US20240333961A1
Publication date: 2024-10-03
Application number: US18742180
Filing date: 2024-06-13
Applicant: GOOGLE LLC
Inventor: Debargha Mukherjee , James Bankoski , Yue Chen , Yuxin Liu , Sarah Parker
IPC: H04N19/517 , H04N19/105 , H04N19/136 , H04N19/139 , H04N19/176 , H04N19/182 , H04N19/46
CPC classification number: H04N19/517 , H04N19/105 , H04N19/136 , H04N19/139 , H04N19/176 , H04N19/182 , H04N19/46
Abstract: Generating a compound predictor block includes generating a first predictor block and generating a second predictor block. The first predictor block includes a first pixel and the second predictor block includes a second pixel. The first and the second pixels are located at a same location within the first predictor block and the second predictor block, respectively. A first weight is determined for the first pixel based on a difference between a first value of the first pixel and a second value of the second pixel. A second weight is determined for the second pixel based on the first weight. The compound predictor block is generated by combining the first predictor block and the second predictor block. The compound predictor block includes a weighted pixel that is determined based on a weighted sum of the first pixel and the second pixel based on the first weight and the second weight.
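The per-pixel weighting described in this abstract can be sketched as follows. This is only an illustration under stated assumptions: the abstract does not give the weight function, so the mapping from pixel difference to weight (`base`, `scale`, `max_weight`) and the function name `compound_predictor` are hypothetical choices, not the claimed method.

```python
def compound_predictor(pred1, pred2, base=38, scale=16, max_weight=64):
    """Blend two equally sized predictor blocks (lists of rows of ints).

    Assumption: the first weight grows with the absolute difference between
    co-located pixels and is clamped to max_weight; the second weight is
    derived from the first as max_weight - w1.
    """
    out = []
    for row1, row2 in zip(pred1, pred2):
        out_row = []
        for p1, p2 in zip(row1, row2):
            diff = abs(p1 - p2)
            # First weight: based on the difference of the two pixel values.
            w1 = min(max_weight, base + diff // scale)
            # Second weight: determined from the first weight.
            w2 = max_weight - w1
            # Weighted sum with rounding, normalized by max_weight.
            out_row.append((w1 * p1 + w2 * p2 + max_weight // 2) // max_weight)
        out.append(out_row)
    return out


blended = compound_predictor([[100, 200]], [[100, 40]])
```

Where the two predictors agree, the result equals the shared value; where they disagree, the assumed mapping shifts weight toward the first predictor.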
-
Publication number: US12034963B2
Publication date: 2024-07-09
Application number: US17731470
Filing date: 2022-04-28
Applicant: GOOGLE LLC
Inventor: Debargha Mukherjee , James Bankoski , Yue Chen , Yuxin Liu , Sarah Parker
IPC: H04N11/02 , H04N19/105 , H04N19/136 , H04N19/139 , H04N19/176 , H04N19/182 , H04N19/46 , H04N19/517
CPC classification number: H04N19/517 , H04N19/105 , H04N19/136 , H04N19/139 , H04N19/176 , H04N19/182 , H04N19/46
Abstract: Generating a compound predictor block for a current block of video includes generating, for the current block, a first predictor block using one of inter-prediction or intra-prediction and generating a second predictor block. The first predictor block includes a first pixel and the second predictor block includes a second pixel that is co-located with the first pixel. A first weight is determined for the first pixel using a difference between a value of the first pixel and a value of the second pixel. A second weight is determined for the second pixel using the first weight. The compound predictor block is generated by combining the first predictor block and the second predictor block. The compound predictor block includes a weighted pixel that is determined using a weighted sum of the first pixel and the second pixel using the first weight and the second weight.
-
Publication number: US20240195985A1
Publication date: 2024-06-13
Application number: US18286574
Filing date: 2021-05-07
Applicant: Google LLC
IPC: H04N19/176 , H04N19/119 , H04N19/159
CPC classification number: H04N19/176 , H04N19/119 , H04N19/159
Abstract: Methods, systems, and apparatus, including computer programs, for compression and decompression of video data using an ensemble of machine learning models. Methods can include defining, for each frame in a video, a plurality of blocks in the frame. Methods can further include processing the frames of video in sequential sets, wherein each sequential set is at least a current frame (220) of video and a prior frame (240) of video in the ordered sequence. Each respective prediction of a block in the frame of the video includes providing, as input to a prediction model, a first and a second border (235, 230) of a current block (225) of the current frame, a first and a second border (250, 255) for a respective current block (245) of the prior frame, and the respective current block (245) of the prior frame.
-
Publication number: US20240179352A1
Publication date: 2024-05-30
Application number: US18437402
Filing date: 2024-02-09
Applicant: GOOGLE LLC
Inventor: Debargha Mukherjee
IPC: H04N19/86 , G06T5/70 , H04N19/117 , H04N19/136 , H04N19/174 , H04N19/176 , H04N19/182 , H04N19/192 , H04N19/46 , H04N19/463 , H04N19/80 , H04N19/85
CPC classification number: H04N19/865 , G06T5/70 , H04N19/117 , H04N19/136 , H04N19/174 , H04N19/176 , H04N19/182 , H04N19/192 , H04N19/46 , H04N19/80 , H04N19/85 , H04N19/86 , G06T2207/20021 , G06T2207/20192 , H04N19/463
Abstract: A portion Y of a degraded frame is restored using a projection operation that uses a first projection parameter α, a second projection parameter β, and at least two guide portions. Restoring the portion Y of the degraded frame includes generating, using first restoration parameters, a first guide portion Y1 for the portion Y; generating, using second restoration parameters, a second guide portion Y2 for the portion Y; and generating a reconstructed portion YR, wherein the projection operation is based on α(Y1−Y)+β(Y2−Y).
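A minimal sketch of the projection step, assuming the reconstructed portion is formed as YR = Y + α(Y1 − Y) + β(Y2 − Y). The abstract states only that the operation is "based on" α(Y1−Y)+β(Y2−Y), so adding the term back to Y is an assumption, and the function name `project` is hypothetical. How the guide portions Y1 and Y2 are produced from the restoration parameters is not shown.

```python
def project(y, y1, y2, alpha, beta):
    """Combine a degraded portion y with two guide portions y1 and y2.

    Each argument y, y1, y2 is a list of rows of floats of identical shape.
    Assumption: YR = Y + alpha * (Y1 - Y) + beta * (Y2 - Y), per pixel.
    """
    return [
        [p + alpha * (g1 - p) + beta * (g2 - p)
         for p, g1, g2 in zip(row, row1, row2)]
        for row, row1, row2 in zip(y, y1, y2)
    ]


restored = project([[10.0, 20.0]], [[12.0, 18.0]], [[14.0, 20.0]], 0.5, 0.25)
```

With α = β = 0 the portion is returned unchanged, and with α = 1, β = 0 the result equals the first guide portion.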
-
Publication number: US20240098298A1
Publication date: 2024-03-21
Application number: US18521182
Filing date: 2023-11-28
Applicant: GOOGLE LLC
Inventor: Debargha Mukherjee , Yuxin Liu , Sarah Parker
IPC: H04N19/517 , H04N19/17 , H04N19/20 , H04N19/513 , H04N19/54 , H04N19/543 , H04N19/547 , H04N19/557 , H04N19/80
CPC classification number: H04N19/517 , H04N19/17 , H04N19/20 , H04N19/521 , H04N19/54 , H04N19/543 , H04N19/547 , H04N19/557 , H04N19/80
Abstract: Multiple global motion models associated with respective segments of a current frame are decoded from a compressed bitstream. Each global motion model is based on a segmentation of the current frame and represents a respective underlying motion of blocks within a respective segment. Blocks of the current frame are decoded by: for each inter-predicted block of a segment, decoding, from the compressed bitstream, an indication of whether to decode the each inter-predicted block based on a global motion model of the multiple global motion models and associated with the segment, or whether to decode the each inter-predicted block based on a motion vector that is different from the global motion model; and decoding the each inter-predicted block based on the indication.
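The per-block choice between a segment's global motion model and a separate motion vector might look like the sketch below. It rests on assumptions the abstract does not state: the global model is represented here as a six-parameter affine transform, and the names `block_motion`, `segment_models`, and `use_global` are hypothetical.

```python
def block_motion(center, segment_id, use_global, segment_models, local_mv):
    """Return the (dx, dy) displacement used to predict one block.

    center: (x, y) position of the block center.
    use_global: the decoded per-block indication.
    segment_models: dict mapping segment id to an assumed affine model
        (a, b, c, d, e, f) with x' = a*x + b*y + c and y' = d*x + e*y + f.
    local_mv: explicit (dx, dy) used when the indication is False.
    """
    if not use_global:
        return local_mv
    a, b, c, d, e, f = segment_models[segment_id]
    x, y = center
    # Displacement implied by the segment's global model at this position.
    return (a * x + b * y + c - x, d * x + e * y + f - y)


# Segment 0 carries a pure translation of (2, -3) in this toy example.
models = {0: (1, 0, 2, 0, 1, -3)}
mv_global = block_motion((8, 8), 0, True, models, (5, 5))
mv_local = block_motion((8, 8), 0, False, models, (5, 5))
```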
-
Publication number: US20240098280A1
Publication date: 2024-03-21
Application number: US18272862
Filing date: 2021-01-19
Applicant: Google LLC
Inventor: Urvang Joshi , Yue Chen , Sarah Parker , Elliott Karpilovsky , Debargha Mukherjee
IPC: H04N19/176 , H04N19/30
CPC classification number: H04N19/176 , H04N19/30
Abstract: Image coding using guided machine learning restoration may include obtaining reconstructed frame data by decoding, obtaining a restored frame by restoring the reconstructed frame, and outputting the restored frame. Obtaining the restored frame may include obtaining a reconstructed block, obtaining guide parameter values, obtaining a restored block, and including the restored block in the restored frame. Obtaining the restored block may include inputting the reconstructed block to an input layer of a trained guided convolutional neural network, wherein the neural network is constrained such that an output layer has a defined cardinality of channels, obtaining, from the output layer, neural network output channel predictions, obtaining a guided neural network prediction as a linear combination of the guide parameter values and the neural network output channel predictions, and generating the restored block using the guided neural network prediction.
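The linear-combination step in this abstract can be illustrated with a short sketch. It assumes one guide parameter value per output channel and plain per-pixel weighting; the network itself is not modeled, and the name `guided_prediction` is hypothetical.

```python
def guided_prediction(channels, guides):
    """Combine neural network output channels using guide parameter values.

    channels: list of K channel predictions, each a list of rows of floats.
    guides: list of K guide parameter values (assumed one per channel).
    Returns the per-pixel sum of guides[k] * channels[k].
    """
    if len(channels) != len(guides):
        raise ValueError("expected one guide parameter value per channel")
    height = len(channels[0])
    width = len(channels[0][0])
    return [
        [sum(g * ch[r][c] for g, ch in zip(guides, channels))
         for c in range(width)]
        for r in range(height)
    ]


combined = guided_prediction([[[1.0, 2.0]], [[4.0, 8.0]]], [0.5, 0.25])
```

Fixing the number of output channels is what keeps the list of guide parameter values at a fixed length in this sketch.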
-
Publication number: US20230179789A1
Publication date: 2023-06-08
Application number: US18155224
Filing date: 2023-01-17
Applicant: Google LLC
Inventor: Urvang Joshi , Debargha Mukherjee , Andrew Simpson
IPC: H04N19/50 , H04N7/01 , H04N19/117 , H04N19/184 , H04N19/61 , H04N19/176
CPC classification number: H04N19/50 , H04N7/0117 , H04N19/117 , H04N19/184 , H04N19/619 , H04N19/176
Abstract: A super-resolution coding mode is described. An encoded image can be decoded from an encoded bitstream stored on a non-transitory computer-readable storage medium. A flag can indicate whether an image was encoded using the super-resolution mode at a first resolution. Responsive to the flag indicating that the image was encoded using the super-resolution mode, bits indicating an amount of scaling of the image are included. The image is decoded from the encoded bitstream to obtain a reconstructed image at the first resolution, and the reconstructed image is upscaled to a second resolution using the amount of scaling to obtain an upscaled reconstructed image. The second resolution is higher than the first resolution. Loop restoration parameters within the bitstream can be used for loop restoration filtering of the upscaled reconstructed image to obtain a loop restored image at the second resolution.
-
Publication number: US20220353545A1
Publication date: 2022-11-03
Application number: US17866631
Filing date: 2022-07-18
Applicant: GOOGLE LLC
Inventor: Debargha Mukherjee
IPC: H04N19/86 , H04N19/182 , H04N19/46 , H04N19/80 , H04N19/174 , H04N19/85 , H04N19/192 , G06T5/00 , H04N19/117 , H04N19/136 , H04N19/176
Abstract: A device for restoring a degraded frame resulting from reconstruction of a source frame includes a processor that is configured to receive a compressed bitstream. The compressed bitstream includes a first projection parameter α, a second projection parameter β, first restoration parameters comprising a first radius value, and second restoration parameters comprising a second radius value. The processor is further configured to restore at least a portion of the degraded frame using a projection operation that uses the first projection parameter α, the second projection parameter β, and at least two guide tiles.
-
Publication number: US20210409705A1
Publication date: 2021-12-30
Application number: US16912767
Filing date: 2020-06-26
Applicant: GOOGLE LLC
Inventor: Sarah Parker , Debargha Mukherjee , Yue Chen , Elliott Karpilovsky , Urvang Joshi
IPC: H04N19/119 , H04N19/12 , H04N19/61 , H04N19/176 , G06N20/00
Abstract: Transform-level partitioning of a prediction residual block is performed to improve compression efficiency of video data. During encoding, a prediction residual block is generated responsive to prediction-level partitioning performed against a video block, a transform block partition type to use is determined based on the prediction residual block, a non-recursive transform-level partitioning is performed against the prediction residual block according to the transform block partition type, and transform blocks generated as a result of the transform-level partitioning are encoded to a bitstream. During decoding, a symbol representative of the transform block partition type used to encode transform blocks is derived from the bitstream, inverse transformed blocks are produced by inverse transforming encoded video data associated with the prediction residual block, and the prediction residual block is reproduced according to the transform block partition type and used to reconstruct the video block, which is output within an output video stream.
-
Publication number: US20210185312A1
Publication date: 2021-06-17
Application number: US16712057
Filing date: 2019-12-12
Applicant: GOOGLE LLC
Inventor: Lester Lu , Debargha Mukherjee , Elliott Karpilovsky
IPC: H04N19/107 , H04N19/122 , H04N19/50 , H04N19/176
Abstract: Coding a block of video data includes determining a prediction mode for the block, which is an inter-prediction or intra-prediction mode, determining a transform type for the block, and coding the block using the prediction mode and the transform type. The transform type is one of a first plurality of transform types when the prediction mode is the inter-prediction mode, and is one of a second plurality of transform types when the prediction mode is the intra-prediction mode. The first plurality of transform types includes first fixed transform types and first mode-dependent transform types that are based on a first learned transform generated using inter-predicted blocks. The second plurality of transform types includes second fixed transform types and second mode-dependent transform types that are based on a second learned transform generated using intra-predicted blocks. The first and second fixed transform types have at least some fixed transform types in common.