COMPOUND PREDICTION FOR VIDEO CODING

    Publication No.: US20240333961A1

    Publication date: 2024-10-03

    Application No.: US18742180

    Filing date: 2024-06-13

    Applicant: GOOGLE LLC

    Abstract: Generating a compound predictor block includes generating a first predictor block and generating a second predictor block. The first predictor block includes a first pixel and the second predictor block includes a second pixel. The first and the second pixels are located at a same location within the first predictor block and the second predictor block, respectively. A first weight is determined for the first pixel based on a difference between a first value of the first pixel and a second value of the second pixel. A second weight is determined for the second pixel based on the first weight. The compound predictor block is generated by combining the first predictor block and the second predictor block. The compound predictor block includes a weighted pixel that is determined based on a weighted sum of the first pixel and the second pixel based on the first weight and the second weight.
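The per-pixel weighting described in this abstract can be sketched as follows. The abstract only says that the first weight depends on the difference between the two co-located pixel values and that the second weight depends on the first; the specific rule used here (a linear ramp on the absolute difference, with the second weight as the complement of the first) and the parameter names `base_weight`, `boost`, and `max_diff` are assumptions made for illustration, not the claimed method.

```python
def compound_predictor(block_a, block_b, max_diff=255, base_weight=0.5, boost=0.25):
    """Combine two equally sized predictor blocks pixel by pixel.

    Assumed rule (not taken from the abstract): the first weight starts at
    base_weight and grows linearly with the absolute pixel difference; the
    second weight is 1 - first weight.
    """
    combined = []
    for row_a, row_b in zip(block_a, block_b):
        out_row = []
        for p1, p2 in zip(row_a, row_b):
            diff = abs(p1 - p2)                      # difference of co-located pixels
            w1 = base_weight + boost * min(diff, max_diff) / max_diff
            w2 = 1.0 - w1                            # second weight derived from the first
            out_row.append(int(round(w1 * p1 + w2 * p2)))  # weighted sum
        combined.append(out_row)
    return combined


if __name__ == "__main__":
    first = [[100, 100]]
    second = [[100, 200]]
    print(compound_predictor(first, second))
```

Where the two predictors agree, the result equals either input; where they disagree, this particular rule leans toward the first predictor, which is one of many possible choices.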

    VIDEO INTER/INTRA COMPRESSION USING MIXTURE OF EXPERTS

    Publication No.: US20240195985A1

    Publication date: 2024-06-13

    Application No.: US18286574

    Filing date: 2021-05-07

    Applicant: Google LLC

    CPC classification number: H04N19/176 H04N19/119 H04N19/159

    Abstract: Methods, systems, and apparatus, including computer programs, for compression and decompression of video data using an ensemble of machine learning models. Methods can include defining, for each frame in a video, a plurality of blocks in the frame. Methods can further include processing the frames of video in sequential sets, wherein each sequential set is at least a current frame (220) of video and a prior frame (240) of video in the ordered sequence. Each respective prediction of a block in the frame of the video includes providing, as input to a prediction model, a first and a second border (235, 230) of a current block (225) of the current frame, a first and a second border (250, 255) for a respective current block (245) of the prior frame, and the respective current block (245) of the prior frame.

    Video Coding With Guided Machine Learning Restoration

    Publication No.: US20240098280A1

    Publication date: 2024-03-21

    Application No.: US18272862

    Filing date: 2021-01-19

    Applicant: Google LLC

    CPC classification number: H04N19/176 H04N19/30

    Abstract: Image coding using guided machine learning restoration may include obtaining reconstructed frame data by decoding, obtaining a restored frame by restoring the reconstructed frame, and outputting the restored frame. Obtaining the restored frame may include obtaining a reconstructed block, obtaining guide parameter values, obtaining a restored block, and including the restored block in the restored frame. Obtaining the restored block may include inputting the reconstructed block to an input layer of a trained guided convolutional neural network, wherein the neural network is constrained such that an output layer has a defined cardinality of channels, obtaining, from the output layer, neural network output channel predictions, obtaining a guided neural network prediction as a linear combination of the guide parameter values and the neural network output channel predictions, and generating the restored block using the guided neural network prediction.
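The "linear combination of the guide parameter values and the neural network output channel predictions" can be illustrated with a small sketch. The function name `guided_prediction` and the data layout (a list of K channel planes and K scalar guide values) are assumptions; how the guide values are obtained and how the restored block is then formed from this prediction are not specified in the abstract and are left out.

```python
def guided_prediction(channels, guide_params):
    """Return sum_k guide_params[k] * channels[k], computed per pixel.

    channels: list of K two-dimensional lists with identical shape
              (stand-ins for the network's output channels).
    guide_params: list of K scalar guide parameter values.
    """
    if len(channels) != len(guide_params):
        raise ValueError("one guide parameter is expected per output channel")
    rows, cols = len(channels[0]), len(channels[0][0])
    return [
        [sum(g * ch[r][c] for g, ch in zip(guide_params, channels))
         for c in range(cols)]
        for r in range(rows)
    ]


if __name__ == "__main__":
    channel_planes = [[[1.0, 2.0]], [[10.0, 20.0]]]   # K = 2 channels, 1x2 block
    print(guided_prediction(channel_planes, [0.5, 0.25]))
```

Constraining the output layer to a fixed number of channels means only K scalars are needed to steer the result for each block.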

    SUPER-RESOLUTION LOOP RESTORATION

    Publication No.: US20230179789A1

    Publication date: 2023-06-08

    Application No.: US18155224

    Filing date: 2023-01-17

    Applicant: Google LLC

    Abstract: A super-resolution coding mode is described. An encoded image can be decoded from an encoded bitstream stored on a non-transitory computer-readable storage medium. A flag can indicate whether an image was encoded using the super-resolution mode at a first resolution. Responsive to the flag indicating that the image was encoded using the super-resolution mode, bits indicating an amount of scaling of the image are included. The image is decoded from the encoded bitstream to obtain a reconstructed image at the first resolution, and the reconstructed image is upscaled to a second resolution using the amount of scaling to obtain an upscaled reconstructed image. The second resolution is higher than the first resolution. Loop restoration parameters within the bitstream can be used for loop restoration filtering of the upscaled reconstructed image to obtain a loop restored image at the second resolution.
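The decode order described in this abstract (read the flag, read the scaling amount, decode at the first resolution, upscale, then apply loop restoration) might be sketched as below. Everything concrete here is an assumption: the `header` dictionary with keys `use_superres` and `scale`, the integer scale factor, the nearest-neighbour upscaler, and the callables `decode_fn` and `restore_fn` are placeholders, since the abstract does not specify the syntax elements or the filters.

```python
def upscale_nearest(image, factor):
    """Integer nearest-neighbour upscaling (an assumed stand-in filter)."""
    return [
        [px for px in row for _ in range(factor)]
        for row in image
        for _ in range(factor)
    ]


def decode_frame(header, decode_fn, restore_fn):
    """Decode one image following the order given in the abstract.

    header: hypothetical parsed header, e.g. {"use_superres": True, "scale": 2}.
    decode_fn: returns the reconstructed image at the first (coded) resolution.
    restore_fn: loop restoration filter applied to the (upscaled) image.
    """
    reconstructed = decode_fn()
    if header["use_superres"]:
        # The scaling amount is only signalled when the flag is set.
        reconstructed = upscale_nearest(reconstructed, header["scale"])
    # Restoration runs at the second resolution when super-resolution is on.
    return restore_fn(reconstructed)


if __name__ == "__main__":
    hdr = {"use_superres": True, "scale": 2}
    print(decode_frame(hdr, lambda: [[1, 2]], lambda img: img))
```

The point of the ordering is that restoration operates on the upscaled image, so it can also correct artifacts introduced by the upscaling step.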

    EXTENDED TRANSFORM PARTITIONS FOR VIDEO COMPRESSION

    Publication No.: US20210409705A1

    Publication date: 2021-12-30

    Application No.: US16912767

    Filing date: 2020-06-26

    Applicant: GOOGLE LLC

    Abstract: Transform-level partitioning of a prediction residual block is performed to improve compression efficiency of video data. During encoding, a prediction residual block is generated responsive to prediction-level partitioning performed against a video block, a transform block partition type to use is determined based on the prediction residual block, a non-recursive transform-level partitioning is performed against the prediction residual block according to the transform block partition type, and transform blocks generated as a result of the transform-level partitioning are encoded to a bitstream. During decoding, a symbol representative of the transform block partition type used to encode transform blocks is derived from the bitstream, inverse transformed blocks are produced by inverse transforming encoded video data associated with the prediction residual block, and the prediction residual block is reproduced according to the transform block partition type and used to reconstruct the video block, which is output within an output video stream.
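A non-recursive transform-level partition can be pictured as a single lookup from a partition type to a set of transform-block rectangles. The partition type names used below (`NONE`, `VERT`, `HORZ`, `SPLIT`) and the function `partition_residual` are hypothetical; the abstract does not enumerate the actual partition types.

```python
def partition_residual(width, height, partition_type):
    """Return transform blocks as (x, y, w, h) tuples for one residual block.

    The split is applied once; the resulting blocks are never split again,
    which is what makes the partitioning non-recursive.
    """
    half_w, half_h = width // 2, height // 2
    if partition_type == "NONE":
        return [(0, 0, width, height)]
    if partition_type == "VERT":      # left and right halves
        return [(0, 0, half_w, height), (half_w, 0, width - half_w, height)]
    if partition_type == "HORZ":      # top and bottom halves
        return [(0, 0, width, half_h), (0, half_h, width, height - half_h)]
    if partition_type == "SPLIT":     # four quadrants
        return [
            (0, 0, half_w, half_h),
            (half_w, 0, width - half_w, half_h),
            (0, half_h, half_w, height - half_h),
            (half_w, half_h, width - half_w, height - half_h),
        ]
    raise ValueError(f"unknown partition type: {partition_type}")


if __name__ == "__main__":
    print(partition_residual(16, 8, "SPLIT"))
```

Because the mapping is one level deep, a decoder needs only the single symbol for the partition type to recover the layout of the transform blocks.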

    COMBINATION OF MODE-DEPENDENT AND FIXED TRANSFORM TYPES IN VIDEO CODING

    Publication No.: US20210185312A1

    Publication date: 2021-06-17

    Application No.: US16712057

    Filing date: 2019-12-12

    Applicant: GOOGLE LLC

    Abstract: Coding a block of video data includes determining a prediction mode for the block, which is an inter-prediction or intra-prediction mode, determining a transform type for the block, and coding the block using the prediction mode and the transform type. The transform type is one of a first plurality of transform types when the prediction mode is the inter-prediction mode, and is one of a second plurality of transform types when the prediction mode is the intra-prediction mode. The first plurality of transform types includes first fixed transform types and first mode-dependent transform types that are based on a first learned transform generated using inter-predicted blocks. The second plurality of transform types includes second fixed transform types and second mode-dependent transform types that are based on a second learned transform generated using intra-predicted blocks. The first and second fixed transform types have at least some fixed transform types in common.
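The relationship between the two candidate sets can be sketched as two tuples that share some fixed transform types and differ in their mode-dependent (learned) ones. All names below are illustrative placeholders, not the transform types named in the patent, and `candidate_transform_types` is a hypothetical helper.

```python
# Fixed transform types shared by both prediction modes (placeholder names).
FIXED_COMMON = ("DCT", "ADST")

# Each mode adds its own fixed and learned, mode-dependent types (placeholders).
INTER_TYPES = FIXED_COMMON + ("IDENTITY", "LEARNED_INTER_0")
INTRA_TYPES = FIXED_COMMON + ("FLIPADST", "LEARNED_INTRA_0")


def candidate_transform_types(prediction_mode):
    """Return the transform types a block may use for its prediction mode."""
    if prediction_mode == "inter":
        return INTER_TYPES
    if prediction_mode == "intra":
        return INTRA_TYPES
    raise ValueError(f"unknown prediction mode: {prediction_mode}")


if __name__ == "__main__":
    for mode in ("inter", "intra"):
        print(mode, candidate_transform_types(mode))
```

Keeping the learned transforms separate per mode reflects that they are derived from inter-predicted and intra-predicted blocks respectively, while the shared fixed types remain available in both cases.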
