COMBINATION OF MODE-DEPENDENT AND FIXED TRANSFORM TYPES IN VIDEO CODING

    公开(公告)号:US20240323361A1

    公开(公告)日:2024-09-26

    申请号:US18678031

    申请日:2024-05-30

    Applicant: GOOGLE LLC

    CPC classification number: H04N19/107 H04N19/122 H04N19/176 H04N19/50

    Abstract: Decoding video data includes, for a block encoded using a prediction mode, determining a transform mode for the block using the prediction mode. The transform mode is a first mode when the prediction mode is an inter-prediction mode and is a second mode when the prediction mode is an intra-prediction mode. The first mode is an available first transform type that is a combination of transforms selected from first fixed transforms and first learned transforms that each comprise a respective transformation matrix generated iteratively using blocks predicted using the inter-prediction mode. The second mode is an available second transform type that is a combination of transforms selected from second fixed transforms, which is a proper subset of the first fixed transforms, and a second learned transform comprising a transformation matrix that is generated iteratively using blocks predicted using the intra-prediction mode. Decoding the block uses the prediction and transform modes.

    TRANSFORM PREDICTION WITH PARSING INDEPENDENT CODING

    公开(公告)号:US20240205458A1

    公开(公告)日:2024-06-20

    申请号:US18542850

    申请日:2023-12-18

    Applicant: GOOGLE LLC

    Abstract: Transform prediction with parsing independent coding includes generating a reconstructed frame and outputting the reconstructed frame. Generating the reconstructed frame includes entropy decoding transform blocks for the reconstructed frame, entropy decoding decoded transform identifiers for the transform blocks, obtaining transform-specific probability distributions for available transforms, and, for a current transform block from the transform blocks, identifying a current remapped transform identifier from the decoded transform identifiers, identifying a current transform identifier in accordance with the current remapped transform identifier, the transform coefficients from the current transform block, and the transform-specific probability distributions, identifying a current transform in accordance with the current transform identifier; inverse transforming, in accordance with the current transform, the current transform block to obtain a current residual block and obtaining a current reconstructed block using the current residual block. Generating the reconstructed frame includes including the current reconstructed block in the reconstructed frame.

    Inter-prediction mode-dependent transforms for video coding

    公开(公告)号:US11197004B1

    公开(公告)日:2021-12-07

    申请号:US16919507

    申请日:2020-07-02

    Applicant: Google LLC

    Abstract: Transform modes are derived for inter-predicted blocks using side information available within a bitstream. An inter-predicted encoded video block and side information are identified within a bitstream. Based on the side information, a trained transform is determined for inverse transforming transform coefficients of the inter-predicted encoded video block from amongst multiple trained transforms. The transform coefficients of the inter-predicted encoded video block are inverse transformed according to the trained transform to produce a prediction residual. A video block is reconstructed using the prediction residual and the reference frame. The video block is then output within an output video stream for storage or display. To determine the trained transforms, a learning model uses individual side information types and combinations of the individual side information types processed against a training data set.

Patent Agency Ranking