Codebook generation for cloud-based video applications

    公开(公告)号:US11638007B2

    公开(公告)日:2023-04-25

    申请号:US17446862

    申请日:2021-09-03

    申请人: Adobe Inc.

    摘要: Techniques are disclosed for the improvement of vector quantization (VQ) codebook generation. The improved codebooks may be used for compression in cloud-based video applications. VQ achieves compression by vectorizing input video streams, matching those vectors to codebook vector entries, and replacing them with indexes of the matched codebook vectors along with residual vectors to represent the difference between the input stream vector and the codebook vector. The combination of index and residual is generally smaller than the input stream vector which they collectively encode, thus providing compression. The improved codebook may be generated from training video streams by grouping together similar types of data (e.g., image data, motion data, control data) from the video stream to generate longer vectors having higher dimensions and greater structure. This improves the ability of VQ to remove redundancy and thus increase compression efficiency. Storage space is thus reduced and video transmission may be faster.

    Method and Apparatus for Secondary Transform with Adaptive Kernel Options

    公开(公告)号:US20230083773A1

    公开(公告)日:2023-03-16

    申请号:US17897815

    申请日:2022-08-29

    摘要: This disclosure relates to secondary transform of video blocks with adaptive kernel options. For example, a method for decoding a video block in an video stream is disclosed. The method may include comprising parsing and processing the video stream to generate: a set of secondary transform coefficients associated with the video block; an intra-prediction mode associated with the video block; and a kernel index indicating a secondary transform kernel among a group of secondary transform kernels. The method may further include identifying the group of secondary transform kernels based on the intra-prediction mode; and performing an inverse secondary transform of the set of secondary transform coefficients to generate primary transform coefficients of the video block based on the secondary transform kernel among the group of secondary transform kernels identified by the kernel index. The quantity of kernels in the group of secondary transform kernels depends on at least one of: the intra-prediction mode associated with the video block; a size of the video block; or a primary transform type associated with the video block.

    VIDEO CODING AND DECODING
    8.
    发明申请

    公开(公告)号:US20230077992A1

    公开(公告)日:2023-03-16

    申请号:US18056193

    申请日:2022-11-16

    摘要: A method of encoding a motion information predictor index for an Affine Merge mode, comprising: generating a list of motion information predictor candidates; selecting one of the motion information predictor candidates in the list as an Affine Merge mode predictor; and generating a motion information predictor index for the selected motion information predictor candidate using CABAC coding, one or more bits of the motion information predictor index being bypass CABAC coded.

    CODING EFFICIENCY OF BOUNDARY COORDINATES OF SRCC SCAN REGION

    公开(公告)号:US20230061939A1

    公开(公告)日:2023-03-02

    申请号:US17978092

    申请日:2022-10-31

    发明人: Ye HU Liqiang WANG

    摘要: A video decoding method includes obtaining a size of a coding block of a video image frame, and decoding a bitstream of the coding block to obtain a value of a range variable associated with a SRCC scan region of the coding block. The range variable includes a width range variable and a height range variable. Each of the width range variable and the height range variable are obtained by comparing a prefix value to one or more predefined value intervals. The method further includes determining a horizontal coordinate of boundary coordinates of the SRCC scan region according to a difference between the width of the coding block and the width range variable and determining a vertical coordinate of the boundary coordinates of the SRCC scan region according to a difference between the height of the coding block and the height range variable.