HIERARCHICAL AUDIO/VIDEO OR PICTURE COMPRESSION METHOD AND APPARATUS

    公开(公告)号:US20230396810A1

    公开(公告)日:2023-12-07

    申请号:US18453933

    申请日:2023-08-22

    CPC classification number: H04N19/91 H04N19/184 G06V10/771

    Abstract: This application provides an audio/video or picture compression method and apparatus, which relates to the field of artificial intelligence (AI)-based audio/video or picture compression technologies, and to the field of neural network-based audio/video or picture compression technologies. The method includes: transforming a raw audio/video or picture to feature space through a multilayer convolution operation, extracting features of different layers in the feature space, outputting rounded feature signals of the different layers, predicting probability distribution of shallow feature signals by using deep feature signals or entropy estimation results, and performing entropy encoding on the rounded feature signals. In this application, signal correlation between different layers is utilized. In this way, audio/video or picture compression performance can be improved.

    METHOD AND APPARATUS FOR DETERMINING IMAGE LOSS VALUE, STORAGE MEDIUM, AND PROGRAM PRODUCT

    公开(公告)号:US20240223775A1

    公开(公告)日:2024-07-04

    申请号:US18604886

    申请日:2024-03-14

    CPC classification number: H04N19/154 H04N19/119 H04N19/20

    Abstract: Embodiments of this application disclose a method and an apparatus for determining an image loss value, a storage medium, and a program product, and belong to the field of image compression technologies. In this method, loss values of different areas in an image are determined based on a partition indication map of the image, and then a total loss value is determined based on the loss values of the different areas. The partition indication map may be used to distinguish between a heavily-structured area and a lightly-structured area in the image, that is, the partition indication map may be used to distinguish between an edge structure and a texture. When the total loss value is used to assess image reconstruction quality, the image reconstruction quality can be assessed more comprehensively, and assessment of reconstruction quality of the edge structure and the texture can be maximally prevented from mutual impact.

Patent Agency Ranking