DATA ENCODING AND DECODING METHOD AND RELATED DEVICE

    公开(公告)号:US20250150588A1

    公开(公告)日:2025-05-08

    申请号:US19011813

    申请日:2025-01-07

    Abstract: This application provides a data encoding and decoding method and a related device. In the encoding method, side information feature extraction is performed on a first feature map of current data, and quantization processing is performed, to obtain a first quantized feature map. Entropy encoding is performed based on the first quantized feature map, to obtain a first bitstream of the current data. A scaling coefficient is obtained based on the first quantized feature map, scaling processing is performed on a second feature map based on the scaling coefficient, and quantization processing is performed, to obtain a second quantized feature map. Scaling processing is performed on a first probability distribution parameter based on the scaling coefficient, to obtain a second probability distribution parameter, and entropy encoding is performed on the second quantized feature map based on the second probability distribution parameter, to obtain a second bitstream of the current data.

    ENCODING AND DECODING METHOD AND APPARATUS, AND COMPUTER DEVICE

    公开(公告)号:US20250150591A1

    公开(公告)日:2025-05-08

    申请号:US19012441

    申请日:2025-01-07

    Abstract: This application discloses an encoding and decoding method and apparatus, and a computer device, and is applied to the field of image processing technologies. In the method, a feature map of an image is obtained through an encoding network. A boundary value of a first element in the feature map is determined, and a target range corresponding to the first element is determined based on the boundary value. If the first element is in the target range, entropy encoding is performed on the first element. If the first element is outside the target range, the first element is modified to the boundary value corresponding to the first element, and entropy encoding is performed on the modified first element, or the first element is marked as an out-of-bounds element, and variable-length code encoding is performed on the first element.

    FEATURE MAP ENCODING AND DECODING METHOD AND APPARATUS

    公开(公告)号:US20240221230A1

    公开(公告)日:2024-07-04

    申请号:US18604842

    申请日:2024-03-14

    CPC classification number: G06T9/001 G06T9/002

    Abstract: This application provides a feature map encoding and decoding method and an apparatus, and relates to the field of artificial intelligence (AI)-based data encoding and decoding technologies. The feature map decoding method includes: obtaining a bitstream of a to-be-decoded feature map, where the to-be-decoded feature map includes a plurality of feature elements; obtaining a first probability estimation result corresponding to each feature element based on the bitstream, where the first probability estimation result includes a first peak probability; determining a set of first feature elements and a set of second feature elements from the plurality of feature elements based on a first threshold and the first peak probability corresponding to each feature element; and obtaining a decoded feature map based on the set of first feature elements and the set of second feature elements. This can improve encoding and decoding performance while reducing encoding and decoding complexity.

    HIERARCHICAL AUDIO/VIDEO OR PICTURE COMPRESSION METHOD AND APPARATUS

    公开(公告)号:US20230396810A1

    公开(公告)日:2023-12-07

    申请号:US18453933

    申请日:2023-08-22

    CPC classification number: H04N19/91 H04N19/184 G06V10/771

    Abstract: This application provides an audio/video or picture compression method and apparatus, which relates to the field of artificial intelligence (AI)-based audio/video or picture compression technologies, and to the field of neural network-based audio/video or picture compression technologies. The method includes: transforming a raw audio/video or picture to feature space through a multilayer convolution operation, extracting features of different layers in the feature space, outputting rounded feature signals of the different layers, predicting probability distribution of shallow feature signals by using deep feature signals or entropy estimation results, and performing entropy encoding on the rounded feature signals. In this application, signal correlation between different layers is utilized. In this way, audio/video or picture compression performance can be improved.

    FEATURE DOMAIN OPTICAL FLOW DETERMINING METHOD AND RELATED DEVICE

    公开(公告)号:US20240422324A1

    公开(公告)日:2024-12-19

    申请号:US18819998

    申请日:2024-08-29

    Abstract: This application provides a feature domain optical flow determining method and a related device, and relates to the field of video or picture compression technologies based on artificial intelligence (AI). The method specifically includes: obtaining a picture domain optical flow between a current frame and a reference frame; performing multi-scale feature extraction on the reference frame, to obtain M feature maps of the reference frame, where M is an integer greater than or equal to 1; and performing M times of feature domain optical flow estimation based on the M feature maps of the reference frame and the picture domain optical flow between the current frame and the reference frame, to obtain M feature domain optical flows. A feature domain optical flow obtained by using the solutions of this application is more accurate and more stable, thereby improving inter-prediction accuracy.

    METHOD AND APPARATUS FOR DETERMINING IMAGE LOSS VALUE, STORAGE MEDIUM, AND PROGRAM PRODUCT

    公开(公告)号:US20240223775A1

    公开(公告)日:2024-07-04

    申请号:US18604886

    申请日:2024-03-14

    CPC classification number: H04N19/154 H04N19/119 H04N19/20

    Abstract: Embodiments of this application disclose a method and an apparatus for determining an image loss value, a storage medium, and a program product, and belong to the field of image compression technologies. In this method, loss values of different areas in an image are determined based on a partition indication map of the image, and then a total loss value is determined based on the loss values of the different areas. The partition indication map may be used to distinguish between a heavily-structured area and a lightly-structured area in the image, that is, the partition indication map may be used to distinguish between an edge structure and a texture. When the total loss value is used to assess image reconstruction quality, the image reconstruction quality can be assessed more comprehensively, and assessment of reconstruction quality of the edge structure and the texture can be maximally prevented from mutual impact.

    ENCODING AND DECODING METHOD, APPARATUS, AND DEVICE, STORAGE MEDIUM, COMPUTER PROGRAM, AND COMPUTER PROGRAM PRODUCT

    公开(公告)号:US20240095964A1

    公开(公告)日:2024-03-21

    申请号:US18521067

    申请日:2023-11-28

    Inventor: Yibo SHI Jing WANG

    CPC classification number: G06T9/00

    Abstract: Embodiments of this application disclose encoding and decoding methods, apparatuses, and devices, a storage medium, and a computer program, which relate to the field of encoding and decoding technologies. In embodiments of this application, in a decoding process, a plurality of feature points are divided into a plurality of groups based on a specified numerical value, and probability distributions of feature points in a same group are determined in parallel to improve decoding efficiency. Correspondingly, in an encoding process, the plurality of feature points are also grouped in a same grouping manner, and first image features of each group of feature points in the plurality of groups are sequentially encoded into a bit stream. To be concise, this solution can break through an efficiency bottleneck caused by serial computing when decoding is performed based on a variational auto-encoder (VAE), thereby effectively improving decoding efficiency.

Patent Agency Ranking