-
公开(公告)号:US20250150588A1
公开(公告)日:2025-05-08
申请号:US19011813
申请日:2025-01-07
Applicant: HUAWEI TECHNOLOGIES CO., LTD.
Inventor: Yihui FENG , Tiansheng GUO , Yibo SHI , Jing WANG
IPC: H04N19/124 , G06V10/52 , G06V10/77 , G06V10/82 , H04N19/13 , H04N19/196
Abstract: This application provides a data encoding and decoding method and a related device. In the encoding method, side information feature extraction is performed on a first feature map of current data, and quantization processing is performed, to obtain a first quantized feature map. Entropy encoding is performed based on the first quantized feature map, to obtain a first bitstream of the current data. A scaling coefficient is obtained based on the first quantized feature map, scaling processing is performed on a second feature map based on the scaling coefficient, and quantization processing is performed, to obtain a second quantized feature map. Scaling processing is performed on a first probability distribution parameter based on the scaling coefficient, to obtain a second probability distribution parameter, and entropy encoding is performed on the second quantized feature map based on the second probability distribution parameter, to obtain a second bitstream of the current data.
-
公开(公告)号:US20250150591A1
公开(公告)日:2025-05-08
申请号:US19012441
申请日:2025-01-07
Applicant: HUAWEI TECHNOLOGIES CO., LTD.
Inventor: Yibo SHI , Yunqi HUANG , Jing WANG
IPC: H04N19/13 , H04N19/167 , H04N19/196
Abstract: This application discloses an encoding and decoding method and apparatus, and a computer device, and is applied to the field of image processing technologies. In the method, a feature map of an image is obtained through an encoding network. A boundary value of a first element in the feature map is determined, and a target range corresponding to the first element is determined based on the boundary value. If the first element is in the target range, entropy encoding is performed on the first element. If the first element is outside the target range, the first element is modified to the boundary value corresponding to the first element, and entropy encoding is performed on the modified first element, or the first element is marked as an out-of-bounds element, and variable-length code encoding is performed on the first element.
-
公开(公告)号:US20240221230A1
公开(公告)日:2024-07-04
申请号:US18604842
申请日:2024-03-14
Applicant: HUAWEI TECHNOLOGIES CO., LTD.
Inventor: Yibo SHI , Yunying GE , Jing WANG , Jue MAO , Yin ZHAO , Haitao YANG
IPC: G06T9/00
Abstract: This application provides a feature map encoding and decoding method and an apparatus, and relates to the field of artificial intelligence (AI)-based data encoding and decoding technologies. The feature map decoding method includes: obtaining a bitstream of a to-be-decoded feature map, where the to-be-decoded feature map includes a plurality of feature elements; obtaining a first probability estimation result corresponding to each feature element based on the bitstream, where the first probability estimation result includes a first peak probability; determining a set of first feature elements and a set of second feature elements from the plurality of feature elements based on a first threshold and the first peak probability corresponding to each feature element; and obtaining a decoded feature map based on the set of first feature elements and the set of second feature elements. This can improve encoding and decoding performance while reducing encoding and decoding complexity.
-
公开(公告)号:US20230396810A1
公开(公告)日:2023-12-07
申请号:US18453933
申请日:2023-08-22
Applicant: HUAWEI TECHNOLOGIES CO., LTD.
Inventor: Yunying GE , Jing WANG , Yibo SHI , Shangyin GAO
IPC: H04N19/91 , H04N19/184 , G06V10/771
CPC classification number: H04N19/91 , H04N19/184 , G06V10/771
Abstract: This application provides an audio/video or picture compression method and apparatus, which relates to the field of artificial intelligence (AI)-based audio/video or picture compression technologies, and to the field of neural network-based audio/video or picture compression technologies. The method includes: transforming a raw audio/video or picture to feature space through a multilayer convolution operation, extracting features of different layers in the feature space, outputting rounded feature signals of the different layers, predicting probability distribution of shallow feature signals by using deep feature signals or entropy estimation results, and performing entropy encoding on the rounded feature signals. In this application, signal correlation between different layers is utilized. In this way, audio/video or picture compression performance can be improved.
-
公开(公告)号:US20240422324A1
公开(公告)日:2024-12-19
申请号:US18819998
申请日:2024-08-29
Applicant: HUAWEI TECHNOLOGIES CO., LTD.
Inventor: Yunying GE , Jing WANG , Yibo SHI
IPC: H04N19/137 , H04N19/169 , H04N19/30
Abstract: This application provides a feature domain optical flow determining method and a related device, and relates to the field of video or picture compression technologies based on artificial intelligence (AI). The method specifically includes: obtaining a picture domain optical flow between a current frame and a reference frame; performing multi-scale feature extraction on the reference frame, to obtain M feature maps of the reference frame, where M is an integer greater than or equal to 1; and performing M times of feature domain optical flow estimation based on the M feature maps of the reference frame and the picture domain optical flow between the current frame and the reference frame, to obtain M feature domain optical flows. A feature domain optical flow obtained by using the solutions of this application is more accurate and more stable, thereby improving inter-prediction accuracy.
-
6.
公开(公告)号:US20240223775A1
公开(公告)日:2024-07-04
申请号:US18604886
申请日:2024-03-14
Applicant: HUAWEI TECHNOLOGIES CO., LTD.
Inventor: Shangyin GAO , Jing WANG , Zhongying QIU , Meng LI , Yibo SHI
IPC: H04N19/154 , H04N19/119 , H04N19/20
CPC classification number: H04N19/154 , H04N19/119 , H04N19/20
Abstract: Embodiments of this application disclose a method and an apparatus for determining an image loss value, a storage medium, and a program product, and belong to the field of image compression technologies. In this method, loss values of different areas in an image are determined based on a partition indication map of the image, and then a total loss value is determined based on the loss values of the different areas. The partition indication map may be used to distinguish between a heavily-structured area and a lightly-structured area in the image, that is, the partition indication map may be used to distinguish between an edge structure and a texture. When the total loss value is used to assess image reconstruction quality, the image reconstruction quality can be assessed more comprehensively, and assessment of reconstruction quality of the edge structure and the texture can be maximally prevented from mutual impact.
-
7.
公开(公告)号:US20240095964A1
公开(公告)日:2024-03-21
申请号:US18521067
申请日:2023-11-28
Applicant: HUAWEI TECHNOLOGIES CO., LTD.
IPC: G06T9/00
CPC classification number: G06T9/00
Abstract: Embodiments of this application disclose encoding and decoding methods, apparatuses, and devices, a storage medium, and a computer program, which relate to the field of encoding and decoding technologies. In embodiments of this application, in a decoding process, a plurality of feature points are divided into a plurality of groups based on a specified numerical value, and probability distributions of feature points in a same group are determined in parallel to improve decoding efficiency. Correspondingly, in an encoding process, the plurality of feature points are also grouped in a same grouping manner, and first image features of each group of feature points in the plurality of groups are sequentially encoded into a bit stream. To be concise, this solution can break through an efficiency bottleneck caused by serial computing when decoding is performed based on a variational auto-encoder (VAE), thereby effectively improving decoding efficiency.
-
-
-
-
-
-