-
公开(公告)号:US20220157041A1
公开(公告)日:2022-05-19
申请号:US17587689
申请日:2022-01-28
Applicant: HUAWEI TECHNOLOGIES CO., LTD.
Inventor: Kai HAN , Yunhe WANG , Han SHU , Chunjing XU
IPC: G06V10/44 , G06V10/764 , G06V10/82
Abstract: This application relates to an image recognition technology in the field of computer vision in the field of artificial intelligence, and provides an image classification method and apparatus. The method includes: obtaining an input feature map of a to-be-processed image; performing convolution processing on the input feature map based on M convolution kernels of a neural network, to obtain a candidate output feature map of M channels, where M is a positive integer; performing matrix transformation on the M channels of the candidate output feature map based on N matrices, to obtain an output feature map of N channels, where a quantity of channels of each of the N matrices is less than M, N is greater than M, and N is a positive integer; and classify the to-be-processed image based on the output feature map, to obtain a classification result of the to-be-processed image.
-
公开(公告)号:US20240185573A1
公开(公告)日:2024-06-06
申请号:US18441229
申请日:2024-02-14
Applicant: HUAWEI TECHNOLOGIES CO., LTD.
Inventor: Han SHU , Jiahao WANG , Hanting CHEN , Wenshuo LI , Yunhe WANG
IPC: G06V10/77 , G06V10/764 , G06V10/80 , G06V10/82
CPC classification number: G06V10/7715 , G06V10/764 , G06V10/806 , G06V10/82
Abstract: This disclosure provides an image classification method and a related device thereof. The method includes the following operations: After obtaining a target image, a transformer network may perform linear transformation processing based on the target image to obtain a Q-feature, a K-feature, and a V-feature. The transformer network calculates a distance between the Q-feature and the K-feature to obtain an attention feature. Then, the transformer network performs fusion processing on the attention feature and the V-feature, and obtains a classification result of the target image based on a fused feature.
-