Invention Application
- Patent Title: IMAGE PROCESSING METHOD, ELECTRONIC DEVICE AND STORAGE MEDIUM
-
Application No.: US17501221Application Date: 2021-10-14
-
Publication No.: US20220253631A1Publication Date: 2022-08-11
- Inventor: Yulin LI , Ju HUANG , Qunyi XIE , Xiameng QIN , Chengquan ZHANG , Jingtuo LIU
- Applicant: BEIJING BAIDU NETCOM SCIENCE TECHNOLOGY CO., LTD.
- Applicant Address: CN Beijing
- Assignee: BEIJING BAIDU NETCOM SCIENCE TECHNOLOGY CO., LTD.
- Current Assignee: BEIJING BAIDU NETCOM SCIENCE TECHNOLOGY CO., LTD.
- Current Assignee Address: CN Beijing
- Priority: CN202110156565.9 20210204
- Main IPC: G06K9/00
- IPC: G06K9/00 ; G06K9/62 ; G06N3/04 ; G06F40/30

Abstract:
The present disclosure discloses an image processing method, an electronic device and a storage medium, and relates to the field of artificial intelligence technologies, and particularly to the fields of computer vision technologies, deep learning technologies, or the like. The image processing method includes: acquiring a multi-modal feature of each of at least one text region in an image, the multi-modal feature including features in plural dimensions; performing a global attention processing operation on the multi-modal feature of each text region to obtain a global attention feature of each text region; determining a category of each text region based on the global attention feature of each text region; and constructing structured information based on text content and the category of each text region.
Information query