IMAGE PROCESSING METHOD, ELECTRONIC DEVICE AND STORAGE MEDIUM

Invention Application

US20220253631A1 IMAGE PROCESSING METHOD, ELECTRONIC DEVICE AND STORAGE MEDIUM 有权

Please log in to see more content

Patent Title: IMAGE PROCESSING METHOD, ELECTRONIC DEVICE AND STORAGE MEDIUM
Application No.: US17501221

Application Date: 2021-10-14
Publication No.: US20220253631A1

Publication Date: 2022-08-11
Inventor: Yulin LI , Ju HUANG , Qunyi XIE , Xiameng QIN , Chengquan ZHANG , Jingtuo LIU
Applicant: BEIJING BAIDU NETCOM SCIENCE TECHNOLOGY CO., LTD.
Applicant Address: CN Beijing
Assignee: BEIJING BAIDU NETCOM SCIENCE TECHNOLOGY CO., LTD.
Current Assignee: BEIJING BAIDU NETCOM SCIENCE TECHNOLOGY CO., LTD.
Current Assignee Address: CN Beijing
Priority: CN202110156565.9 20210204
Main IPC: G06K9/00
IPC: G06K9/00 ; G06K9/62 ; G06N3/04 ; G06F40/30

IMAGE PROCESSING METHOD, ELECTRONIC DEVICE AND STORAGE MEDIUM

Abstract:

The present disclosure discloses an image processing method, an electronic device and a storage medium, and relates to the field of artificial intelligence technologies, and particularly to the fields of computer vision technologies, deep learning technologies, or the like. The image processing method includes: acquiring a multi-modal feature of each of at least one text region in an image, the multi-modal feature including features in plural dimensions; performing a global attention processing operation on the multi-modal feature of each text region to obtain a global attention feature of each text region; determining a category of each text region based on the global attention feature of each text region; and constructing structured information based on text content and the category of each text region.

Information query

Global Dossier Espacenet

IPC分类:

G	物理
G06	计算；推算或计数
G06K	图形数据读取（图像或视频识别或理解G06V）；数据的呈现；记录载体；处理记录载体
G06K9/00	识别模式的方法或装置（图形读取或将机械参数模式（例如力或存在）转换为电信号的方法或装置 G06K11/00）（图像或视频识别或理解 G06V）（语音识别 G10L15/00 )