IMAGE PROCESSING METHOD AND APPARATUS, DEVICE AND STORAGE MEDIUM
摘要:
The present disclosure discloses an image processing method and apparatus, a device and a storage medium, and relates to the field of artificial intelligence technologies, and particularly to the fields of computer vision technologies, deep learning technologies, or the like. The image processing method includes: acquiring a multi-modal feature of each of at least one text region in an image, the multi-modal feature including features in plural dimensions; performing a global attention processing operation on the multi-modal feature of each text region to obtain a global attention feature of each text region; determining a category of each text region based on the global attention feature of each text region; and constructing structured information based on text content and the category of each text region. The present disclosure may provide a more universal construction scheme for structured information in an image.
信息查询
0/0