-
公开(公告)号:US20230222827A1
公开(公告)日:2023-07-13
申请号:US18181800
申请日:2023-03-10
Inventor: Wenjin Wang , Zhengjie Huang , Bin Luo , Qiming Peng , Weichong Yin , Shikun Feng , Shiwei Huang , Jingzhou He
IPC: G06V30/414 , G06V30/18 , G06F40/30 , G06F40/295
CPC classification number: G06V30/414 , G06F40/30 , G06F40/295 , G06V30/18143
Abstract: In a method for processing a document image, a document image to be processed is acquired. Text nodes of multiple granularities, visual nodes of multiple granularities, respective node information of the text nodes, and respective node information of the visual nodes in the document image are obtained. A multi-granularity and multi-modality document graph is construct based on the text nodes of multiple granularities, the visual nodes of multiple granularities, the respective node information of the text nodes and the respective node information of the visual nodes. Multi-granularity semantic feature information of the document image is determined based on the multi-granularity and multi-modality document graph, the respective node information of the text nodes and the respective node information of the visual nodes.