Patent search ap:("BEIJING BAIDU NETCOM SCIENCE TECHNOLOGY CO. Page LTD.") AND inv:"Bin Luo"

1.

发明公开
METHOD AND APPARATUS FOR PROCESSING DOCUMENT IMAGE, AND ELECTRONIC DEVICE 审中-公开

公开(公告)号：US20230222827A1

公开(公告)日：2023-07-13

申请号：US18181800

申请日：2023-03-10

Applicant: BEIJING BAIDU NETCOM SCIENCE TECHNOLOGY CO., LTD.

Inventor： Wenjin Wang , Zhengjie Huang , Bin Luo , Qiming Peng , Weichong Yin , Shikun Feng , Shiwei Huang , Jingzhou He

IPC: G06V30/414 , G06V30/18 , G06F40/30 , G06F40/295

CPC classification number: G06V30/414 , G06F40/30 , G06F40/295 , G06V30/18143

Abstract: In a method for processing a document image, a document image to be processed is acquired. Text nodes of multiple granularities, visual nodes of multiple granularities, respective node information of the text nodes, and respective node information of the visual nodes in the document image are obtained. A multi-granularity and multi-modality document graph is construct based on the text nodes of multiple granularities, the visual nodes of multiple granularities, the respective node information of the text nodes and the respective node information of the visual nodes. Multi-granularity semantic feature information of the document image is determined based on the multi-granularity and multi-modality document graph, the respective node information of the text nodes and the respective node information of the visual nodes.

Patent Agency Ranking