Patent search ap:("BEIJING BAIDU NETCOM SCIENCE TECHNOLOGY CO. Page LTD.") AND inv:"Shangwen LYU"

1.

发明申请
METHOD AND DEVICE FOR TRAINING, BASED ON CROSSMODAL INFORMATION, DOCUMENT READING COMPREHENSION MODEL 有权

公开(公告)号：US20230061398A1

公开(公告)日：2023-03-02

申请号：US17984034

申请日：2022-11-09

Applicant: BEIJING BAIDU NETCOM SCIENCE TECHNOLOGY CO., LTD.

Inventor： Shangwen LYU , Hongyu LI , Jing LIU , Hua WU , Haifeng WANG

IPC: G06V30/19 , G06V30/412 , G06V30/194 , G06F40/205

Abstract: A method for training a document reading comprehension model includes: acquiring a question sample and a rich-text document sample, in which the rich-text document sample includes a real answer of the question sample; acquiring text information and layout information of the rich-text document sample by performing OCR processing on image information of the rich-text document sample; acquiring a predicted answer of the question sample by inputting the text information, the layout information and the image information of the rich-text document sample into a preset reading comprehension model; and training the reading comprehension model based on the real answer and the predicted answer. The method may enhance comprehension ability of the reading comprehension model to the long rich-text document, and save labor cost.

Patent Agency Ranking