-
公开(公告)号:US20230061398A1
公开(公告)日:2023-03-02
申请号:US17984034
申请日:2022-11-09
Inventor: Shangwen LYU , Hongyu LI , Jing LIU , Hua WU , Haifeng WANG
IPC: G06V30/19 , G06V30/412 , G06V30/194 , G06F40/205
Abstract: A method for training a document reading comprehension model includes: acquiring a question sample and a rich-text document sample, in which the rich-text document sample includes a real answer of the question sample; acquiring text information and layout information of the rich-text document sample by performing OCR processing on image information of the rich-text document sample; acquiring a predicted answer of the question sample by inputting the text information, the layout information and the image information of the rich-text document sample into a preset reading comprehension model; and training the reading comprehension model based on the real answer and the predicted answer. The method may enhance comprehension ability of the reading comprehension model to the long rich-text document, and save labor cost.