-
公开(公告)号:US12300012B2
公开(公告)日:2025-05-13
申请号:US17984034
申请日:2022-11-09
Inventor: Shangwen Lyu , Hongyu Li , Jing Liu , Hua Wu , Haifeng Wang
IPC: G06V30/19 , G06F40/205 , G06V30/194 , G06V30/412
Abstract: A method for training a document reading comprehension model includes: acquiring a question sample and a rich-text document sample, in which the rich-text document sample includes a real answer of the question sample; acquiring text information and layout information of the rich-text document sample by performing OCR processing on image information of the rich-text document sample; acquiring a predicted answer of the question sample by inputting the text information, the layout information and the image information of the rich-text document sample into a preset reading comprehension model; and training the reading comprehension model based on the real answer and the predicted answer. The method may enhance comprehension ability of the reading comprehension model to the long rich-text document, and save labor cost.