Publication (Announcement) No.: US12175198B2
Publication (Announcement) Date: 2024-12-24
Application No.: US17952103
Application Date: 2022-09-23
Inventor: Yingqi Sun
IPC: G06F40/30, G06F40/103, G06F40/279, G06V30/19, G06V30/412
Abstract: A method of document processing is provided. An implementation is as follows: obtaining target text information and target layout information of a target document, wherein the target text information includes target text included in the target document and character position information of the target text, and the target layout information characterizes the region where text in the target document is located; fusing the target text information and the target layout information to obtain first multimodal information of the target document; and inputting the first multimodal information into an intelligent document comprehension model and obtaining, as output by the model, at least one target word in the target document and at least one feature vector corresponding to the at least one target word, wherein each target word is related to semantics of the target document.
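
The three steps named in the abstract (obtain text and layout information, fuse them, pass the result to a model that returns target words with feature vectors) can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration and is not taken from the patent: the names `TextToken`, `LayoutRegion`, `MultimodalToken`, `fuse`, and `toy_comprehension_model` are hypothetical, fusion is approximated by assigning each token to the layout region containing its center, and the "model" is a hand-written scoring stub standing in for the intelligent document comprehension model, whose actual architecture the abstract does not describe.

```python
from dataclasses import dataclass
from typing import List, Tuple

Box = Tuple[int, int, int, int]  # (x0, y0, x1, y1) in page coordinates


@dataclass
class TextToken:
    """Target text information: a piece of text plus its position."""
    text: str
    box: Box


@dataclass
class LayoutRegion:
    """Target layout information: a region where text is located."""
    label: str  # hypothetical labels such as "title" or "paragraph"
    box: Box


@dataclass
class MultimodalToken:
    """Fused record combining text, position, and layout region."""
    text: str
    box: Box
    region_index: int  # -1 when no region contains the token
    region_label: str


def _center_inside(box: Box, region: Box) -> bool:
    # A token belongs to a region if the token's center lies inside it.
    cx = (box[0] + box[2]) / 2
    cy = (box[1] + box[3]) / 2
    return region[0] <= cx <= region[2] and region[1] <= cy <= region[3]


def fuse(tokens: List[TextToken], regions: List[LayoutRegion]) -> List[MultimodalToken]:
    """Assumed fusion step: attach the first containing region to each token."""
    fused = []
    for tok in tokens:
        idx, label = -1, "none"
        for i, reg in enumerate(regions):
            if _center_inside(tok.box, reg.box):
                idx, label = i, reg.label
                break
        fused.append(MultimodalToken(tok.text, tok.box, idx, label))
    return fused


def toy_comprehension_model(
    fused: List[MultimodalToken], top_k: int = 2
) -> List[Tuple[str, List[float]]]:
    """Stand-in for the model: NOT the patented model, just a scoring stub.

    Scores each token by text length, with a bonus for title regions, and
    returns the top_k words with a small hand-built feature vector each.
    """
    scored = []
    for tok in fused:
        score = len(tok.text) + (5 if tok.region_label == "title" else 0)
        vector = [float(tok.box[0]), float(tok.box[1]),
                  float(tok.region_index), float(score)]
        scored.append((score, tok.text, vector))
    scored.sort(key=lambda item: -item[0])  # highest score first
    return [(text, vec) for _, text, vec in scored[:top_k]]


tokens = [
    TextToken("Invoice", (10, 10, 60, 20)),
    TextToken("total", (10, 40, 40, 50)),
    TextToken("due", (45, 40, 60, 50)),
]
regions = [
    LayoutRegion("title", (0, 0, 100, 30)),
    LayoutRegion("paragraph", (0, 30, 100, 100)),
]

fused = fuse(tokens, regions)
target_words = toy_comprehension_model(fused, top_k=2)
```

In a real system the stub would presumably be replaced by a trained model that consumes the fused representation; the sketch only shows how text, position, and layout data might flow through the three steps.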