-
Publication No.: US20240193978A1
Publication date: 2024-06-13
Application No.: US18065352
Application date: 2022-12-13
Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
Inventor: Ang Yi, Jing Zhang, Hai Cheng Wang, Jun Hong Zhao, Rajesh M. Desai, Yang Zhong Li, Ye Chen
IPC: G06V30/412, G06V10/26, G06V10/74, G06V10/75, G06V30/413, G06V30/414
CPC classification number: G06V30/412, G06V10/273, G06V10/751, G06V10/761, G06V30/413, G06V30/414
Abstract: Computer-implemented methods, systems, and computer program products include program code executing on a processor(s) that merges a document comprising multiple pages into a single document image. The program code processes the single document image to identify structural elements and textual content. The program code compares the structural elements of the single document image to other structural elements of a group of document templates stored in a database to identify a subset of the group of document templates with a threshold number of similarities to the single document image. The program code generates, from the single document image, a graph structure representing the document, where the graph structure comprises visual information and connections related to the structural elements and concepts comprising the textual content. The program code uses the graph structure to identify a document template that is a closest match to the document.
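The two-stage matching described in this abstract (shortlist templates by a threshold number of structural similarities, then pick the closest match using a graph) can be illustrated with a minimal sketch. Everything below is an assumption made for illustration and not the method claimed in the application: structural elements are modeled as plain string labels, the graph as a set of (element, concept) edges, and "closest match" as the highest Jaccard similarity between edge sets. The names `shortlist_templates`, `jaccard`, and `closest_template` are hypothetical.

```python
from typing import Dict, List, Optional, Set, Tuple

# Hypothetical graph encoding: one edge per (structural element, concept) link.
Edge = Tuple[str, str]


def shortlist_templates(doc_elements: Set[str],
                        templates: Dict[str, Set[str]],
                        min_shared: int) -> List[str]:
    """Keep templates that share at least `min_shared` structural elements."""
    return [name for name, elements in templates.items()
            if len(doc_elements & elements) >= min_shared]


def jaccard(a: Set[Edge], b: Set[Edge]) -> float:
    """Jaccard similarity of two edge sets (assumed similarity measure)."""
    union = a | b
    return len(a & b) / len(union) if union else 1.0


def closest_template(doc_graph: Set[Edge],
                     template_graphs: Dict[str, Set[Edge]],
                     candidates: List[str]) -> Optional[str]:
    """Return the shortlisted template whose graph is most similar, if any."""
    if not candidates:
        return None
    return max(candidates,
               key=lambda name: jaccard(doc_graph, template_graphs[name]))


# Illustrative data only; none of these values come from the application.
doc_elements = {"header", "table", "signature", "footer"}
templates = {
    "invoice": {"header", "table", "footer", "logo"},
    "contract": {"header", "signature", "footer"},
    "memo": {"header"},
}
doc_graph = {("table", "amount"), ("header", "vendor"), ("footer", "page")}
template_graphs = {
    "invoice": {("table", "amount"), ("header", "vendor"), ("footer", "terms")},
    "contract": {("signature", "party"), ("footer", "page")},
    "memo": {("header", "subject")},
}

shortlist = shortlist_templates(doc_elements, templates, min_shared=3)
best = closest_template(doc_graph, template_graphs, shortlist)
```

The cheap set comparison runs against every stored template, while the more detailed graph comparison runs only on the shortlist, which is one plausible reading of why the abstract separates the two steps.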
-
Publication No.: US20240046677A1
Publication date: 2024-02-08
Application No.: US17814856
Application date: 2022-07-26
Applicant: International Business Machines Corporation
Inventor: Ang Yi, Jing Zhang, Hai Cheng Wang, Jun Hong Zhao, Rajesh M. Desai, Yang Zhong Li, Xue Xu
IPC: G06V30/148, G06V30/18
CPC classification number: G06V30/153, G06V30/18181
Abstract: A computer-implemented method for text block segmentation includes determining a first text block segmentation pattern utilized to generate a segmented text block based, at least in part, on a comparison of semantic information associated with the segmented text block and a plurality of predefined types of text block segmentation patterns indicated by a graph; calculating a first degree of confidence in a size of the segmented text block based, at least in part, on comparing semantic entities associated with the segmented text block with semantic entities indicated by leaf nodes stemming from a first non-leaf node included in the graph and representative of the first type of text block segmentation pattern; and determining that the size of the segmented text block is non-optimal based on the calculated degree of confidence in the size of the segmented text block being below a predetermined threshold.
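The confidence check described in this abstract can be illustrated with a minimal sketch. The abstract does not specify how the graph is encoded or how the degree of confidence is calculated, so the choices below are assumptions: the graph is a dictionary mapping each non-leaf pattern node to the set of semantic entities on its leaf nodes, the pattern is chosen by largest entity overlap, and confidence is the Jaccard similarity between the block's entities and the pattern's leaf entities. The names `PATTERN_GRAPH`, `match_pattern`, `size_confidence`, and `assess_block` are hypothetical.

```python
from typing import Dict, Set, Tuple

# Hypothetical graph: non-leaf node (pattern type) -> leaf-node semantic entities.
PATTERN_GRAPH: Dict[str, Set[str]] = {
    "address_block": {"street", "city", "postal_code", "country"},
    "invoice_line": {"item", "quantity", "unit_price", "total"},
}


def match_pattern(block_entities: Set[str], graph: Dict[str, Set[str]]) -> str:
    """Pick the pattern whose leaf entities overlap most with the block."""
    return max(graph, key=lambda pattern: len(block_entities & graph[pattern]))


def size_confidence(block_entities: Set[str], expected: Set[str]) -> float:
    """Assumed confidence measure: Jaccard similarity of the two entity sets.

    Missing entities (block too small) and extra entities (block too large)
    both lower the score.
    """
    union = block_entities | expected
    return len(block_entities & expected) / len(union) if union else 0.0


def assess_block(block_entities: Set[str],
                 graph: Dict[str, Set[str]],
                 threshold: float) -> Tuple[str, float, bool]:
    """Return (pattern, confidence, non_optimal) for a segmented text block."""
    pattern = match_pattern(block_entities, graph)
    confidence = size_confidence(block_entities, graph[pattern])
    return pattern, confidence, confidence < threshold


# Illustrative blocks only; the threshold value is arbitrary.
small_block = {"street", "city"}
full_block = {"street", "city", "postal_code", "country"}

small_result = assess_block(small_block, PATTERN_GRAPH, threshold=0.75)
full_result = assess_block(full_block, PATTERN_GRAPH, threshold=0.75)
```

In this sketch the block containing only a street and a city matches the address pattern with low confidence and is flagged as non-optimal, while the block containing all four address entities is not.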
-