-
公开(公告)号:US11681875B2
公开(公告)日:2023-06-20
申请号:US16984231
申请日:2020-08-04
Inventor: Xiangkai Huang , Leyi Wang , Lei Nie , Siyu An , Minghao Liu , Jiangliang Guo
IPC: G06F17/00 , G06F40/30 , G06V30/262 , G06V30/413 , G06V30/19 , G06V10/82 , G06V30/412 , G06V30/28
CPC classification number: G06F40/30 , G06V10/82 , G06V30/19173 , G06V30/274 , G06V30/412 , G06V30/413 , G06V30/293
Abstract: The present application discloses a method for image text recognition, an apparatus, a device, and a storage medium, and relates to image processing technologies in the field of cloud computing. A specific implementation is: acquiring an image to be processed, where at least one text line exists in the image to be processed; processing each text line in the image to be processed to obtain a composite encoded vector corresponding to each word in each text line, where the composite encoded vector carries semantic information and position information; and determining a text recognition result of the image to be processed according to the semantic information and the position information carried in the composite encoded vector corresponding to each word in each text line. This technical solution can accurately distinguish adjacent fields with small pixel spacing in the image and improve the accuracy of text recognition in the image.
-
公开(公告)号:US20220277575A1
公开(公告)日:2022-09-01
申请号:US17743687
申请日:2022-05-13
Inventor: Xia Zhou , Leyi Wang , Qiaoyi Li , Duohao Qin , Minghao Liu
IPC: G06V30/413 , G06V10/75
Abstract: A method and apparatus for detecting a table. The method includes: acquiring a to-be-processed image; inputting the to-be-processed image into a pre-trained deep learning model, and outputting a full table detection branch result, a column detection branch result and a header detection branch result through the deep learning model; where the full table detection branch result represents a detection result for a full table in the to-be-processed image, the column detection branch result represents a detection result for a column in the table in the to-be-processed image, and the header detection branch result represents a detection result for a header in the to-be-processed image; and obtaining a detection result of the table in the to-be-processed image, based on the full table detection branch result, the column detection branch result and the header detection branch result.
-
公开(公告)号:US12154359B2
公开(公告)日:2024-11-26
申请号:US17743687
申请日:2022-05-13
Inventor: Xia Zhou , Leyi Wang , Qiaoyi Li , Duohao Qin , Minghao Liu
IPC: G06V30/413 , G06V10/75
Abstract: A method and apparatus for detecting a table. The method includes: acquiring a to-be-processed image; inputting the to-be-processed image into a pre-trained deep learning model, and outputting a full table detection branch result, a column detection branch result and a header detection branch result through the deep learning model; where the full table detection branch result represents a detection result for a full table in the to-be-processed image, the column detection branch result represents a detection result for a column in the table in the to-be-processed image, and the header detection branch result represents a detection result for a header in the to-be-processed image; and obtaining a detection result of the table in the to-be-processed image, based on the full table detection branch result, the column detection branch result and the header detection branch result.
-
-