-
公开(公告)号:US11893047B1
公开(公告)日:2024-02-06
申请号:US18212533
申请日:2023-06-21
发明人: Julia Penfield , Aatish Suman , Veeru Talreja , Misbah Zahid Khan
IPC分类号: G06F40/40 , G06F16/31 , G06V30/416 , G06F40/295
CPC分类号: G06F16/316 , G06F40/295 , G06F40/40 , G06V30/416
摘要: Systems and methods for automated indexing and extraction of information in digital documents are disclosed. A method may comprise identifying a page containing targeted information; inputting an image of the page into a visual machine learning network (visual ML), wherein the visual ML is trained to recognize text associated with the targeted information in an image; identifying by the visual ML, a section of the image that contains the targeted information; inputting the digital document, and coordinates of the section into an extraction module; and extracting the targeted information by the extraction module from the section.