Method and system for extracting information from a document image
Abstract:
A method performed by a computing system includes receiving, by a document data extraction system (DDES), image data associated with a document. The DDES extracts, via optical character recognition (OCR) logic of the DDES, metadata from the image data. The metadata specifies sequences of text content items and text content item features associated with each text content item of the sequences of text content items. A machine learning logic (MLL) module of the DDES determines, based on the sequences of text content items and the text content item features, one or more text content items associated with a key. The DDES communicates information that specifies the key and a corresponding value that is associated with the one or more text content items that are associated with the key to a terminal.
Public/Granted literature
Information query
Patent Agency Ranking
0/0