Extracting data from documents using multiple deep learning models
摘要:
Techniques for automatically extracting data from documents using multiple deep learning models are provided. According to one set of embodiments, a computer system can receive a document in an electronic format and can segment, using an image segmentation deep learning model, the document into a plurality of segments, where each segment corresponds to a visually discrete portion of the document and is classified as being one of a plurality of types. The computer system can then, for each segment in the plurality of segments, retrieve text in the segment using optical character recognition (OCR) and extract data in the segment from the retrieved text using a named entity recognition (NER) deep learning model, where the retrieving and the extracting are performed in a manner that takes into account the segment's type.
信息查询
0/0