DOCUMENT STRUCTURE IDENTIFICATION USING POST-PROCESSING ERROR CORRECTION
Abstract:
A method comprises determining instance bounds associated with each of one or more structural elements in a document using a machine learning model. The method further comprises determining an error in the instance bounds associated with a particular one of the one or more structural elements. The method further comprises correcting the error in the instance bounds associated with the particular structural element using document content associated with the particular structural element, thereby generating corrected instance bounds associated with the particular structural element. The method further comprises generating a structural map of the document based on the corrected instance bounds.
Information query
Patent Agency Ranking
0/0