Invention Grant
- Patent Title: Rotation and scaling for optical character recognition using end-to-end deep learning
-
Application No.: US16565614Application Date: 2019-09-10
-
Publication No.: US11302108B2Publication Date: 2022-04-12
- Inventor: Johannes Hoehne , Marco Spinaci , Anoop Raveendra Katti
- Applicant: SAP SE
- Applicant Address: DE Walldorf
- Assignee: SAP SE
- Current Assignee: SAP SE
- Current Assignee Address: DE Walldorf
- Agency: Sterne, Kessler, Goldstein & Fox P.L.L.C.
- Main IPC: G06K9/34
- IPC: G06K9/34 ; G06V30/148 ; G06N20/00 ; G06T7/10 ; G06F16/901 ; G06F17/18 ; G06V30/10

Abstract:
Disclosed herein are system, method, and computer program product embodiments for optical character recognition (OCR) pre-processing using machine learning. In an embodiment, a neural network may be trained to identify a standardized document rotation and scale expected by an OCR service performing character recognition. The neural network may then analyze a received document image to identify a corresponding rotation and scale of the document image relative to the expected standardized values. In response to this identification, the document image may be modified in the inverse to standardize the rotation and scale of the document image to match the format expected by the OCR service. In some embodiments, a neural network may perform the standardization as well as the character recognition using a shared computation graph.
Public/Granted literature
- US20210073566A1 ROTATION AND SCALING FOR OPTICAL CHARACTER RECOGNITION USING END-TO-END DEEP LEARNING Public/Granted day:2021-03-11
Information query