Invention Grant
- Patent Title: Automatic delineation and extraction of tabular data using machine learning
-
Application No.: US16659977Application Date: 2019-10-22
-
Publication No.: US11380116B2Publication Date: 2022-07-05
- Inventor: Peter Zhong , Antonio Jose Jimeno Yepes , Elaheh Shafieibavani
- Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
- Applicant Address: US NY Armonk
- Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
- Current Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
- Current Assignee Address: US NY Armonk
- Agency: Cantor Colburn LLP
- Agent Joseph Petrokaitis
- Main IPC: G06V30/414
- IPC: G06V30/414 ; G06N3/04 ; G06N20/00

Abstract:
A computer-implemented method for using a machine learning model to automatically extract tabular data from an image includes receiving a set of images of tabular data and a set of markup data corresponding respectively to the images of tabular data. The method further includes training a first neural network to delineate the tabular data into cells using the markup data, and training a second neural network to determine content of the cells in the tabular data using the markup data. The method further includes, upon receiving an input image containing a first tabular data without any markup data, generating an electronic output corresponding to the first tabular data by determining the structure of the first tabular data using the first neural network and extracting content of the first tabular data using the second neural network.
Public/Granted literature
- US20210117668A1 AUTOMATIC DELINEATION AND EXTRACTION OF TABULAR DATA USING MACHINE LEARNING Public/Granted day:2021-04-22
Information query