Invention Application
- Patent Title: METHOD AND SYSTEM FOR INFORMATION EXTRACTION FROM DOCUMENT IMAGES USING CONVERSATIONAL INTERFACE AND DATABASE QUERYING
-
Application No.: US16353570Application Date: 2019-03-14
-
Publication No.: US20200175304A1Publication Date: 2020-06-04
- Inventor: Lovekesh VIG , Gautam SHROFF , Arindam CHOWDHURY , Rohit RAHUL , Gunjan SEHGAL , Vishwanath DORESWAMY , Monika SHARMA , Ashwin SRINIVASAN
- Applicant: Tata Consultancy Services Limited
- Applicant Address: IN Mumbai
- Assignee: Tata Consultancy Services Limited
- Current Assignee: Tata Consultancy Services Limited
- Current Assignee Address: IN Mumbai
- Priority: com.zzzhc.datahub.patent.etl.us.BibliographicData$PriorityClaim@2f8606ff
- Main IPC: G06K9/46
- IPC: G06K9/46 ; G06F17/27 ; G06T5/00 ; G06K9/00 ; G06K9/62 ; G06F16/2452 ; G06F16/2455 ; G06F16/28 ; G06N3/08

Abstract:
Various methods are using SQL based data extraction for extracting relevant information from images. These are rule based methods of generating SQL-Query from NL, if any new English sentences are to be handled then manual intervention is required. Further becomes difficult for non-technical user. A system and method for extracting relevant from the images using a conversational interface and database querying have been provided. The system eliminates noisy effects, identifying the type of documents and detect various entities for diagrams. Further a schema is designed which allows an easy to understand abstraction of the entities detected by the deep vision models and the relationships between them. Relevant information and fields can then be extracted from the document by writing SQL queries on top of the relationship tables. A natural language based interface is added so that a non-technical user, specifying the queries in natural language, can fetch the information effortlessly.
Public/Granted literature
Information query