- 专利标题: METHOD AND SYSTEM FOR RELEVANT DATA EXTRACTION FROM A DOCUMENT
-
申请号: US18239778申请日: 2023-08-30
-
公开(公告)号: US20240355136A1公开(公告)日: 2024-10-24
- 发明人: NIRMAL RAMESH RAYULU VANAPALLI VENKATA , MADHUSUDAN SINGH , TAMILARASAN ELLAPPAN
- 申请人: L&T TECHNOLOGY SERVICES LIMITED
- 申请人地址: US TN Chennai
- 专利权人: L&T TECHNOLOGY SERVICES LIMITED
- 当前专利权人: L&T TECHNOLOGY SERVICES LIMITED
- 当前专利权人地址: US TN Chennai
- 优先权: IN 2341028817 2023.04.20
- 主分类号: G06V30/414
- IPC分类号: G06V30/414 ; G06F40/169 ; G06F40/186 ; G06V10/94 ; G06V20/62 ; G06V30/19
摘要:
A method and system for relevant data extraction from a document is disclosed. The method includes determining first positional information corresponding to a key from a plurality of predefined keys in the document image based on a deep learning model. Further, second positional information corresponding to the key is determined based on OCR of the document image and an NLP model. Final positional information is determined based on the first positional information and the second positional information, in case a difference between the first positional information and the second positional information is minimal. Relevant data is extracted for the key in the OCR document image based on the final positional information.
信息查询