System and method of recognizing data in a table area from unstructured data

    公开(公告)号:US11216425B2

    公开(公告)日:2022-01-04

    申请号:US16584585

    申请日:2019-09-26

    Abstract: A system and method of recognizing data in a table area from unstructured data includes a computer network, one or more processors communicatively coupled with the computer network, a storage location, and a graph-theoretic engine that receives an input stream of unstructured data associated. A table area is recognized from unstructured data, through one or more computer processors, from an input stream of unstructured data received over a computer network. One or more table headers associated with the detected one or more table areas are recognized. Further, one or more column delimiters associated with each column of the detected one or more table areas are determined. One or more tabular data associated with the detected one or more table areas are extracted. The extracted tabular data is mapped to one or more target schema to store onto a relational database.

    SYSTEM AND METHOD OF RECOGNIZING DATA IN A TABLE AREA FROM UNSTRUCTURED DATA

    公开(公告)号:US20200097451A1

    公开(公告)日:2020-03-26

    申请号:US16584585

    申请日:2019-09-26

    Abstract: A system and method of recognizing data in a table area from unstructured data includes a computer network, one or more processors communicatively coupled with the computer network, a storage location, and a graph-theoretic engine that receives an input stream of unstructured data associated. A table area is recognized from unstructured data, through one or more computer processors, from an input stream of unstructured data received over a computer network. One or more table headers associated with the detected one or more table areas are recognized. Further, one or more column delimiters associated with each column of the detected one or more table areas are determined. One or more tabular data associated with the detected one or more table areas are extracted. The extracted tabular data is mapped to one or more target schema to store onto a relational database.

Patent Agency Ranking