Systems and methods for extracting information from a physical document

    公开(公告)号:US12033412B2

    公开(公告)日:2024-07-09

    申请号:US17291647

    申请日:2019-01-28

    Applicant: Google LLC

    CPC classification number: G06V30/40 G06F40/169 G06V30/10

    Abstract: Systems and methods for extracting information from documents are provided. In one example embodiment, a computer-implemented method includes obtaining one or more units of text from an image of a document. The method includes determining one or more annotated values from the one or more units of text and determining a set of candidate labels for each annotated value. The method determines each set of candidate labels by performing a search for the candidate labels based at least in part on a language associated with the document and a location of each annotated value. The method includes determining a canonical label for each annotated value based at least in part on the associated candidate labels, and mapping at least one annotated value to an action that is presented to a user based at least in part on the canonical label associated with the annotated value.

    Systems and Methods for Extracting Information from a Physical Document

    公开(公告)号:US20240404308A1

    公开(公告)日:2024-12-05

    申请号:US18671218

    申请日:2024-05-22

    Applicant: Google LLC

    Abstract: Systems and methods for extracting information from documents are provided. In one example embodiment, a computer-implemented method includes obtaining one or more units of text from an image of a document. The method includes determining one or more annotated values from the one or more units of text and determining a set of candidate labels for each annotated value. The method determines each set of candidate labels by performing a search for the candidate labels based at least in part on a language associated with the document and a location of each annotated value. The method includes determining a canonical label for each annotated value based at least in part on the associated candidate labels, and mapping at least one annotated value to an action that is presented to a user based at least in part on the canonical label associated with the annotated value.

    Systems and Methods for Extracting Information from a Physical Document

    公开(公告)号:US20210406451A1

    公开(公告)日:2021-12-30

    申请号:US17291647

    申请日:2019-01-28

    Applicant: Google LLC

    Abstract: Systems and methods for extracting information from documents are provided. In one example embodiment, a computer-implemented method includes obtaining one or more units of text from an image of a document. The method includes determining one or more annotated values from the one or more units of text and determining a set of candidate labels for each annotated value. The method determines each set of candidate labels by performing a search for the candidate labels based at least in part on a language associated with the document and a location of each annotated value. The method includes determining a canonical label for each annotated value based at least in part on the associated candidate labels, and mapping at least one annotated value to an action that is presented to a user based at least in part on the canonical label associated with the annotated value.

Patent Agency Ranking