-
公开(公告)号:US12033412B2
公开(公告)日:2024-07-09
申请号:US17291647
申请日:2019-01-28
Applicant: Google LLC
Inventor: Rakesh Iyer , Lisha Ruan
IPC: G06V30/40 , G06F40/169 , G06V30/10
CPC classification number: G06V30/40 , G06F40/169 , G06V30/10
Abstract: Systems and methods for extracting information from documents are provided. In one example embodiment, a computer-implemented method includes obtaining one or more units of text from an image of a document. The method includes determining one or more annotated values from the one or more units of text and determining a set of candidate labels for each annotated value. The method determines each set of candidate labels by performing a search for the candidate labels based at least in part on a language associated with the document and a location of each annotated value. The method includes determining a canonical label for each annotated value based at least in part on the associated candidate labels, and mapping at least one annotated value to an action that is presented to a user based at least in part on the canonical label associated with the annotated value.
-
公开(公告)号:US20240404308A1
公开(公告)日:2024-12-05
申请号:US18671218
申请日:2024-05-22
Applicant: Google LLC
Inventor: Rakesh Iyer , Lisha Ruan
IPC: G06V30/40 , G06F40/169 , G06V30/10
Abstract: Systems and methods for extracting information from documents are provided. In one example embodiment, a computer-implemented method includes obtaining one or more units of text from an image of a document. The method includes determining one or more annotated values from the one or more units of text and determining a set of candidate labels for each annotated value. The method determines each set of candidate labels by performing a search for the candidate labels based at least in part on a language associated with the document and a location of each annotated value. The method includes determining a canonical label for each annotated value based at least in part on the associated candidate labels, and mapping at least one annotated value to an action that is presented to a user based at least in part on the canonical label associated with the annotated value.
-
公开(公告)号:US20210406451A1
公开(公告)日:2021-12-30
申请号:US17291647
申请日:2019-01-28
Applicant: Google LLC
Inventor: Rakesh Iyer , Lisha Ruan
IPC: G06F40/169 , G06K9/46 , G06K9/00 , G06K9/20
Abstract: Systems and methods for extracting information from documents are provided. In one example embodiment, a computer-implemented method includes obtaining one or more units of text from an image of a document. The method includes determining one or more annotated values from the one or more units of text and determining a set of candidate labels for each annotated value. The method determines each set of candidate labels by performing a search for the candidate labels based at least in part on a language associated with the document and a location of each annotated value. The method includes determining a canonical label for each annotated value based at least in part on the associated candidate labels, and mapping at least one annotated value to an action that is presented to a user based at least in part on the canonical label associated with the annotated value.
-
-