-
公开(公告)号:US12223015B2
公开(公告)日:2025-02-11
申请号:US17651414
申请日:2022-02-16
Applicant: Google LLC
Inventor: Emmanouil Koukoumidis , Nikolaos Kofinas , Evan Huang , Kiran Bellare , Xiao Liu , Michael Lanning , Lukas Rutishauser
IPC: G06F40/295 , G06F18/20 , G06F18/21
Abstract: A computer-implemented method includes receiving a document insight request that requests document insights for a corpus of documents. The document insight request includes the corpus of documents, a set of entities contained within each document of the corpus of documents, and document insight request parameters that includes a confidence value threshold. The method also includes generating the document insights for the corpus of documents based on the confidence value threshold. Here, the document insights include an accuracy target and a user review rate target. The method also includes transmitting the document insights to the user device causing a graphical user interface to display the document insights on the user device.
-
公开(公告)号:US20240046686A1
公开(公告)日:2024-02-08
申请号:US17817058
申请日:2022-08-03
Applicant: Google LLC
Inventor: Tianjun Ye , Younghwan Jung , Xiaoqi Ren , Wael Farhan , Tianjun Fu , Nikolaos Kofinas , Nikolay Alexeevich Glushnev , Matthew Eastberg Persons , Xiao Liu , Evan S. Huang , Emmanouil Koukoumidis , Bhavishya Mittal
IPC: G06V30/418 , G06V30/19 , G06V30/412 , G06V30/414 , G06V30/18
CPC classification number: G06V30/418 , G06V30/19107 , G06V30/412 , G06V30/19147 , G06V30/1918 , G06V30/414 , G06V30/18152
Abstract: A method for document extraction includes receiving, from a user device associated with a user, an annotated document that includes one or more fields. Each respective field of the one or more fields of the annotated document is labeled by a respective annotation. The method includes clustering, using a template matching algorithm, the annotated document into a cluster and inducing, using the annotated document, a document template for the cluster. The method includes receiving, from the user device, an unannotated document including the one or more fields. The method includes clustering, using the template matching algorithm, the unannotated document into the cluster and, in response to clustering the unannotated document into the cluster, extracting, using the document template, the one or more fields.
-
公开(公告)号:US20230195847A1
公开(公告)日:2023-06-22
申请号:US17651414
申请日:2022-02-16
Applicant: Google LLC
Inventor: Emmanouil Koukoumidis , Nikolaos Kofinas , Evan Huang , Kiran Bellare , Xiao Liu , Michael Lanning , Lukas Rutishauser
IPC: G06K9/62 , G06F40/295
CPC classification number: G06K9/6265 , G06F40/295 , G06K9/6227
Abstract: A computer-implemented method includes receiving a document insight request that requests document insights for a corpus of documents. The document insight request includes the corpus of documents, a set of entities contained within each document of the corpus of documents, and document insight request parameters that includes a confidence value threshold. The method also includes generating the document insights for the corpus of documents based on the confidence value threshold. Here, the document insights include an accuracy target and a user review rate target. The method also includes transmitting the document insights to the user device causing a graphical user interface to display the document insights on the user device.
-
-