-
1.
公开(公告)号:US20240054390A1
公开(公告)日:2024-02-15
申请号:US17891635
申请日:2022-08-19
Applicant: Google LLC
Inventor: James Bradley Wendt , Sandeep Tata , Lauro Ivo Beltrao Colaco Costa , Emmanouil Koukoumidis
IPC: G06N20/00
CPC classification number: G06N20/00
Abstract: Labels are often over labeled by machine-learning models and under labeled by human labelers. A solution to the over and under labeling problem is to have both a machine-learning model and a human label a document, then send the document to a parser to determine the discrepancies. The discrepancies are then presented to a human to review and decide whether the machine-learning model identified labels are labels. The feedback is then given to the machine-learning model for further improvement in its confidence calculations which via a confidence threshold determine if the identified labels are presented.
-
公开(公告)号:US20230419020A1
公开(公告)日:2023-12-28
申请号:US17808293
申请日:2022-06-22
Applicant: Google LLC
Inventor: Nikolay Glushnev , Qingze Wang , Emmanouil Koukoumidis , Henry Wahyudi Setiawan , Lauro Ivo Beltrao Colaco Costa , Vincent Perot
IPC: G06F40/174 , G06V30/412 , G06V30/19
CPC classification number: G06F40/174 , G06V30/412 , G06V30/19
Abstract: A method includes obtaining a document with textual fields and a visual element. For each textual field, the method includes determining a textual offset for the textual field that indicates a location of the textual field relative to each other textual field in the document. The method includes detecting, using a machine learning vision model, the visual element and determining a visual element offset indicating a location of the visual element relative to each textual field in the document. The method includes assigning the visual element a visual element anchor token and inserting the visual element anchor token into the textual fields in an order based on the visual element offset and the respective textual offsets. The method also includes, after inserting the visual element anchor token, extracting, using a text-based extraction model, from the textual fields, structured entities representing the series of textual fields and the visual element.
-
公开(公告)号:US20250069270A1
公开(公告)日:2025-02-27
申请号:US18724026
申请日:2022-01-24
Applicant: Google LLC
Inventor: Joseph Johnson, Jr. , Shiblee Hasan , Emmanouil Koukoumidis , Dustin Abramson
Abstract: A method includes obtaining image data, identifying a machine learning-compressible (ML-compressible) portion of the image data, and determining a location of the ML-compressible portion within the image data. The method also includes selecting, from a plurality of ML compression models, an ML compression model for the ML-compressible portion based on an image content thereof, and generating, based on the ML-compressible portion and by the ML compression model, an ML-compressed representation of the ML-compressible portion. The method further includes generating a compressed image data file that includes the ML-compressed representation and the location of the ML-compressible portion, and outputting the compressed image data file. The compressed image data file is configured to cause an ML decompression model corresponding to the ML compression model to generate a reconstruction of the ML-compressible portion of the image data based on the ML-compressed representation.
-
公开(公告)号:US12223015B2
公开(公告)日:2025-02-11
申请号:US17651414
申请日:2022-02-16
Applicant: Google LLC
Inventor: Emmanouil Koukoumidis , Nikolaos Kofinas , Evan Huang , Kiran Bellare , Xiao Liu , Michael Lanning , Lukas Rutishauser
IPC: G06F40/295 , G06F18/20 , G06F18/21
Abstract: A computer-implemented method includes receiving a document insight request that requests document insights for a corpus of documents. The document insight request includes the corpus of documents, a set of entities contained within each document of the corpus of documents, and document insight request parameters that includes a confidence value threshold. The method also includes generating the document insights for the corpus of documents based on the confidence value threshold. Here, the document insights include an accuracy target and a user review rate target. The method also includes transmitting the document insights to the user device causing a graphical user interface to display the document insights on the user device.
-
公开(公告)号:US20240046686A1
公开(公告)日:2024-02-08
申请号:US17817058
申请日:2022-08-03
Applicant: Google LLC
Inventor: Tianjun Ye , Younghwan Jung , Xiaoqi Ren , Wael Farhan , Tianjun Fu , Nikolaos Kofinas , Nikolay Alexeevich Glushnev , Matthew Eastberg Persons , Xiao Liu , Evan S. Huang , Emmanouil Koukoumidis , Bhavishya Mittal
IPC: G06V30/418 , G06V30/19 , G06V30/412 , G06V30/414 , G06V30/18
CPC classification number: G06V30/418 , G06V30/19107 , G06V30/412 , G06V30/19147 , G06V30/1918 , G06V30/414 , G06V30/18152
Abstract: A method for document extraction includes receiving, from a user device associated with a user, an annotated document that includes one or more fields. Each respective field of the one or more fields of the annotated document is labeled by a respective annotation. The method includes clustering, using a template matching algorithm, the annotated document into a cluster and inducing, using the annotated document, a document template for the cluster. The method includes receiving, from the user device, an unannotated document including the one or more fields. The method includes clustering, using the template matching algorithm, the unannotated document into the cluster and, in response to clustering the unannotated document into the cluster, extracting, using the document template, the one or more fields.
-
公开(公告)号:US20230195847A1
公开(公告)日:2023-06-22
申请号:US17651414
申请日:2022-02-16
Applicant: Google LLC
Inventor: Emmanouil Koukoumidis , Nikolaos Kofinas , Evan Huang , Kiran Bellare , Xiao Liu , Michael Lanning , Lukas Rutishauser
IPC: G06K9/62 , G06F40/295
CPC classification number: G06K9/6265 , G06F40/295 , G06K9/6227
Abstract: A computer-implemented method includes receiving a document insight request that requests document insights for a corpus of documents. The document insight request includes the corpus of documents, a set of entities contained within each document of the corpus of documents, and document insight request parameters that includes a confidence value threshold. The method also includes generating the document insights for the corpus of documents based on the confidence value threshold. Here, the document insights include an accuracy target and a user review rate target. The method also includes transmitting the document insights to the user device causing a graphical user interface to display the document insights on the user device.
-
-
-
-
-