-
公开(公告)号:US20210374395A1
公开(公告)日:2021-12-02
申请号:US16890287
申请日:2020-06-02
Applicant: Google LLC
Inventor: Sandeep Tata , Bodhisattwa Prasad Majumder , Qi Zhao , James Bradley Wendt , Marc Najork , Navneet Potti
Abstract: The present disclosure is directed to extracting text from form-like documents. In particular, a computing system can obtain an image of a document that contains a plurality of portions of text. The computing system can extract one or more candidate text portions for each field type included in a target schema. The computing system can generate a respective input feature vector for each candidate for the field type. The computing system can generate a respective candidate embedding for the candidate text portion. The computing system can determine a respective score for each candidate text portion for the field type based at least in part on the respective candidate embedding for the candidate text portion. The computing system can assign one or more of the candidate text portions to the field type based on the respective scores.
-
公开(公告)号:US20210019622A1
公开(公告)日:2021-01-21
申请号:US16948888
申请日:2020-10-05
Applicant: GOOGLE LLC
Inventor: Alan Green , Cayden Meyer , Julian Gibbons , Alexandre Mah , Divanshu Garg , Reuben Kan , Michael Smith , Sandeep Tata , Alexandrin Popescul
IPC: G06N3/08 , G06N3/04 , G06F16/9535
Abstract: A user device can send, to a server, a request for a set of documents likely to be opened by a user, determine a client-suggested document to present to the user and a potential motive for the user to open the client-suggested document, receive a suggestion message from the server, the suggestion message including a set of documents likely to be opened by the user and potential motives for the user to open documents in the set of documents, and present, on a display of the user device, visual representations of the client-suggested document, the potential motive for the user to open the client-suggested document, multiple documents included in the set of documents, and the potential motives for the user to open the multiple documents in the set of documents.
-
公开(公告)号:US20200250527A1
公开(公告)日:2020-08-06
申请号:US16750053
申请日:2020-01-23
Applicant: Google LLC
Inventor: Qi Zhao , Abbas Kazerouni , Sandeep Tata , Jing Xie , Marc Najork
Abstract: The present disclosure provides computing systems and methods directed to active learning and may provide advantages or improvements to active learning applications for skewed data sets. A challenge in training and developing high-quality models for many supervised learning scenarios is obtaining labeled training examples. This disclosure provides systems and methods for active learning on a training dataset that includes both labeled and unlabeled datapoints. In particular, the systems and methods described herein can select (e.g., at each of a number of iterations) a number of the unlabeled datapoints for which labels should be obtained to gain additional labeled datapoints on which to train a machine-learned model (e.g., machine-learned classifier model). Generally, the disclosure provides cost-effective methods and systems for selecting data to improve machine-learned models in applications such as the identification of content items in text, images, and/or audio.
-
公开(公告)号:US20240346316A1
公开(公告)日:2024-10-17
申请号:US18756665
申请日:2024-06-27
Applicant: GOOGLE LLC
Inventor: Sandeep Tata , Julian Gibbons , Divanshu Garg , Alexandre Mah , Alan Green , Cayden Meyer , Michael Smith , Reuben Kan , Alexandrin Popescul
IPC: G06N3/08 , G06F16/9535 , G06N3/045 , G06N3/084
CPC classification number: G06N3/08 , G06F16/9535 , G06N3/045 , G06N3/084
Abstract: A user device can send, to a server, a request for a set of documents likely to be opened by a user, determine a client-suggested document to present to the user and a potential motive for the user to open the client-suggested document, receive a suggestion message from the server, the suggestion message including a set of documents likely to be opened by the user and potential motives for the user to open documents in the set of documents, and present, on a display of the user device, visual representations of the client-suggested document, the potential motive for the user to open the client-suggested document, multiple documents included in the set of documents, and the potential motives for the user to open the multiple documents in the set of documents.
-
5.
公开(公告)号:US20240054390A1
公开(公告)日:2024-02-15
申请号:US17891635
申请日:2022-08-19
Applicant: Google LLC
Inventor: James Bradley Wendt , Sandeep Tata , Lauro Ivo Beltrao Colaco Costa , Emmanouil Koukoumidis
IPC: G06N20/00
CPC classification number: G06N20/00
Abstract: Labels are often over labeled by machine-learning models and under labeled by human labelers. A solution to the over and under labeling problem is to have both a machine-learning model and a human label a document, then send the document to a parser to determine the discrepancies. The discrepancies are then presented to a human to review and decide whether the machine-learning model identified labels are labels. The feedback is then given to the machine-learning model for further improvement in its confidence calculations which via a confidence threshold determine if the identified labels are presented.
-
公开(公告)号:US11830269B2
公开(公告)日:2023-11-28
申请号:US17867300
申请日:2022-07-18
Applicant: Google LLC
Inventor: Sandeep Tata , Bodhisattwa Prasad Majumder , Qi Zhao , James Bradley Wendt , Marc Najork , Navneet Potti
IPC: G06T7/70 , G06V30/413 , G06N20/00 , G06N5/04 , G06V30/412 , G06V30/262 , G06V30/416 , G06F18/21 , G06F18/22
CPC classification number: G06V30/413 , G06F18/21 , G06F18/22 , G06N5/04 , G06N20/00 , G06T7/70 , G06V30/274 , G06V30/412 , G06V30/416 , G06T2207/30176
Abstract: The present disclosure is directed to extracting text from form-like documents. In particular, a computing system can obtain an image of a document that contains a plurality of portions of text. The computing system can extract one or more candidate text portions for each field type included in a target schema. The computing system can generate a respective input feature vector for each candidate for the field type. The computing system can generate a respective candidate embedding for the candidate text portion. The computing system can determine a respective score for each candidate text portion for the field type based at least in part on the respective candidate embedding for the candidate text portion. The computing system can assign one or more of the candidate text portions to the field type based on the respective scores.
-
公开(公告)号:US20220375245A1
公开(公告)日:2022-11-24
申请号:US17867300
申请日:2022-07-18
Applicant: Google LLC
Inventor: Sandeep Tata , Bodhisattwa Prasad Majumder , Qi Zhao , James Bradley Wendt , Marc Najork , Navneet Potti
IPC: G06V30/412 , G06K9/62 , G06T7/70 , G06N20/00 , G06N5/04 , G06V10/22 , G06V30/262 , G06V30/413 , G06V30/416
Abstract: The present disclosure is directed to extracting text from form-like documents. In particular, a computing system can obtain an image of a document that contains a plurality of portions of text. The computing system can extract one or more candidate text portions for each field type included in a target schema. The computing system can generate a respective input feature vector for each candidate for the field type. The computing system can generate a respective candidate embedding for the candidate text portion. The computing system can determine a respective score for each candidate text portion for the field type based at least in part on the respective candidate embedding for the candidate text portion. The computing system can assign one or more of the candidate text portions to the field type based on the respective scores.
-
公开(公告)号:US11886533B2
公开(公告)日:2024-01-30
申请号:US17792788
申请日:2020-01-29
Applicant: Google LLC
Inventor: Ying Sheng , Yuchen Lin , Sandeep Tata , Nguyen Vo
IPC: G06F16/958 , G06F16/957 , G06F40/14
CPC classification number: G06F16/986 , G06F16/957 , G06F40/14
Abstract: Systems and methods for efficiently identifying and extracting machine-actionable structured data from web documents are provided. The technology employs neural network architectures which process the raw HTML content of a set of seed websites to create transferable models regarding information of interest. These models can then be applied to the raw HTML of other websites to identify similar information of interest. Data can thus be extracted across multiple websites in a functional, structured form that allows it to be used further by a processing system.
-
公开(公告)号:US20230014465A1
公开(公告)日:2023-01-19
申请号:US17792788
申请日:2020-01-29
Applicant: Google LLC
Inventor: Ying Sheng , Yuchen Lin , Sandeep Tata , Nguyen Vo
IPC: G06F16/958 , G06F16/957 , G06F40/14
Abstract: Systems and methods for efficiently identifying and extracting machine-actionable structured data from web documents are provided. The technology employs neural network architectures which process the raw HTML content of a set of seed websites to create transferable models regarding information of interest. These models can then be applied to the raw HTML of other websites to identify similar information of interest. Data can thus be extracted across multiple websites in a functional, structured form that allows it to be used further by a processing system.
-
公开(公告)号:US11526752B2
公开(公告)日:2022-12-13
申请号:US16750053
申请日:2020-01-23
Applicant: Google LLC
Inventor: Qi Zhao , Abbas Kazerouni , Sandeep Tata , Jing Xie , Marc Najork
Abstract: Provided are computing systems and methods directed to active learning and may provide advantages or improvements to active learning applications for skewed data sets. A challenge in training and developing high-quality models for many supervised learning scenarios is obtaining labeled training examples. Provided are systems and methods for active learning on a training dataset that includes both labeled and unlabeled datapoints. In particular, the systems and methods described herein can select (e.g., at each of a number of iterations) a number of the unlabeled datapoints for which labels should be obtained to gain additional labeled datapoints on which to train a machine-learned model (e.g., machine-learned classifier model). Generally, provided are cost-effective methods and systems for selecting data to improve machine-learned models in applications such as the identification of content items in text, images, and/or audio.
-
-
-
-
-
-
-
-
-