Invention Grant
- Patent Title: Training machine learning models using unsupervised data augmentation
-
Application No.: US17606190Application Date: 2020-04-24
-
Publication No.: US12118064B2Publication Date: 2024-10-15
- Inventor: Thang Minh Luong , Quoc V. Le , Qizhe Xie , Zihang Dai
- Applicant: Google LLC
- Applicant Address: US CA Mountain View
- Assignee: Google LLC
- Current Assignee: Google LLC
- Current Assignee Address: US CA Mountain View
- Agency: Fish & Richardson P.C.
- International Application: PCT/US2020/029945 2020.04.24
- International Announcement: WO2020/219971A 2020.10.29
- Date entered country: 2021-10-25
- Main IPC: G06F18/21
- IPC: G06F18/21 ; G06F18/214 ; G06N3/08

Abstract:
Methods, systems, and apparatus, including computer programs encoded on computer storage media, for training a IT machine learning model. One of the methods includes receiving training data comprising a plurality of unlabeled training inputs and a plurality of labeled training inputs; generating augmented training data, comprising generating, for each of the plurality of unlabeled training inputs, a respective augmented training input by applying a data augmentation technique to the unlabeled training input; and training the machine learning model on the augmented training data. In particular, but not exclusively, the model may be trained for perceptual tasks (e.g. tasks relating to vision or speech).
Public/Granted literature
- US20220215209A1 TRAINING MACHINE LEARNING MODELS USING UNSUPERVISED DATA AUGMENTATION Public/Granted day:2022-07-07
Information query