-
公开(公告)号:US11734600B2
公开(公告)日:2023-08-22
申请号:US16376254
申请日:2019-04-05
Applicant: Google LLC
Inventor: Wei Wang , Bowen Liang , Macduff Hughes , Taro Watanabe , Tetsuji Nakagawa , Alexander Rudnick
Abstract: A method includes generating a base model by training with a first dataset of data pairs and generating an adapted model by training the base model on a second dataset of data pairs. The method also includes determining a contrastive score for each data pair of a third dataset of data pairs using the base model and the adapted model. The contrastive score is indicative of a probability of quality of the respective data pair. The method also includes training a target model using the data pairs of the third dataset and the contrastive scores.
-
公开(公告)号:US20230359938A1
公开(公告)日:2023-11-09
申请号:US18351397
申请日:2023-07-12
Applicant: Google LLC
Inventor: Wei Wang , Bowen Liang , Macduff Hughes , Taro Watanabe , Tetsuji Nakagawa , Alexander Rudnick
Abstract: A method includes generating a base model by training with a first dataset of data pairs and generating an adapted model by training the base model on a second dataset of data pairs. The method also includes determining a contrastive score for each data pair of a third dataset of data pairs using the base model and the adapted model. The contrastive score is indicative of a probability of quality of the respective data pair. The method also includes training a target model using the data pairs of the third dataset and the contrastive scores.
-