-
31.
公开(公告)号:US20220115000A1
公开(公告)日:2022-04-14
申请号:US17082518
申请日:2020-10-28
Applicant: Google LLC
Inventor: Françoise Beaufays , Johan Schalkwyk , Khe Chai Sim
IPC: G10L13/047 , G10L13/10 , G10L13/033
Abstract: Processor(s) of a client device can: identify a textual segment stored locally at the client device; process the textual segment, using an on-device TTS generator model, to generate synthesized speech audio data that includes synthesized speech of the textual segment; process the synthesized speech, using an on-device ASR model to generate predicted ASR output; and generate a gradient based on comparing the predicted ASR output to ground truth output corresponding to the textual segment. Processor(s) of the client device can also: process the synthesized speech audio data using an on-device TTS generator model to make a prediction; and generate a gradient based on the prediction. In these implementations, the generated gradient(s) can be used to update weight(s) of the respective on-device model(s) and/or transmitted to a remote system for use in remote updating of respective global model(s). The updated weight(s) and/or the updated model(s) can be transmitted to client device(s).