Patent search ap:("GOOGLE LLC") AND inv:"Khe Chai Sim" Page 4

31.

Invention application
ON-DEVICE PERSONALIZATION OF SPEECH SYNTHESIS FOR TRAINING OF SPEECH RECOGNITION MODEL(S) (in force)

Publication No.: US20220115000A1

Publication date: 2022-04-14

Application No.: US17082518

Filing date: 2020-10-28

Applicant: Google LLC

Inventor: Françoise Beaufays, Johan Schalkwyk, Khe Chai Sim

IPC: G10L13/047, G10L13/10, G10L13/033

Abstract: Processor(s) of a client device can: identify a textual segment stored locally at the client device; process the textual segment, using an on-device TTS generator model, to generate synthesized speech audio data that includes synthesized speech of the textual segment; process the synthesized speech, using an on-device ASR model to generate predicted ASR output; and generate a gradient based on comparing the predicted ASR output to ground truth output corresponding to the textual segment. Processor(s) of the client device can also: process the synthesized speech audio data using an on-device TTS generator model to make a prediction; and generate a gradient based on the prediction. In these implementations, the generated gradient(s) can be used to update weight(s) of the respective on-device model(s) and/or transmitted to a remote system for use in remote updating of respective global model(s). The updated weight(s) and/or the updated model(s) can be transmitted to client device(s).
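The first loop in the abstract (textual segment, then synthesized speech, then predicted ASR output, then a gradient against the ground truth) can be illustrated with a minimal Python sketch. Everything in it is a hypothetical stand-in rather than the patented implementation: `toy_tts` emits one one-hot "frame" per character instead of real audio, the "ASR model" is a single linear layer with softmax, and the names `VOCAB`, `toy_tts`, `asr_forward`, `loss_and_gradient`, and `local_step` are invented for illustration.

```python
import math

# Hypothetical toy vocabulary; a real system would use its own output units.
VOCAB = "abcdefghijklmnopqrstuvwxyz "


def toy_tts(text):
    """Stand-in for the on-device TTS generator: one one-hot frame per character."""
    frames = []
    for ch in text:
        idx = VOCAB.index(ch)
        frames.append([1.0 if i == idx else 0.0 for i in range(len(VOCAB))])
    return frames


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def asr_forward(weights, frame):
    """Stand-in for the on-device ASR model: linear layer plus softmax per frame."""
    logits = [sum(w * x for w, x in zip(row, frame)) for row in weights]
    return softmax(logits)


def loss_and_gradient(weights, frames, targets):
    """Mean cross-entropy against the ground truth, and its gradient w.r.t. weights."""
    n = len(VOCAB)
    grad = [[0.0] * n for _ in range(n)]
    loss = 0.0
    for frame, target in zip(frames, targets):
        probs = asr_forward(weights, frame)
        loss -= math.log(probs[target])
        for k in range(n):
            err = probs[k] - (1.0 if k == target else 0.0)
            for j in range(n):
                grad[k][j] += err * frame[j]
    count = len(frames)
    return loss / count, [[g / count for g in row] for row in grad]


def local_step(weights, text, lr=0.5):
    """One on-device step: synthesize, recognize, compare, update.

    The returned gradient is what a client could alternatively transmit
    to a remote system instead of (or in addition to) updating locally.
    """
    frames = toy_tts(text)                       # synthesized "speech"
    targets = [VOCAB.index(ch) for ch in text]   # ground truth from the text itself
    loss, grad = loss_and_gradient(weights, frames, targets)
    updated = [[w - lr * g for w, g in zip(w_row, g_row)]
               for w_row, g_row in zip(weights, grad)]
    return updated, grad, loss


if __name__ == "__main__":
    weights = [[0.0] * len(VOCAB) for _ in VOCAB]
    for step in range(3):
        weights, _, loss = local_step(weights, "call khe chai")
        print(f"step {step}: loss {loss:.4f}")
```

The point of the sketch is that the locally stored text supplies both the TTS input and the ASR ground truth, so no human-labeled audio is needed to produce a gradient.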
