Invention Application
- Patent Title: ON-DEVICE SPEECH SYNTHESIS OF TEXTUAL SEGMENTS FOR TRAINING OF ON-DEVICE SPEECH RECOGNITION MODEL
-
Application No.: US17479285Application Date: 2021-09-20
-
Publication No.: US20220005458A1Publication Date: 2022-01-06
- Inventor: Françoise Beaufays , Johan Schalkwyk , Khe Chai Sim
- Applicant: Google LLC
- Applicant Address: US CA Mountain View
- Assignee: Google LLC
- Current Assignee: Google LLC
- Current Assignee Address: US CA Mountain View
- Main IPC: G10L13/047
- IPC: G10L13/047 ; G10L15/06

Abstract:
Processor(s) of a client device can: identify a textual segment stored locally at the client device; process the textual segment, using a speech synthesis model stored locally at the client device, to generate synthesized speech audio data that includes synthesized speech of the identified textual segment; process the synthesized speech, using an on-device speech recognition model that is stored locally at the client device, to generate predicted output; and generate a gradient based on comparing the predicted output to ground truth output that corresponds to the textual segment. In some implementations, the generated gradient is used, by processor(s) of the client device, to update weights of the on-device speech recognition model. In some implementations, the generated gradient is additionally or alternatively transmitted to a remote system for use in remote updating of global weights of a global speech recognition model.
Public/Granted literature
- US11705106B2 On-device speech synthesis of textual segments for training of on-device speech recognition model Public/Granted day:2023-07-18
Information query