Invention Application
- Patent Title: SPEECH SYNTHESIS UTILIZING AUDIO WAVEFORM DIFFERENCE SIGNAL(S)
- Application No.: US17610934
- Application Date: 2019-05-20
- Publication No.: US20220254330A1
- Publication Date: 2022-08-11
- Inventor: Luis Carlos Cobo Rus, Nal Kalchbrenner, Erich Elsen, Chenjie Gu
- Applicant: DeepMind Technologies Limited
- Applicant Address: GB London
- Assignee: DeepMind Technologies Limited
- Current Assignee: DeepMind Technologies Limited
- Current Assignee Address: GB London
- International Application: PCT/US2019/033104 WO 20190520
- Main IPC: G10L13/047
- IPC: G10L13/047; G10L13/08; G10L25/30

Abstract:
Techniques are disclosed that enable generation of an audio waveform representing synthesized speech based on a difference signal determined using an autoregressive model. Various implementations include using a distribution of the difference signal values to represent sounds found in human speech with a higher level of granularity than sounds not frequently found in human speech. Additional or alternative implementations include using one or more speakers of a client device to render the generated audio waveform.
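The idea in the abstract can be illustrated with a minimal sketch. It is not the method claimed in the application. It assumes that small sample-to-sample differences are the ones most common in speech, and it uses mu-law companding as a stand-in for the distribution of difference signal values, so that bins are finer near zero. The autoregressive model is replaced here by a precomputed list of codes. All names and constants (`MU`, `LEVELS`, `quantize`, `dequantize`, `encode`, `decode`) are hypothetical.

```python
import math

MU = 255      # assumed companding constant, not taken from the application
LEVELS = 256  # assumed number of quantization bins for the difference signal


def compress(d):
    """Mu-law compress a difference value d in [-1, 1] into [-1, 1]."""
    return math.copysign(math.log1p(MU * abs(d)) / math.log1p(MU), d)


def expand(y):
    """Inverse of compress."""
    return math.copysign(math.expm1(abs(y) * math.log1p(MU)) / MU, y)


def quantize(d):
    """Map a difference value to an integer bin index in [0, LEVELS - 1]."""
    d = max(-1.0, min(1.0, d))  # differences outside [-1, 1] are clipped
    return round((compress(d) + 1) / 2 * (LEVELS - 1))


def dequantize(q):
    """Map a bin index back to a representative difference value."""
    return expand(2 * q / (LEVELS - 1) - 1)


def decode(codes):
    """Rebuild a waveform by accumulating the decoded difference values."""
    prev, out = 0.0, []
    for q in codes:
        prev = max(-1.0, min(1.0, prev + dequantize(q)))
        out.append(prev)
    return out


def encode(samples):
    """Stand-in for the model: derive difference codes from a known waveform.

    Each difference is taken against the previously reconstructed sample,
    so quantization error does not accumulate over time.
    """
    prev, codes = 0.0, []
    for s in samples:
        q = quantize(s - prev)
        codes.append(q)
        prev = max(-1.0, min(1.0, prev + dequantize(q)))
    return codes


# Usage: round-trip a slowly varying test tone through the difference codes.
samples = [0.5 * math.sin(2 * math.pi * 5 * t / 200) for t in range(200)]
waveform = decode(encode(samples))
```

In this sketch the bins adjacent to zero are much narrower than the bins at the extremes, which is one simple way to give frequent, small differences a higher level of granularity than rare, large ones.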
Public/Granted literature
- US11915682B2 Speech synthesis utilizing audio waveform difference signal(s), Public/Granted day: 2024-02-27