SPEECH SYNTHESIS UTILIZING AUDIO WAVEFORM DIFFERENCE SIGNAL(S)

Invention Application

US20220254330A1 SPEECH SYNTHESIS UTILIZING AUDIO WAVEFORM DIFFERENCE SIGNAL(S) 有权

Please log in to see more content

Patent Title: SPEECH SYNTHESIS UTILIZING AUDIO WAVEFORM DIFFERENCE SIGNAL(S)
Application No.: US17610934

Application Date: 2019-05-20
Publication No.: US20220254330A1

Publication Date: 2022-08-11
Inventor: Luis Carlos Cobo Rus , Nal Kalchbrenner , Erich Elsen , Chenjie Gu
Applicant: DeepMind Technologies Limited
Applicant Address: GB London
Assignee: DeepMind Technologies Limited
Current Assignee: DeepMind Technologies Limited
Current Assignee Address: GB London
International Application: PCT/US2019/033104 WO 20190520
Main IPC: G10L13/047
IPC: G10L13/047 ; G10L13/08 ; G10L25/30

SPEECH SYNTHESIS UTILIZING AUDIO WAVEFORM DIFFERENCE SIGNAL(S)

Abstract:

Techniques are disclosed that enable generation of an audio waveform representing synthesized speech based on a difference signal determined using an autoregressive model. Various implementations include using a distribution of the difference signal values to represent sounds found in human speech with a higher level of granularity than sounds not frequently found in human speech. Additional or alternative implementations include using one or more speakers of a client device to render the generated audio waveform.

Public/Granted literature

US11915682B2 Speech synthesis utilizing audio waveform difference signal(s) Public/Granted day:2024-02-27

Information query

Global Dossier Espacenet

IPC分类:

G	物理
G10	乐器；声学
G10L	语音分析或合成；语音识别；语音或声音处理；语音或音频编码或解码
G10L13/00	语音合成；文本-语音合成系统
G10L13/02	.产生合成语音的方法；语音合成设备
G10L13/04	..语音合成系统的零部件，例如合成设备结构或存储器管理
G10L13/047	...语音合成设备的体系结构