END-TO-END TEXT-TO-SPEECH CONVERSION

Invention publication

EP3583594A2 END-TO-END TEXT-TO-SPEECH CONVERSION Under examination - published

Patent title: END-TO-END TEXT-TO-SPEECH CONVERSION
Application number: EP18718562.4

Filing date: 2018-03-29
Publication number: EP3583594A2

Publication date: 2019-12-25
Inventors: BENGIO, Samuel; WANG, Yuxuan; YANG, Zongheng; CHEN, Zhifeng; WU, Yonghui; AGIOMYRGIANNAKIS, Ioannis; WEISS, Ron J.; JAITLY, Navdeep; RIFKIN, Ryan M.; CLARK, Robert Andrew James; LE, Quoc V.; RYAN, Russell J.; XIAO, Ying
Applicant: Google LLC
Applicant address: 1600 Amphitheatre Parkway Mountain View, CA 94043 US
Patentee: Google LLC
Current patentee: Google LLC
Current patentee address: 1600 Amphitheatre Parkway Mountain View, CA 94043 US
Agency: Anderson, Oliver Ben
Priority: GR20170100126 20170329
International publication: WO2018183650 20181004
Main classification: G10L13/04
IPC classification: G10L13/04 ; G10L15/16 ; G06N3/08

Abstract:

Methods, systems, and apparatus, including computer programs encoded on computer storage media, for generating speech from text. One of the systems includes one or more computers and one or more storage devices storing instructions that when executed by one or more computers cause the one or more computers to implement: a sequence-to-sequence recurrent neural network configured to: receive a sequence of characters in a particular natural language, and process the sequence of characters to generate a spectrogram of a verbal utterance of the sequence of characters in the particular natural language; and a subsystem configured to: receive the sequence of characters in the particular natural language, and provide the sequence of characters as input to the sequence-to-sequence recurrent neural network to obtain as output the spectrogram of the verbal utterance of the sequence of characters in the particular natural language.
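
The data flow in the abstract (characters in, spectrogram out, with a subsystem wrapping the network) can be sketched roughly as below. This is only an illustrative toy under stated assumptions, not the claimed implementation: the class names `ToySeq2SeqRNN` and `TextToSpectrogramSubsystem`, the random untrained weights, the plain tanh recurrences, and the fixed `frames_per_char` output length are all hypothetical choices made so the example runs with the Python standard library alone.

```python
import math
import random


class ToySeq2SeqRNN:
    """Untrained stand-in for a sequence-to-sequence recurrent network.

    Assumption: weights are random, so the output only has the right
    shape (frames x frequency bins), not meaningful audio content.
    """

    def __init__(self, vocab, hidden_size=8, n_bins=4, seed=0):
        rng = random.Random(seed)
        self.char_to_id = {c: i for i, c in enumerate(vocab)}
        self.hidden_size = hidden_size
        self.n_bins = n_bins

        def mat(rows, cols):
            return [[rng.uniform(-0.5, 0.5) for _ in range(cols)]
                    for _ in range(rows)]

        self.embed = mat(len(vocab), hidden_size)   # one vector per character
        self.w_enc = mat(hidden_size, hidden_size)  # encoder recurrence
        self.w_dec = mat(hidden_size, hidden_size)  # decoder recurrence
        self.w_out = mat(n_bins, hidden_size)       # hidden state -> frame

    @staticmethod
    def _matvec(m, v):
        return [sum(a * b for a, b in zip(row, v)) for row in m]

    def encode(self, chars):
        """Fold the character sequence into a single hidden state."""
        h = [0.0] * self.hidden_size
        for c in chars:
            x = self.embed[self.char_to_id[c]]
            rec = self._matvec(self.w_enc, h)
            h = [math.tanh(xi + ri) for xi, ri in zip(x, rec)]
        return h

    def decode(self, state, n_frames):
        """Unroll the decoder, emitting one spectrogram frame per step."""
        frames, h = [], state
        for _ in range(n_frames):
            h = [math.tanh(v) for v in self._matvec(self.w_dec, h)]
            frames.append(self._matvec(self.w_out, h))
        return frames

    def generate(self, chars, frames_per_char=2):
        # Assumption: output length is a fixed multiple of input length.
        return self.decode(self.encode(chars), frames_per_char * len(chars))


class TextToSpectrogramSubsystem:
    """Receives text and passes its characters to the network."""

    def __init__(self, network):
        self.network = network

    def synthesize(self, text):
        return self.network.generate(list(text))


subsystem = TextToSpectrogramSubsystem(ToySeq2SeqRNN(vocab="abc "))
spectrogram = subsystem.synthesize("a cab")
print(len(spectrogram), "frames x", len(spectrogram[0]), "bins")
```

The wrapper class mirrors the split the abstract draws between the network itself and the subsystem that feeds it the character sequence; a trained model would replace the random matrices and the fixed-length decoding.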

Published/granted documents

EP3583594B1 END-TO-END TEXT-TO-SPEECH CONVERSION Publication/grant date: 2020-09-09

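IPC classification:

G	Physics
G10	Musical instruments; acoustics
G10L	Speech analysis or synthesis; speech recognition; speech or voice processing; speech or audio coding or decoding
G10L13/00	Speech synthesis; text-to-speech systems
G10L13/02	.Methods for producing synthetic speech; speech synthesisers
G10L13/04	..Details of speech synthesis systems, e.g. synthesiser structure or memory management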