END-TO-END TEXT-TO-SPEECH CONVERSION

Patent Application

US20190311708A1 END-TO-END TEXT-TO-SPEECH CONVERSION Pending (Published)

Title: END-TO-END TEXT-TO-SPEECH CONVERSION
Application No.: US16447862
Filing date: 2019-06-20
Publication No.: US20190311708A1
Publication date: 2019-10-10
Inventors: Samy Bengio, Yuxuan Wang, Zongheng Yang, Zhifeng Chen, Yonghui Wu, Ioannis Agiomyrgiannakis, Ron J. Weiss, Navdeep Jaitly, Ryan M. Rifkin, Robert Andrew James Clark, Quoc V. Le, Russell J. Ryan, Ying Xiao
Applicant: Google LLC
Priority: GR20170100126, 2017-03-29
Main classification: G10L13/08
IPC classification: G10L13/08; G10L25/18; G10L25/30; G06N3/08

Abstract:

Methods, systems, and apparatus, including computer programs encoded on computer storage media, for generating speech from text. One of the systems includes one or more computers and one or more storage devices storing instructions that when executed by one or more computers cause the one or more computers to implement: a sequence-to-sequence recurrent neural network configured to: receive a sequence of characters in a particular natural language, and process the sequence of characters to generate a spectrogram of a verbal utterance of the sequence of characters in the particular natural language; and a subsystem configured to: receive the sequence of characters in the particular natural language, and provide the sequence of characters as input to the sequence-to-sequence recurrent neural network to obtain as output the spectrogram of the verbal utterance of the sequence of characters in the particular natural language.
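The two components named in the abstract, a sequence-to-sequence recurrent network and a subsystem that feeds it characters, can be illustrated with a small sketch. This is not the patented implementation: the class names `ToySeq2SeqRNN` and `TextToSpectrogramSubsystem`, the sine-based character embedding, the fixed number of frames per character, and the random untrained weights are all assumptions made for illustration. The output has the shape of a spectrogram (frames by frequency bins) but carries no acoustic meaning.

```python
import math
import random
from typing import List

Spectrogram = List[List[float]]  # one row per time frame, one column per frequency bin


def matvec(matrix: List[List[float]], vector: List[float]) -> List[float]:
    """Multiply a matrix (given as a list of rows) by a vector."""
    return [sum(w * x for w, x in zip(row, vector)) for row in matrix]


class ToySeq2SeqRNN:
    """Hypothetical stand-in for the sequence-to-sequence recurrent network.

    Weights are random and untrained, so only the data flow is meaningful.
    """

    def __init__(self, hidden: int = 8, n_bins: int = 4,
                 frames_per_char: int = 2, seed: int = 0) -> None:
        rng = random.Random(seed)

        def rand_matrix(rows: int, cols: int) -> List[List[float]]:
            return [[rng.uniform(-0.5, 0.5) for _ in range(cols)] for _ in range(rows)]

        self.hidden = hidden
        self.frames_per_char = frames_per_char  # assumption: fixed output length
        self.w_enc = rand_matrix(hidden, hidden)
        self.w_dec = rand_matrix(hidden, hidden)
        self.w_out = rand_matrix(n_bins, hidden)

    def _embed(self, ch: str) -> List[float]:
        # Assumed embedding: a deterministic vector derived from the code point.
        return [math.sin(ord(ch) * (k + 1)) for k in range(self.hidden)]

    def generate(self, chars: List[str]) -> Spectrogram:
        # Encoder: fold the character sequence into one recurrent state.
        state = [0.0] * self.hidden
        for ch in chars:
            recurrent = matvec(self.w_enc, state)
            state = [math.tanh(r + e) for r, e in zip(recurrent, self._embed(ch))]
        # Decoder: unroll from the encoder state, emitting one frame per step.
        frames: Spectrogram = []
        for _ in range(self.frames_per_char * len(chars)):
            state = [math.tanh(v) for v in matvec(self.w_dec, state)]
            frames.append(matvec(self.w_out, state))
        return frames


class TextToSpectrogramSubsystem:
    """Receives text and passes it, as a character sequence, to the network."""

    def __init__(self, model: ToySeq2SeqRNN) -> None:
        self.model = model

    def convert(self, text: str) -> Spectrogram:
        return self.model.generate(list(text))


if __name__ == "__main__":
    subsystem = TextToSpectrogramSubsystem(ToySeq2SeqRNN())
    spectrogram = subsystem.convert("hello")
    print(len(spectrogram), "frames x", len(spectrogram[0]), "bins")
```

The input to the network is raw characters rather than phonemes or other linguistic features, which is the sense in which the abstract describes the conversion as end-to-end.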

Published/Granted Documents

US10573293B2 End-to-end text-to-speech conversion Publication/grant date: 2020-02-25

IPC Classification:

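G	Physics
G10	Musical instruments; acoustics
G10L	Speech analysis or synthesis; speech recognition; speech or voice processing; speech or audio coding or decoding
G10L13/00	Speech synthesis; text-to-speech systems
G10L13/08	. Text analysis or generation of parameters for speech synthesis out of text, e.g. grapheme to phoneme translation, prosody generation or stress or intonation determination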