Text-to-speech (TTS) processing

发明授权

US11763797B2 Text-to-speech (TTS) processing 有权

请登陆查看更多内容

专利标题： Text-to-speech (TTS) processing
申请号： US16908882

申请日： 2020-06-23
公开(公告)号： US11763797B2

公开(公告)日： 2023-09-19
发明人: Roberto Barra Chicote , Adam Franciszek Nadolski , Thomas Edward Merritt , Bartosz Putrycz , Andrew Paul Breen
申请人： Amazon Technologies, Inc.
申请人地址： US WA Seattle
专利权人： Amazon Technologies, Inc.
当前专利权人： Amazon Technologies, Inc.
当前专利权人地址： US WA Seattle
代理机构： PIERCE ATWOOD LLP
主分类号： G10L13/10
IPC分类号： G10L13/10 ; G10L13/033 ; G10L13/00

摘要：

A speech model includes a sub-model corresponding to a vocal attribute. The speech model generates an output waveform using a sample model, which receives text data, and a conditioning model, which receives text metadata and produces a prosody output for use by the sample model. If, during training or runtime, a different vocal attribute is desired or needed, the sub-model is re-trained or switched to a different sub-model corresponding to the different vocal attribute.

信息查询

Espacenet

IPC分类:

G	物理
G10	乐器；声学
G10L	语音分析或合成；语音识别；语音或声音处理；语音或音频编码或解码
G10L13/00	语音合成；文本-语音合成系统
G10L13/08	.文本分析或文本以外的语音合成参数的产生，例如语义图翻译为音素、韵律产生、重音或声调测定
G10L13/10	..来自文本的韵律规则；重音或声调