Duration informed attention network for text-to-speech analysis

发明授权

US11468879B2 Duration informed attention network for text-to-speech analysis 有权

请登陆查看更多内容

专利标题： Duration informed attention network for text-to-speech analysis
申请号： US16397349

申请日： 2019-04-29
公开(公告)号： US11468879B2

公开(公告)日： 2022-10-11
发明人: Chengzhu Yu , Heng Lu , Dong Yu
申请人： TENCENT AMERICA LLC
申请人地址： US CA Palo Alto
专利权人： TENCENT AMERICA LLC
当前专利权人： TENCENT AMERICA LLC
当前专利权人地址： US CA Palo Alto
代理机构： Sughrue Mion, PLLC
主分类号： G10L13/08
IPC分类号： G10L13/08 ; G10L13/047 ; G10L13/00

Duration informed attention network for text-to-speech analysis

摘要：

A method and apparatus include receiving a text input that includes a sequence of text components. Respective temporal durations of the text components are determined using a duration model. A first set of spectra is generated based on the sequence of text components. A second set of spectra is generated based on the first set of spectra and the respective temporal durations of the sequence of text components. A spectrogram frame is generated based on the second set of spectra. An audio waveform is generated based on the spectrogram frame. The audio waveform is provided as an output.

公开/授权文献

US20200342849A1 DURATION INFORMED ATTENTION NETWORK FOR TEXT-TO-SPEECH ANALYSIS 公开/授权日：2020-10-29

信息查询

Espacenet

IPC分类:

G	物理
G10	乐器；声学
G10L	语音分析或合成；语音识别；语音或声音处理；语音或音频编码或解码
G10L13/00	语音合成；文本-语音合成系统
G10L13/08	.文本分析或文本以外的语音合成参数的产生，例如语义图翻译为音素、韵律产生、重音或声调测定