- 专利标题: Duration informed attention network for text-to-speech analysis
-
申请号: US16397349申请日: 2019-04-29
-
公开(公告)号: US11468879B2公开(公告)日: 2022-10-11
- 发明人: Chengzhu Yu , Heng Lu , Dong Yu
- 申请人: TENCENT AMERICA LLC
- 申请人地址: US CA Palo Alto
- 专利权人: TENCENT AMERICA LLC
- 当前专利权人: TENCENT AMERICA LLC
- 当前专利权人地址: US CA Palo Alto
- 代理机构: Sughrue Mion, PLLC
- 主分类号: G10L13/08
- IPC分类号: G10L13/08 ; G10L13/047 ; G10L13/00
摘要:
A method and apparatus include receiving a text input that includes a sequence of text components. Respective temporal durations of the text components are determined using a duration model. A first set of spectra is generated based on the sequence of text components. A second set of spectra is generated based on the first set of spectra and the respective temporal durations of the sequence of text components. A spectrogram frame is generated based on the second set of spectra. An audio waveform is generated based on the spectrogram frame. The audio waveform is provided as an output.
公开/授权文献
信息查询