Granted invention patent
US08996377B2 Blending recorded speech with text-to-speech output for specific domains
Legal status: In force
- Patent title: Blending recorded speech with text-to-speech output for specific domains
- Application No.: US13547459
- Filing date: 2012-07-12
- Publication No.: US08996377B2
- Publication date: 2015-03-31
- Inventors: Sheng Zhao, Peng Wang, Difei Gao, Yijian Wu, Binggong Ding, Shenghua Ye, Max Leung
- Applicants: Sheng Zhao, Peng Wang, Difei Gao, Yijian Wu, Binggong Ding, Shenghua Ye, Max Leung
- Applicant address: US WA Redmond
- Assignee: Microsoft Technology Licensing, LLC
- Current assignee: Microsoft Technology Licensing, LLC
- Current assignee address: US WA Redmond
- Agents: Steven Spellman; Jim Ross; Micky Minhas
- Main classification: G10L13/00
- IPC classification: G10L13/00; G10L13/08
Abstract:
A text-to-speech (TTS) engine combines recorded speech with synthesized speech from a TTS synthesizer based on text input. The TTS engine receives the text input and identifies the domain for the speech (e.g., navigation, dialing, ...). The identified domain is used in selecting domain-specific speech recordings (e.g., pre-recorded static phrases such as “turn left”, “turn right”, ...) from the input text. The speech recordings are obtained based on the static phrases for the domain that are identified from the input text. The TTS engine blends the static phrases with the TTS output to smooth the acoustic trajectory of the input text. The prosody of the static phrases is used to create similar prosody in the TTS output.
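The flow the abstract describes (identify a domain, find that domain's static phrases in the input text, use recordings for those phrases and synthesis for the rest) can be illustrated with a minimal Python sketch. It is not the patented implementation. The phrase tables, recording file names, and the functions `identify_domain` and `plan_segments` are hypothetical, and the domain is guessed by simple phrase counting, which is an assumption. Acoustic smoothing and prosody matching are not modeled; the sketch only decides which spans would come from recordings and which from the synthesizer.

```python
import re
from dataclasses import dataclass
from typing import Optional

# Hypothetical per-domain tables mapping static phrases to recording files.
DOMAIN_PHRASES = {
    "navigation": {
        "turn left": "nav_turn_left.wav",
        "turn right": "nav_turn_right.wav",
    },
    "dialing": {"calling": "dial_calling.wav"},
}


@dataclass
class Segment:
    text: str
    source: str  # "recorded" or "tts"
    recording: Optional[str] = None


def identify_domain(text):
    """Guess the domain by counting which table's phrases occur in the text."""
    lowered = text.lower()
    scores = {
        domain: sum(phrase in lowered for phrase in phrases)
        for domain, phrases in DOMAIN_PHRASES.items()
    }
    best = max(scores, key=scores.get)
    return best if scores[best] > 0 else None


def plan_segments(text, domain):
    """Split text into recorded static phrases and spans left for synthesis."""
    phrases = DOMAIN_PHRASES.get(domain, {})
    if not phrases:
        return [Segment(text, "tts")]
    # Try longer phrases first so a short phrase cannot shadow a longer one.
    ordered = sorted(phrases, key=len, reverse=True)
    pattern = re.compile(
        "|".join(r"\b" + re.escape(p) + r"\b" for p in ordered),
        re.IGNORECASE,
    )
    segments = []
    pos = 0
    for match in pattern.finditer(text):
        gap = text[pos:match.start()].strip()
        if gap:
            segments.append(Segment(gap, "tts"))
        recording = phrases[match.group(0).lower()]
        segments.append(Segment(match.group(0), "recorded", recording))
        pos = match.end()
    tail = text[pos:].strip()
    if tail:
        segments.append(Segment(tail, "tts"))
    return segments


if __name__ == "__main__":
    sentence = "Turn left onto Main Street, then turn right"
    for seg in plan_segments(sentence, identify_domain(sentence)):
        print(seg.source, repr(seg.text), seg.recording)
```

In a fuller system, each "tts" segment would be synthesized with prosody targets taken from the neighboring recorded phrases, and the joins would be smoothed; both steps are outside this sketch.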