Blending recorded speech with text-to-speech output for specific domains
    1.
    Granted invention patent
    Blending recorded speech with text-to-speech output for specific domains (Active)

    Publication No.: US08996377B2

    Publication date: 2015-03-31

    Application No.: US13547459

    Filing date: 2012-07-12

    IPC classification: G10L13/00 G10L13/08

    CPC classification: G10L13/08

    Abstract: A text-to-speech (TTS) engine combines recorded speech with synthesized speech from a TTS synthesizer based on text input. The TTS engine receives the text input and identifies the domain for the speech (e.g., navigation, dialing, ...). The identified domain is used in selecting domain-specific speech recordings (e.g., pre-recorded static phrases such as “turn left”, “turn right”, ...) from the input text. The speech recordings are obtained based on the static phrases for the domain that are identified from the input text. The TTS engine blends the static phrases with the TTS output to smooth the acoustic trajectory of the input text. The prosody of the static phrases is used to create similar prosody in the TTS output.

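The first two steps of the abstract (identifying the domain, then finding that domain's static phrases in the input text) can be illustrated with a minimal Python sketch. Everything named here is an assumption made for illustration and does not come from the patent: the `STATIC_PHRASES` table, the `Segment` type, and the count-based domain guess in `identify_domain`. Matching is plain case-insensitive substring search with no word-boundary check, which a real engine would likely need to refine.

```python
from dataclasses import dataclass
from typing import List, Optional

# Hypothetical per-domain phrase inventories. A real system would map each
# phrase to a stored speech recording.
STATIC_PHRASES = {
    "navigation": ["turn left", "turn right"],
    "dialing": ["calling", "dialing"],
}


@dataclass
class Segment:
    text: str
    recorded: bool  # True: use a recording; False: send to the synthesizer


def identify_domain(text: str) -> Optional[str]:
    """Guess the domain whose static phrases occur most often in the text."""
    lowered = text.lower()
    counts = {
        domain: sum(lowered.count(p) for p in phrases)
        for domain, phrases in STATIC_PHRASES.items()
    }
    best = max(counts, key=counts.get)
    return best if counts[best] > 0 else None


def segment(text: str, domain: Optional[str]) -> List[Segment]:
    """Split text into static phrases and spans left for synthesis."""
    # Try longer phrases first so a longer phrase wins over a shorter prefix.
    phrases = sorted(STATIC_PHRASES.get(domain, []), key=len, reverse=True)
    lowered = text.lower()
    segments: List[Segment] = []
    pos = start = 0
    while pos < len(text):
        match = next((p for p in phrases if lowered.startswith(p, pos)), None)
        if match is None:
            pos += 1
            continue
        pending = text[start:pos].strip()
        if pending:
            segments.append(Segment(pending, recorded=False))
        segments.append(Segment(text[pos:pos + len(match)], recorded=True))
        pos += len(match)
        start = pos
    remainder = text[start:].strip()
    if remainder:
        segments.append(Segment(remainder, recorded=False))
    return segments
```

For an input such as "In 200 meters, turn left onto Main Street", this sketch would mark "turn left" as a recorded segment and leave the surrounding text for the synthesizer.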

    BLENDING RECORDED SPEECH WITH TEXT-TO-SPEECH OUTPUT FOR SPECIFIC DOMAINS
    2.
    Invention patent application
    BLENDING RECORDED SPEECH WITH TEXT-TO-SPEECH OUTPUT FOR SPECIFIC DOMAINS (Active)

    Publication No.: US20140019134A1

    Publication date: 2014-01-16

    Application No.: US13547459

    Filing date: 2012-07-12

    IPC classification: G10L13/08

    CPC classification: G10L13/08

    Abstract: A text-to-speech (TTS) engine combines recorded speech with synthesized speech from a TTS synthesizer based on text input. The TTS engine receives the text input and identifies the domain for the speech (e.g., navigation, dialing, ...). The identified domain is used in selecting domain-specific speech recordings (e.g., pre-recorded static phrases such as “turn left”, “turn right”, ...) from the input text. The speech recordings are obtained based on the static phrases for the domain that are identified from the input text. The TTS engine blends the static phrases with the TTS output to smooth the acoustic trajectory of the input text. The prosody of the static phrases is used to create similar prosody in the TTS output.

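The abstract says the engine blends static phrases with TTS output to smooth the acoustic trajectory, but it does not say how. As one possible illustration only, the sketch below joins a recorded waveform to a synthesized one with a linear cross-fade over a short overlap. The cross-fade itself, the function name `crossfade`, and the representation of audio as plain lists of float samples are all assumptions, not the patented method.

```python
from typing import List


def crossfade(recorded: List[float], synthesized: List[float], overlap: int) -> List[float]:
    """Join two sample sequences, linearly cross-fading over `overlap` samples.

    Assumption: both inputs share the same sample rate and amplitude scale.
    """
    # The overlap cannot exceed the length of either input.
    overlap = max(0, min(overlap, len(recorded), len(synthesized)))
    split = len(recorded) - overlap
    head = recorded[:split]          # recorded samples before the join region
    tail = synthesized[overlap:]     # synthesized samples after the join region
    mixed = []
    for i in range(overlap):
        w = (i + 1) / (overlap + 1)  # weight of the synthesized signal, rising toward 1
        mixed.append((1.0 - w) * recorded[split + i] + w * synthesized[i])
    return head + mixed + tail
```

With `overlap` set to 0 the function reduces to plain concatenation, which makes the effect of the cross-fade easy to compare by ear.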