Speech synthesis system, speech synthesis program product, and speech synthesis method

    公开(公告)号:US09275631B2

    公开(公告)日:2016-03-01

    申请号:US13731268

    申请日:2012-12-31

    IPC分类号: G10L13/10 G10L13/00 G10L13/07

    CPC分类号: G10L13/00 G10L13/07 G10L13/10

    摘要: Waveform concatenation speech synthesis with high sound quality. Prosody with both high accuracy and high sound quality is achieved by performing a two-path search including a speech segment search and a prosody modification value search. An accurate accent is secured by evaluating the consistency of the prosody by using a statistical model of prosody variations (the slope of fundamental frequency) for both of two paths of the speech segment selection and the modification value search. In the prosody modification value search, a prosody modification value sequence that minimizes a modified prosody cost is searched for. This allows a search for a modification value sequence that can increase the likelihood of absolute values or variations of the prosody to the statistical model as high as possible with minimum modification values.

    SPEECH SYNTHESIS SYSTEM, SPEECH SYNTHESIS PROGRAM PRODUCT, AND SPEECH SYNTHESIS METHOD
    2.
    发明申请
    SPEECH SYNTHESIS SYSTEM, SPEECH SYNTHESIS PROGRAM PRODUCT, AND SPEECH SYNTHESIS METHOD 有权
    语音合成系统,语音合成程序产品和语音合成方法

    公开(公告)号:US20130268275A1

    公开(公告)日:2013-10-10

    申请号:US13731268

    申请日:2012-12-31

    IPC分类号: G10L13/00

    CPC分类号: G10L13/00 G10L13/07 G10L13/10

    摘要: Waveform concatenation speech synthesis with high sound quality. Prosody with both high accuracy and high sound quality is achieved by performing a two-path search including a speech segment search and a prosody modification value search. An accurate accent is secured by evaluating the consistency of the prosody by using a statistical model of prosody variations (the slope of fundamental frequency) for both of two paths of the speech segment selection and the modification value search. In the prosody modification value search, a prosody modification value sequence that minimizes a modified prosody cost is searched for. This allows a search for a modification value sequence that can increase the likelihood of absolute values or variations of the prosody to the statistical model as high as possible with minimum modification values.

    摘要翻译: 具有高音质的波形级联语音综合。 通过执行包括语音片段搜索和韵律修改值搜索的双向搜索来实现高精度和高音质的韵律。 通过使用语音段选择和修改值搜索的两个路径中的韵律变化(基频的斜率)的统计模型来评估韵律的一致性来确保准确的重音。 在韵律修改值搜索中,搜索最小化修改的韵律成本的韵律修改值序列。 这允许搜索修改值序列,其可以使用最小修改值尽可能高地增加对统计模型的韵律的绝对值或变化的可能性。