Voice personalization of speech synthesizer

发明授权

US06970820B2 Voice personalization of speech synthesizer 有权

标题翻译：语音合成器的语音个性化

请登陆查看更多内容

专利标题： Voice personalization of speech synthesizer
专利标题（中）： 语音合成器的语音个性化
申请号： US09792928

申请日： 2001-02-26
公开(公告)号： US06970820B2

公开(公告)日： 2005-11-29
发明人: Jean-Claude Junqua , Florent Perronnin , Roland Kuhn , Patrick Nguyen
申请人： Jean-Claude Junqua , Florent Perronnin , Roland Kuhn , Patrick Nguyen
申请人地址： JP Osaka
专利权人： Matsushita Electric Industrial Co., Ltd.
当前专利权人： Matsushita Electric Industrial Co., Ltd.
当前专利权人地址： JP Osaka
代理机构： Harness, Dickey & Pierce, PLC
主分类号： G10L13/08
IPC分类号： G10L13/08 ; G10L13/02 ; G10L13/04 ; G10L13/06 ; G10L21/00 ; G10L13/00

Voice personalization of speech synthesizer

摘要：

The speech synthesizer is personalized to sound like or mimic the speech characteristics of an individual speaker. The individual speaker provides a quantity of enrollment data, which can be extracted from a short quantity of speech, and the system modifies the base synthesis parameters to more closely resemble those of the new speaker. More specifically, the synthesis parameters may be decomposed into speaker dependent parameters, such as context-independent parameters, and speaker independent parameters, such as context dependent parameters. The speaker dependent parameters are adapted using enrollment data from the new speaker. After adaptation, the speaker dependent parameters are combined with the speaker independent parameters to provide a set of personalized synthesis parameters. To adapt the parameters with a small amount of enrollment data, an eigenspace is constructed and used to constrain the position of the new speaker so that context independent parameters not provided by the new speaker may be estimated.

摘要（中）：

语音合成器被个性化以发音或模仿单个扬声器的语音特征。单个扬声器提供一定数量的登记数据，其可以从短语言中提取，并且系统将基本合成参数修改为更接近于新说话者的参考数据。更具体地，合成参数可以被分解为与扬声器相关的参数，诸如与上下文无关的参数，以及与扬声器无关的参数，诸如与上下文相关的参数。使用来自新扬声器的注册数据来调整与扬声器相关的参数。在适应之后，将扬声器依赖参数与扬声器独立参数组合以提供一组个性化合成参数。为了使参数具有少量的注册数据，构造本征空间并用于约束新的说话者的位置，以便可以估计不能由新发言者提供的上下文独立参数。

公开/授权文献

US20020120450A1 Voice personalization of speech synthesizer 公开/授权日：2002-08-29

信息查询

Espacenet

IPC分类:

G	物理
G10	乐器；声学
G10L	语音分析或合成；语音识别；语音或声音处理；语音或音频编码或解码
G10L13/00	语音合成；文本-语音合成系统
G10L13/08	.文本分析或文本以外的语音合成参数的产生，例如语义图翻译为音素、韵律产生、重音或声调测定