Publication No.: US07130799B1
Publication Date: 2006-10-31
Application No.: US09684331
Filing Date: 2000-10-10
Applicant: Katsumi Amano, Shisei Cho, Soichi Toyama, Hiroyuki Ishihara
Inventor: Katsumi Amano, Shisei Cho, Soichi Toyama, Hiroyuki Ishihara
IPC Classification: G10L13/00
Abstract: A speech synthesizing method that produces natural-sounding speech is disclosed. A standardized frame power value of the n-th frame is calculated with the frame power values at the head and tail frames of a phoneme standardized. The average of the power values sampled at a predetermined frequency interval from the power frequency characteristics of the n-th frame is set as a mean frame power value. The sum of squares of the signal levels in one frame of a frequency signal from a sound source is calculated as a frame power correction value. A speech envelope signal is calculated as a function of the standardized frame power value, the frame power correction value, and the mean frame power value. The amplitude level of a speech waveform signal supplied from a vocal tract filter is then adjusted according to the level of the speech envelope signal.
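The quantities named in the abstract can be sketched in a few lines of Python. This is only an illustrative sketch, not the patented method: the abstract does not state how the three values are combined, so the formula in `envelope_level` (a square-root gain) is an assumption made for illustration, and all function and parameter names are hypothetical. The standardized frame power is taken as an input because the abstract does not spell out the standardization rule.

```python
import math


def mean_frame_power(power_spectrum, step):
    """Average of the power values sampled every `step` bins of one frame's
    power frequency characteristics (the "mean frame power value")."""
    samples = power_spectrum[::step]
    return sum(samples) / len(samples)


def frame_power_correction(source_frame):
    """Sum of squares of the sound-source signal levels in one frame
    (the "frame power correction value")."""
    return sum(x * x for x in source_frame)


def envelope_level(standardized_power, correction, mean_power):
    """ASSUMED combining function, not taken from the abstract: the gain that
    scales a frame whose estimated power is correction * mean_power to the
    standardized target power."""
    return math.sqrt(standardized_power / (correction * mean_power))


def apply_envelope(waveform_frame, level):
    """Adjust the amplitude of the vocal tract filter output for one frame."""
    return [level * x for x in waveform_frame]


# Example with made-up numbers for a single frame.
spectrum = [1.0, 2.0, 3.0, 4.0]      # hypothetical power spectrum samples
source = [1.0, -2.0, 2.0]            # hypothetical sound-source frame
level = envelope_level(
    standardized_power=4.0,
    correction=frame_power_correction(source),
    mean_power=mean_frame_power(spectrum, step=2),
)
scaled = apply_envelope([1.0, -1.0], level)
```

Under this assumed formula, the envelope level rises with the standardized frame power and falls as the source power or the mean spectral power grows, which keeps the output amplitude tied to the target power rather than to the raw filter output.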