Granted invention patent
US08321208B2 Speech processing and speech synthesis using a linear combination of bases at peak frequencies for spectral envelope information (status: in force)

Abstract:
An information extraction unit extracts L-dimensional spectral envelope information from each frame of speech data by discrete Fourier transform; the spectral envelope information is represented by L points. A basis storage unit stores N bases (L > N > 1). Each basis has a different frequency band with its maximum at a peak frequency in the L-dimensional spectral domain, and its value is zero at frequencies outside that band along the frequency axis. Two frequency bands whose peak frequencies are adjacent along the frequency axis partially overlap. A parameter calculation unit minimizes the distortion between the spectral envelope information and a linear combination of the bases, each weighted by a coefficient, over the L points of the spectral envelope information by varying the coefficients, and sets the coefficients for which the distortion is minimized as the spectral envelope parameter of the spectral envelope information.
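
The fitting step described in the abstract can be illustrated with a minimal sketch. Several details here are assumptions that the abstract does not specify: the peak frequencies are spaced evenly, each basis is a raised-cosine bump spanning the interval between its two neighbouring peaks, the distortion is the squared error, and the coefficients are found by unconstrained least squares. The names make_bases and fit_envelope are hypothetical and chosen only for illustration.

```python
import numpy as np

def make_bases(L, N):
    """Build N bases over L spectral points (assumed shapes, not the patent's).

    Each basis peaks at its own peak frequency, is zero outside the band
    bounded by the neighbouring peaks, and overlaps its neighbours.
    """
    peaks = np.linspace(0, L - 1, N)  # assumption: evenly spaced peak positions
    k = np.arange(L)
    bases = np.zeros((N, L))
    for i, p in enumerate(peaks):
        if i > 0:  # rising half, from the previous peak up to this peak
            left = peaks[i - 1]
            m = (k >= left) & (k <= p)
            bases[i, m] = 0.5 - 0.5 * np.cos(np.pi * (k[m] - left) / (p - left))
        if i < N - 1:  # falling half, from this peak down to the next peak
            right = peaks[i + 1]
            m = (k >= p) & (k <= right)
            bases[i, m] = 0.5 + 0.5 * np.cos(np.pi * (k[m] - p) / (right - p))
    return bases, peaks

def fit_envelope(envelope, bases):
    """Return the coefficients minimizing the squared error between the
    envelope and the linear combination of bases (ordinary least squares)."""
    coeffs, *_ = np.linalg.lstsq(bases.T, envelope, rcond=None)
    return coeffs

# Usage: compress a 65-point envelope into 9 coefficients and rebuild it.
L, N = 65, 9
bases, peaks = make_bases(L, N)
envelope = 1.0 + np.cos(np.linspace(0, np.pi, L)) ** 2  # toy smooth envelope
coeffs = fit_envelope(envelope, bases)
reconstruction = coeffs @ bases
```

Because each basis is nonzero only within a narrow band, changing one coefficient alters the reconstructed envelope only near the corresponding peak frequency. A constrained solver could be substituted for the least-squares call if, for example, non-negative coefficients were required; that choice is outside what the abstract states.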