发明授权
US08321208B2 Speech processing and speech synthesis using a linear combination of bases at peak frequencies for spectral envelope information
有权
在频谱包络信息的峰值频率处使用基线的线性组合的语音处理和语音合成
- 专利标题: Speech processing and speech synthesis using a linear combination of bases at peak frequencies for spectral envelope information
- 专利标题(中): 在频谱包络信息的峰值频率处使用基线的线性组合的语音处理和语音合成
-
申请号: US12327399申请日: 2008-12-03
-
公开(公告)号: US08321208B2公开(公告)日: 2012-11-27
- 发明人: Masatsune Tamura , Katsumi Tsuchiya , Takehiko Kagoshima
- 申请人: Masatsune Tamura , Katsumi Tsuchiya , Takehiko Kagoshima
- 申请人地址: JP Tokyo
- 专利权人: Kabushiki Kaisha Toshiba
- 当前专利权人: Kabushiki Kaisha Toshiba
- 当前专利权人地址: JP Tokyo
- 代理机构: Oblon, Spivak, McClelland, Maier & Neustadt, L.L.P.
- 优先权: JP2007-312336 20071203
- 主分类号: G10L13/06
- IPC分类号: G10L13/06 ; G10L19/02
摘要:
An information extraction unit extracts spectral envelope information of L-dimension from each frame of speech data by discrete Fourier transform. The spectral envelope information is represented by L points. A basis storage unit stores N bases (L>N>1). Each basis is differently a frequency band having a maximum as a peak frequency in a spectral domain having L-dimension. A value corresponding to a frequency outside the frequency band along a frequency axis of the spectral domain is zero. Two frequency bands of which two peak frequencies are adjacent along the frequency axis partially overlap. A parameter calculation unit minimizes a distortion between the spectral envelope information and a linear combination of each basis with a coefficient for each of L points of the spectral envelope information by changing the coefficient, and sets the coefficient of each basis from which the distortion is minimized to a spectral envelope parameter of the spectral envelope information.
公开/授权文献
信息查询