Speech processing technique for use in speech recognition and speech coding
    1.
    发明授权
    Speech processing technique for use in speech recognition and speech coding 失效
    用于语音识别和语音编码的语音处理技术

    公开(公告)号:US06263306B1

    公开(公告)日:2001-07-17

    申请号:US09259644

    申请日:1999-02-26

    IPC分类号: G10L1104

    CPC分类号: G10L25/90 G10L15/02

    摘要: A technique for obtaining an intermediate set of frequency dependant features from a speech signal for use in speech processing and in obtaining estimates of speech pitch. The technique utilizes multiple tapers derived from Slepian sequences to obtain a product of the speech signal and the Slepian functions. Multiple tapered Fourier transforms are then obtained from the product, from which the set of frequency dependent features are calculated. In a preferred embodiment, a derivative of the cepstrum of the speech signal is used as an estimate of speech signal pitch. In another preferred embodiment, the F-spectrum is calculated from the product and the F-cepstrum is obtained therefrom by calculating the Fourier transform of the smoothed derivative of the log of the F-spectrum. The maximum of the F-cepstrum also provides a pitch estimation.

    摘要翻译: 一种用于从语音信号中获取频率相关特征的中间集合的技术,用于语音处理和获得语音间距的估计。 该技术利用来自Slepian序列的多个锥度来获得语音信号和Slepian函数的乘积。 然后从乘积中获得多个渐变傅立叶变换,从中计算出一组频率相关特征。 在优选实施例中,语音信号的倒谱的导数用作语音信号音调的估计。 在另一个优选实施例中,从乘积计算F频谱,并通过计算F频谱的对数的平滑导数的傅立叶变换来获得F倒谱。 F倒谱的最大值也提供了音调估计。