专利检索 ap:"Michael Sean Fee" 第 1 页

1.

发明授权
Speech processing technique for use in speech recognition and speech coding 失效
标题翻译：用于语音识别和语音编码的语音处理技术

公开(公告)号：US06263306B1

公开(公告)日：2001-07-17

申请号：US09259644

申请日：1999-02-26

申请人： Michael Sean Fee , Ching Elizabeth Ho , Partha Pratim Mitra , Bijan Pesaran

发明人： Michael Sean Fee , Ching Elizabeth Ho , Partha Pratim Mitra , Bijan Pesaran

IPC分类号： G10L1104

CPC分类号： G10L25/90 , G10L15/02

摘要： A technique for obtaining an intermediate set of frequency dependant features from a speech signal for use in speech processing and in obtaining estimates of speech pitch. The technique utilizes multiple tapers derived from Slepian sequences to obtain a product of the speech signal and the Slepian functions. Multiple tapered Fourier transforms are then obtained from the product, from which the set of frequency dependent features are calculated. In a preferred embodiment, a derivative of the cepstrum of the speech signal is used as an estimate of speech signal pitch. In another preferred embodiment, the F-spectrum is calculated from the product and the F-cepstrum is obtained therefrom by calculating the Fourier transform of the smoothed derivative of the log of the F-spectrum. The maximum of the F-cepstrum also provides a pitch estimation.

摘要翻译： 一种用于从语音信号中获取频率相关特征的中间集合的技术，用于语音处理和获得语音间距的估计。该技术利用来自Slepian序列的多个锥度来获得语音信号和Slepian函数的乘积。然后从乘积中获得多个渐变傅立叶变换，从中计算出一组频率相关特征。在优选实施例中，语音信号的倒谱的导数用作语音信号音调的估计。在另一个优选实施例中，从乘积计算F频谱，并通过计算F频谱的对数的平滑导数的傅立叶变换来获得F倒谱。 F倒谱的最大值也提供了音调估计。

搜索结果

国家/区域

专利有效性

申请日

公布(公告)日

申请人

申请人所在国/区域

发明人

IPC

IPC部

IPC大类

IPC小类

IPC大组

IPC小组

外观分类