发明授权
US07643989B2 Method and apparatus for vocal tract resonance tracking using nonlinear predictor and target-guided temporal restraint
有权
使用非线性预测器和目标引导时间约束的声道共振跟踪的方法和装置
- 专利标题: Method and apparatus for vocal tract resonance tracking using nonlinear predictor and target-guided temporal restraint
- 专利标题(中): 使用非线性预测器和目标引导时间约束的声道共振跟踪的方法和装置
-
申请号: US10652976申请日: 2003-08-29
-
公开(公告)号: US07643989B2公开(公告)日: 2010-01-05
- 发明人: Li Deng , Alejandro Acero , Issam Bazzi
- 申请人: Li Deng , Alejandro Acero , Issam Bazzi
- 申请人地址: US WA Redmond
- 专利权人: Microsoft Corporation
- 当前专利权人: Microsoft Corporation
- 当前专利权人地址: US WA Redmond
- 代理机构: Westman, Champlin & Kelly, P.A.
- 代理商 Theodore M. Magee
- 主分类号: G10L19/06
- IPC分类号: G10L19/06
摘要:
A method and apparatus map a set of vocal tract resonant frequencies, together with their corresponding bandwidths, into a simulated acoustic feature vector in the form of LPC cepstrum by calculating a separate function for each individual vocal tract resonant frequency/bandwidth and summing the result to form an element of the simulated feature vector. The simulated feature vector is applied to a model along with an input feature vector to determine a probability that the set of vocal tract resonant frequencies is present in a speech signal. Under one embodiment, the model includes a target-guided transition model that provides a probability of a vocal tract resonant frequency based on a past vocal tract resonant frequency and a target for the vocal tract resonant frequency. Under another embodiment, the phone segmentation is provided by an HMM system and is used to precisely determine which target value to use at each frame.
公开/授权文献
信息查询