Method and apparatus for vocal tract resonance tracking using nonlinear predictor and target-guided temporal restraint

发明授权

US07643989B2 Method and apparatus for vocal tract resonance tracking using nonlinear predictor and target-guided temporal restraint 有权

标题翻译：使用非线性预测器和目标引导时间约束的声道共振跟踪的方法和装置

请登陆查看更多内容

专利标题： Method and apparatus for vocal tract resonance tracking using nonlinear predictor and target-guided temporal restraint
专利标题（中）： 使用非线性预测器和目标引导时间约束的声道共振跟踪的方法和装置
申请号： US10652976

申请日： 2003-08-29
公开(公告)号： US07643989B2

公开(公告)日： 2010-01-05
发明人: Li Deng , Alejandro Acero , Issam Bazzi
申请人： Li Deng , Alejandro Acero , Issam Bazzi
申请人地址： US WA Redmond
专利权人： Microsoft Corporation
当前专利权人： Microsoft Corporation
当前专利权人地址： US WA Redmond
代理机构： Westman, Champlin & Kelly, P.A.
代理商 Theodore M. Magee
主分类号： G10L19/06
IPC分类号： G10L19/06

Method and apparatus for vocal tract resonance tracking using nonlinear predictor and target-guided temporal restraint

摘要：

A method and apparatus map a set of vocal tract resonant frequencies, together with their corresponding bandwidths, into a simulated acoustic feature vector in the form of LPC cepstrum by calculating a separate function for each individual vocal tract resonant frequency/bandwidth and summing the result to form an element of the simulated feature vector. The simulated feature vector is applied to a model along with an input feature vector to determine a probability that the set of vocal tract resonant frequencies is present in a speech signal. Under one embodiment, the model includes a target-guided transition model that provides a probability of a vocal tract resonant frequency based on a past vocal tract resonant frequency and a target for the vocal tract resonant frequency. Under another embodiment, the phone segmentation is provided by an HMM system and is used to precisely determine which target value to use at each frame.

摘要（中）：

一种方法和装置将一组声道共振频率及其相应带宽与LPC倒谱谱形式映射成模拟的声学特征向量，通过计算每个单独的声道共振频率/带宽的单独函数，并将结果相加到形成模拟特征向量的元素。将模拟特征向量与输入特征向量一起应用于模型，以确定声道谐振频率的集合存在于语音信号中的概率。在一个实施例中，该模型包括目标引导的转换模型，其基于过去的声道共振频率和用于声道共振频率的目标提供声道共振频率的概率。在另一个实施例中，电话分割由HMM系统提供，并且用于精确地确定在每个帧处使用哪个目标值。

公开/授权文献

US20050049866A1 Method and apparatus for vocal tract resonance tracking using nonlinear predictor and target-guided temporal constraint 公开/授权日：2005-03-03

信息查询

Espacenet

IPC分类:

G	物理
G10	乐器；声学
G10L	语音分析或合成；语音识别；语音或声音处理；语音或音频编码或解码
G10L19/00	用于冗余度下降情形（例如在声码器中）的语音或音频信号分析-合成技术；语音或音频信号编码或解码，采用源滤波器模型或心理声学分析（乐器中的入G10H）
G10L19/04	.利用预测技术
G10L19/06	..例如短期预测系数的频谱特征的确定或编码