Method and apparatus for vocal tract resonance tracking using nonlinear predictor and target-guided temporal restraint
    1.
    发明授权
    Method and apparatus for vocal tract resonance tracking using nonlinear predictor and target-guided temporal restraint 有权
    使用非线性预测器和目标引导时间约束的声道共振跟踪的方法和装置

    公开(公告)号:US07643989B2

    公开(公告)日:2010-01-05

    申请号:US10652976

    申请日:2003-08-29

    IPC分类号: G10L19/06

    CPC分类号: G10L25/48 G10L25/15

    摘要: A method and apparatus map a set of vocal tract resonant frequencies, together with their corresponding bandwidths, into a simulated acoustic feature vector in the form of LPC cepstrum by calculating a separate function for each individual vocal tract resonant frequency/bandwidth and summing the result to form an element of the simulated feature vector. The simulated feature vector is applied to a model along with an input feature vector to determine a probability that the set of vocal tract resonant frequencies is present in a speech signal. Under one embodiment, the model includes a target-guided transition model that provides a probability of a vocal tract resonant frequency based on a past vocal tract resonant frequency and a target for the vocal tract resonant frequency. Under another embodiment, the phone segmentation is provided by an HMM system and is used to precisely determine which target value to use at each frame.

    摘要翻译: 一种方法和装置将一组声道共振频率及其相应带宽与LPC倒谱谱形式映射成模拟的声学特征向量,通过计算每个单独的声道共振频率/带宽的单独函数,并将结果相加到 形成模拟特征向量的元素。 将模拟特征向量与输入特征向量一起应用于模型,以确定声道谐振频率的集合存在于语音信号中的概率。 在一个实施例中,该模型包括目标引导的转换模型,其基于过去的声道共振频率和用于声道共振频率的目标提供声道共振频率的概率。 在另一个实施例中,电话分割由HMM系统提供,并且用于精确地确定在每个帧处使用哪个目标值。

    Method and apparatus for vocal tract resonance tracking using nonlinear predictor and target-guided temporal constraint
    2.
    发明申请
    Method and apparatus for vocal tract resonance tracking using nonlinear predictor and target-guided temporal constraint 有权
    使用非线性预测器和目标引导时间约束的声道共振跟踪的方法和装置

    公开(公告)号:US20050049866A1

    公开(公告)日:2005-03-03

    申请号:US10652976

    申请日:2003-08-29

    CPC分类号: G10L25/48 G10L25/15

    摘要: A method and apparatus map a set of vocal tract resonant frequencies, together with their corresponding bandwidths, into a simulated acoustic feature vector in the form of LPC cepstrum by calculating a separate function for each individual vocal tract resonant frequency/bandwidth and summing the result to form an element of the simulated feature vector. The simulated feature vector is applied to a model along with an input feature vector to determine a probability that the set of vocal tract resonant frequencies is present in a speech signal. Under one embodiment, the model includes a target-guided transition model that provides a probability of a vocal tract resonant frequency based on a past vocal tract resonant frequency and a target for the vocal tract resonant frequency. Under another embodiment, the phone segmentation is provided by an HMM system and is used to precisely determine which target value to use at each frame.

    摘要翻译: 一种方法和装置将一组声道共振频率及其相应带宽与LPC倒谱谱形式映射成模拟的声学特征向量,通过计算每个单独的声道共振频率/带宽的单独函数,并将结果相加到 形成模拟特征向量的元素。 将模拟特征向量与输入特征向量一起应用于模型,以确定声道谐振频率的集合存在于语音信号中的概率。 在一个实施例中,该模型包括目标引导的转换模型,其基于过去的声道共振频率和用于声道共振频率的目标提供声道共振频率的概率。 在另一个实施例中,电话分割由HMM系统提供,并且用于精确地确定在每个帧处使用哪个目标值。

    Method and apparatus for formant tracking using a residual model
    3.
    发明授权
    Method and apparatus for formant tracking using a residual model 有权
    使用残差模型进行共振峰跟踪的方法和装置

    公开(公告)号:US07424423B2

    公开(公告)日:2008-09-09

    申请号:US10404411

    申请日:2003-04-01

    IPC分类号: G10L19/04

    CPC分类号: G10L15/02 G10L25/15

    摘要: A method of tracking formants defines a formant search space comprising sets of formants to be searched. Formants are identified for a first frame in the speech utterance by searching the entirety of the formant search space using the codebook, and for the remaining frames by searching the same space using both the codebook and the continuity constraint across adjacent frames. Under one embodiment, the formants are identified by mapping sets of formants into feature vectors and applying the feature vectors to a model. Formants are also identified by applying dynamic programming to search for the best sequence that optimally satisfies the continuity constraint required by the model.

    摘要翻译: 跟踪共享器的方法定义了包括要搜索的共振峰集合的共振峰搜索空间。 通过使用码本搜索整体的共振峰搜索空间,并且通过使用码本和相邻帧之间的连续性约束搜索相同的空间,为语音语音中的第一帧识别共振峰。 在一个实施例中,通过将共振峰集合映射到特征向量中并将特征向量应用于模型来识别共振峰。 还通过应用动态规划来搜索最优序列,以最佳地满足模型所需的连续性约束,来确定共振峰。

    Greedy algorithm for identifying values for vocal tract resonance vectors

    公开(公告)号:US20060047506A1

    公开(公告)日:2006-03-02

    申请号:US10925585

    申请日:2004-08-25

    IPC分类号: G10L19/06

    CPC分类号: G10L25/48 G10L15/02 G10L25/15

    摘要: A method and apparatus identify values for components of a vocal tract resonance vector by sequentially determining values for each component of the vocal tract resonance vector. To determine a value for a component, the other components are set to static values. A plurality of values for a function are then determined using a plurality of values for the component that is being determined while using the static values for all of the other components. One of the plurality of values for the component is then selected based on the plurality of values for the function.