发明申请
US20060200351A1 Two-stage implementation for phonetic recognition using a bi-directional target-filtering model of speech coarticulation and reduction 有权
使用语音合成和还原的双向目标滤波模型进行语音识别的两阶段实现

  • 专利标题: Two-stage implementation for phonetic recognition using a bi-directional target-filtering model of speech coarticulation and reduction
  • 专利标题(中): 使用语音合成和还原的双向目标滤波模型进行语音识别的两阶段实现
  • 申请号: US11069474
    申请日: 2005-03-01
  • 公开(公告)号: US20060200351A1
    公开(公告)日: 2006-09-07
  • 发明人: Alejandro AceroDong YuLi Deng
  • 申请人: Alejandro AceroDong YuLi Deng
  • 申请人地址: US WA Redmond
  • 专利权人: Microsoft Corporation
  • 当前专利权人: Microsoft Corporation
  • 当前专利权人地址: US WA Redmond
  • 主分类号: G10L15/04
  • IPC分类号: G10L15/04
Two-stage implementation for phonetic recognition using a bi-directional target-filtering model of speech coarticulation and reduction
摘要:
A structured generative model of a speech coarticulation and reduction is described with a novel two-stage implementation. At the first stage, the dynamics of formants or vocal tract resonance (VTR) are generated using prior information of resonance targets in the phone sequence. Bi-directional temporal filtering with finite impulse response (FIR) is applied to the segmental target sequence as the FIR filter's input. At the second stage the dynamics of speech cepstra are predicted analytically based on the FIR filtered VTR targets. The combined system of these two stages thus generates correlated and causally related VTR and cepstral dynamics where phonetic reduction is represented explicitly in the hidden resonance space and implicitly in the observed cepstral space. The combined system also gives the acoustic observation probability given a phone sequence. Using this probability, different phone sequences can be compared and ranked in terms of their respective probability values. This then permits the use of the model for phonetic recognition.
信息查询
0/0