发明授权
US06418412B1 Quantization using frequency and mean compensated frequency input data for robust speech recognition 有权
使用频率和平均补偿频率输入数据量化,用于鲁棒语音识别

  • 专利标题: Quantization using frequency and mean compensated frequency input data for robust speech recognition
  • 专利标题(中): 使用频率和平均补偿频率输入数据量化,用于鲁棒语音识别
  • 申请号: US09649737
    申请日: 2000-08-28
  • 公开(公告)号: US06418412B1
    公开(公告)日: 2002-07-09
  • 发明人: Safdar M. AsgharLin Cong
  • 申请人: Safdar M. AsgharLin Cong
  • 主分类号: G10L1514
  • IPC分类号: G10L1514
Quantization using frequency and mean compensated frequency input data for robust speech recognition
摘要:
A speech recognition system utilizes multiple quantizers to process frequency parameters and mean compensated frequency parameters derived from an input signal. The quantizers may be matrix and vector quantizer pairs, and such quantizer pairs may also function as front ends to a second stage speech classifiers such as hidden Markov models (HMMs) and/or utilizes neural network postprocessing to, for example, improve speech recognition performance. Mean compensating the frequency parameters can remove noise frequency components that remain approximately constant during the duration of the input signal. HMM initial state and state transition probabilities derived from common quantizer types and the same input signal may be consolidated to improve recognition system performance and efficiency. Matrix quantization exploits the “evolution” of the speech short-term spectral envelopes as well as frequency domain information, and vector quantization (VQ) primarily operates on frequency domain information. Time domain information may be substantially limited which may introduce error into the matrix quantization, and the VQ may provide error compensation. The matrix and vector quantizers may split spectral subbands to target selected frequencies for enhanced processing and may use fuzzy associations to develop fuzzy observation sequence data. A mixer may provide a variety of input data to the neural network for classification determination. Fuzzy operators may be utilized to reduce quantization error. Multiple codebooks may also be combined to form single respective codebooks for split matrix and split vector quantization to reduce processing resources demand.
信息查询
0/0