发明授权
- 专利标题: System and method for rescoring N-best hypotheses of an automatic speech recognition system
- 专利标题(中): 自动语音识别系统的N最佳假设的系统和方法
-
申请号: US09286099申请日: 1999-04-02
-
公开(公告)号: US07761296B1公开(公告)日: 2010-07-20
- 发明人: Raimo Bakis , Ellen M. Eide
- 申请人: Raimo Bakis , Ellen M. Eide
- 申请人地址: US NY Armonk
- 专利权人: International Business Machines Corporation
- 当前专利权人: International Business Machines Corporation
- 当前专利权人地址: US NY Armonk
- 代理机构: F. Chau & Associates, LLC
- 主分类号: G10L17/00
- IPC分类号: G10L17/00 ; G10L15/00
摘要:
A system and method for rescoring the N-best hypotheses from an automatic speech recognition system by comparing an original speech waveform to synthetic speech waveforms that are generated for each text sequence of the N-best hypotheses. A distance is calculated from the original speech waveform to each of the synthesized waveforms, and the text associated with the synthesized waveform that is determined to be closest to the original waveform is selected as the final hypothesis. The original waveform and each synthesized waveform are aligned to a corresponding text sequence on a phoneme level. The mean of the feature vectors which align to each phoneme is computed for the original waveform as well as for each of the synthesized hypotheses. The distance of a synthesized hypothesis to the original speech signal is then computed as the sum over all phonemes in the hypothesis of the Euclidean distance between the means of the feature vectors of the frames aligning to that phoneme for the original and the synthesized signals. The text of the hypothesis which is closest under the above metric to the original waveform is chosen as the final system output.
信息查询