发明授权
- 专利标题: System and method for increasing recognition rates of in-vocabulary words by improving pronunciation modeling
- 专利标题(中): 通过改进发音建模来增加词汇单词识别率的系统和方法
-
申请号: US13311512申请日: 2011-12-05
-
公开(公告)号: US08892441B2公开(公告)日: 2014-11-18
- 发明人: Alistair D. Conkie , Mazin Gilbert , Andrej Ljolje
- 申请人: Alistair D. Conkie , Mazin Gilbert , Andrej Ljolje
- 申请人地址: US GA Atlanta
- 专利权人: AT&T Intellectual Property I, L.P.
- 当前专利权人: AT&T Intellectual Property I, L.P.
- 当前专利权人地址: US GA Atlanta
- 主分类号: G10L15/187
- IPC分类号: G10L15/187 ; G10L15/06
摘要:
The present disclosure relates to systems, methods, and computer-readable media for generating a lexicon for use with speech recognition. The method includes overgenerating potential pronunciations based on symbolic input, identifying potential pronunciations in a speech recognition context, and storing the identified potential pronunciations in a lexicon. Overgenerating potential pronunciations can include establishing a set of conversion rules for short sequences of letters, converting portions of the symbolic input into a number of possible lexical pronunciation variants based on the set of conversion rules, modeling the possible lexical pronunciation variants in one of a weighted network and a list of phoneme lists, and iteratively retraining the set of conversion rules based on improved pronunciations. Symbolic input can include multiple examples of a same spoken word. Speech data can be labeled explicitly or implicitly and can include words as text and recorded audio.
公开/授权文献
信息查询