System and method for adapting automatic speech recognition pronunciation by acoustic model restructuring

Granted invention patent

US08548807B2 System and method for adapting automatic speech recognition pronunciation by acoustic model restructuring (in force)

Patent title: System and method for adapting automatic speech recognition pronunciation by acoustic model restructuring
Application number: US12480848
Filing date: 2009-06-09
Publication number: US08548807B2
Publication date: 2013-10-01
Inventors: Andrej Ljolje, Alistair D. Conkie, Ann K. Syrdal
Applicants: Andrej Ljolje, Alistair D. Conkie, Ann K. Syrdal
Applicant address: US GA Atlanta
Assignee: AT&T Intellectual Property I, L.P.
Current assignee: AT&T Intellectual Property I, L.P.
Current assignee address: US GA Atlanta
Main classification: G10L15/04
IPC classification: G10L15/04

Abstract:

Disclosed herein are systems, computer-implemented methods, and computer-readable storage media for recognizing speech by adapting automatic speech recognition pronunciation through acoustic model restructuring. The method identifies an acoustic model and a matching pronouncing dictionary trained on typical native speech in a target dialect. It then collects speech from a new speaker and transcribes the collected speech to generate a lattice of plausible phonemes. Next, the method creates a custom speech model that represents each phoneme used in the pronouncing dictionary by a weighted sum of the acoustic models for all the plausible phonemes. The pronouncing dictionary itself does not change, but the model of the acoustic space for each phoneme in the dictionary becomes a weighted sum of the acoustic models of phonemes of the typical native speech. Finally, the method recognizes, via a processor, additional speech from the target speaker using the custom speech model.
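
The weighted-sum idea in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the abstract does not say how the weights are estimated or what form the acoustic models take. The sketch assumes one-dimensional Gaussian models per native phoneme, and assumes the weights come from relative counts of (dictionary phoneme, observed phoneme) pairs read off the phoneme lattice. The names `gaussian_pdf`, `estimate_weights`, and `custom_likelihood`, and all numeric values, are hypothetical.

```python
import math
from collections import Counter, defaultdict


def gaussian_pdf(x, mean, std):
    """Density of a 1-D Gaussian (stand-in for a native acoustic model)."""
    z = (x - mean) / std
    return math.exp(-0.5 * z * z) / (std * math.sqrt(2.0 * math.pi))


def estimate_weights(aligned_pairs):
    """Turn (dictionary_phoneme, observed_phoneme) pairs into mixing weights.

    Assumption: weights are relative frequencies, so the weights for each
    dictionary phoneme sum to 1.
    """
    counts = defaultdict(Counter)
    for dict_ph, obs_ph in aligned_pairs:
        counts[dict_ph][obs_ph] += 1
    weights = {}
    for dict_ph, obs_counts in counts.items():
        total = sum(obs_counts.values())
        weights[dict_ph] = {obs: n / total for obs, n in obs_counts.items()}
    return weights


def custom_likelihood(x, dict_ph, weights, native_models):
    """Likelihood of feature x under the restructured model of dict_ph:
    a weighted sum of the unchanged native phoneme models."""
    return sum(
        w * gaussian_pdf(x, *native_models[obs_ph])
        for obs_ph, w in weights[dict_ph].items()
    )


# Hypothetical native models: phoneme -> (mean, std) of a 1-D feature.
native_models = {"iy": (2.0, 0.5), "ih": (0.0, 0.5)}

# Hypothetical lattice evidence: this speaker usually realizes /iy/ as /ih/.
pairs = [("iy", "iy")] + [("iy", "ih")] * 3 + [("ih", "ih")] * 4
weights = estimate_weights(pairs)

# A feature near the native /ih/ model now scores well for dictionary /iy/.
native_score = gaussian_pdf(0.0, *native_models["iy"])
adapted_score = custom_likelihood(0.0, "iy", weights, native_models)
```

In this sketch the pronouncing dictionary is never edited: a word still maps to /iy/, and only the acoustic score attached to /iy/ changes, which mirrors the point the abstract makes about leaving the dictionary untouched.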

Publication/grant documents

US20100312560A1 SYSTEM AND METHOD FOR ADAPTING AUTOMATIC SPEECH RECOGNITION PRONUNCIATION BY ACOUSTIC MODEL RESTRUCTURING Publication/grant date: 2010-12-09

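IPC classification:

G	Physics
G10	Musical instruments; acoustics
G10L	Speech analysis or synthesis; speech recognition; speech or voice processing; speech or audio coding or decoding
G10L15/00	Speech recognition (G10L17/00 takes precedence)
G10L15/04	.Segmentation; word boundary detection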