Granted invention patent
- Patent title: Automatic spoken language identification based on phoneme sequence patterns
- Patent title (Chinese): 基于音素序列模式的自动口语识别
- Application number: US13846316
- Filing date: 2013-03-18
- Publication number: US08781812B2
- Publication date: 2014-07-15
- Inventors: Mahapathy Kadirkamanathan, Christopher John Waple
- Applicant: Longsand Limited
- Applicant address: GB
- Assignee: Longsand Limited
- Current assignee: Longsand Limited
- Current assignee address: GB
- Main classification: G06F17/20
- IPC classification: G06F17/20
Abstract:
A language identification system that includes a universal phoneme decoder (UPD) is described. The UPD contains a universal phoneme set that both 1) represents all phonemes occurring in a set of two or more spoken languages and 2) captures phoneme correspondences across languages, such that a set of unique phoneme patterns and probabilities is calculated in order to identify the most likely phoneme occurring at each point in time in the audio files, in the set of two or more potential languages on which the UPD was trained. Each statistical language model (SLM) uses the set of unique phoneme patterns created for each language in the set to distinguish between the spoken human languages in the set. A run-time language identifier module identifies the particular human language being spoken by utilizing the linguistic probabilities supplied by the SLMs, which are based on the set of unique phoneme patterns created for each language.
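The scoring step described in the abstract can be illustrated with a minimal sketch. It assumes the UPD has already turned audio into a sequence of symbols from the shared phoneme set, and it stands in a simple add-alpha smoothed bigram model for each per-language SLM. The abstract does not specify the model order or the smoothing, so those choices, the class and function names (`PhonemeBigramSLM`, `identify_language`), the toy phoneme set, and the training sequences are all hypothetical and are not taken from the patent.

```python
import math
from collections import Counter


class PhonemeBigramSLM:
    """Hypothetical per-language SLM: add-alpha smoothed phoneme bigrams."""

    def __init__(self, sequences, phoneme_set, alpha=1.0):
        # Possible next symbols: every universal phoneme plus end-of-sequence.
        self.vocab_size = len(phoneme_set) + 1
        self.alpha = alpha
        self.bigrams = Counter()
        self.contexts = Counter()
        for seq in sequences:
            padded = ["<s>"] + list(seq) + ["</s>"]
            for prev, cur in zip(padded, padded[1:]):
                self.bigrams[(prev, cur)] += 1
                self.contexts[prev] += 1

    def log_likelihood(self, seq):
        """Log-probability of a decoded phoneme sequence under this model."""
        padded = ["<s>"] + list(seq) + ["</s>"]
        total = 0.0
        for prev, cur in zip(padded, padded[1:]):
            num = self.bigrams[(prev, cur)] + self.alpha
            den = self.contexts[prev] + self.alpha * self.vocab_size
            total += math.log(num / den)
        return total


def identify_language(phonemes, models):
    """Return the language whose SLM scores the phoneme sequence highest."""
    return max(models, key=lambda lang: models[lang].log_likelihood(phonemes))


# Toy universal phoneme set and training data (invented for illustration).
UNIVERSAL_PHONEMES = {"k", "a", "t", "s", "o", "i", "n", "r"}
TRAINING = {
    "lang_a": [["k", "a", "t"], ["t", "a", "k"], ["s", "a", "t"]],
    "lang_b": [["n", "i", "r", "o"], ["r", "o", "n", "i"], ["o", "n", "i"]],
}
models = {
    lang: PhonemeBigramSLM(seqs, UNIVERSAL_PHONEMES)
    for lang, seqs in TRAINING.items()
}

print(identify_language(["k", "a", "t", "a"], models))
print(identify_language(["n", "i", "r"], models))
```

Because every model shares one phoneme inventory, the scores are directly comparable across languages, and smoothing keeps a phoneme pair unseen in one language from driving that language's probability to zero.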