Automatic spoken language identification based on phoneme sequence patterns

发明授权

US08781812B2 Automatic spoken language identification based on phoneme sequence patterns 有权

标题翻译：基于音素序列模式的自动口语识别

请登陆查看更多内容

专利标题： Automatic spoken language identification based on phoneme sequence patterns
专利标题（中）： 基于音素序列模式的自动口语识别
申请号： US13846316

申请日： 2013-03-18
公开(公告)号： US08781812B2

公开(公告)日： 2014-07-15
发明人: Mahapathy Kadirkamanathan , Christopher John Waple
申请人： Longsand Limited
申请人地址： GB
专利权人： Longsand Limited
当前专利权人： Longsand Limited
当前专利权人地址： GB
主分类号： G06F17/20
IPC分类号： G06F17/20

Automatic spoken language identification based on phoneme sequence patterns

摘要：

A language identification system that includes a universal phoneme decoder (UPD) is described. The UPD contains a universal phoneme set representing both 1) all phonemes occurring in the set of two or more spoken languages, and 2) captures phoneme correspondences across languages, such that a set of unique phoneme patterns and probabilities are calculated in order to identify a most likely phoneme occurring each time in the audio files in the set of two or more potential languages in which the UPD was trained on. Each statistical language model (SLM) uses the set of unique phoneme patterns created for each language in the set to distinguish between spoken human languages in the set of languages. The run-time language identifier module identifies a particular human language being spoken by utilizing the linguistic probabilities supplied by the SLMs that are based on the set of unique phoneme patterns created for each language.

摘要（中）：

描述了包括通用音素解码器（UPD）的语言识别系统。 UPD包含一个通用音素集合，表示1）所有发音在两组或多种语言中的音素，以及2）跨语言捕获音素对应，以便计算一组独特的音素模式和概率，以便识别最有可能在音频文件中出现的两种或多种潜在语言的UPD被训练的音频文件中。每个统计语言模型（SLM）使用为集合中的每种语言创建的一组独特的音素模式，以区分该语言集中的口语人类语言。运行时语言标识符模块通过利用由SLM提供的基于为每种语言创建的唯一音素模式的集合提供的语言概率来识别正在说出的特定人类语言。

公开/授权文献

US20130226583A1 AUTOMATIC SPOKEN LANGUAGE IDENTIFICATION BASED ON PHONEME SEQUENCE PATTERNS 公开/授权日：2013-08-29

信息查询

Espacenet