专利检索 ap:"Mahapathy Kadirkamanathan" 第 2 页

11.

发明授权
Speech recognition system 有权
标题翻译：语音识别系统

公开(公告)号：US08229743B2

公开(公告)日：2012-07-24

申请号：US12489786

申请日：2009-06-23

申请人： David Carter , Mahapathy Kadirkamanathan

发明人： David Carter , Mahapathy Kadirkamanathan

IPC分类号： G10L15/00

CPC分类号： G10L15/065 , G10L15/197

摘要： Various methods and apparatus are described for a speech recognition system. In an embodiment, the statistical language model (SLM) provides probability estimates of how linguistically likely a sequence of linguistic items are to occur in that sequence based on an amount of times the sequence of linguistic items occurs in text and phrases in general use. The speech recognition decoder module requests a correction module for one or more corrected probability estimates P′(z|xy) of how likely a linguistic item z follows a given sequence of linguistic items x followed by y, where (x, y, and z) are three variable linguistic items supplied from the decoder module. The correction module is trained to linguistics of a specific domain, and is located in between the decoder module and the SLM in order to adapt the probability estimates supplied by the SLM to the specific domain when those probability estimates from the SLM significantly disagree with the linguistic probabilities in that domain.

摘要翻译： 描述了用于语音识别系统的各种方法和装置。在一个实施例中，统计语言模型（SLM）基于语言项目序列出现在通常使用的文本和短语中的次数，提供语言序列在该序列中如何语言上可能发生的概率估计。语音识别解码器模块向修正模块请求一个或多个校正概率估计P'（z | xy），语言项目z在给定的语言项目序列x之后跟随y，其中（x，y和z））是从解码器模块提供的三个可变语言项目。校正模块被训练成特定领域的语言学，并且位于解码器模块和SLM之间，以便当来自SLM的概率估计显着不同于语言学时，将SLM提供的概率估计值适应于特定领域该领域的概率。

12.

发明授权
Automatic spoken language identification based on phoneme sequence patterns 有权
标题翻译：基于音素序列模式的自动口语识别

公开(公告)号：US08190420B2

公开(公告)日：2012-05-29

申请号：US12535038

申请日：2009-08-04

申请人： Mahapathy Kadirkamanathan , Christopher John Waple

发明人： Mahapathy Kadirkamanathan , Christopher John Waple

IPC分类号： G06F17/20

CPC分类号： G10L15/187 , G10L15/005

摘要： A language identification system that includes a universal phoneme decoder (UPD) is described. The UPD contains a universal phoneme set representing both 1) all phonemes occurring in the set of two or more spoken languages, and 2) captures phoneme correspondences across languages, such that a set of unique phoneme patterns and probabilities are calculated in order to identify a most likely phoneme occurring each time in the audio files in the set of two or more potential languages in which the UPD was trained on. Each statistical language models (SLM) uses the set of unique phoneme patterns created for each language in the set to distinguish between spoken human languages in the set of languages. The run-time language identifier module identifies a particular human language being spoken by utilizing the linguistic probabilities supplied by the one or more SLMs that are based on the set of unique phoneme patterns created for each language.

摘要翻译： 描述了包括通用音素解码器（UPD）的语言识别系统。 UPD包含一个通用音素集合，表示1）所有发音在两组或多种语言中的音素，以及2）跨语言捕获音素对应，以便计算一组独特的音素模式和概率，以便识别最有可能在音频文件中出现的两种或多种潜在语言的UPD被训练的音频文件中。每个统计语言模型（SLM）使用为集合中的每种语言创建的一组独特的音素模式，以区分该语言集中的口语人类语言。运行时语言标识符模块通过利用由基于为每种语言创建的唯一音素模式的集合的一个或多个SLM提供的语言概率来识别正在说出的特定人类语言。

搜索结果

国家/区域

专利有效性

申请日

公布(公告)日

申请人

申请人所在国/区域

发明人

IPC

IPC部

IPC大类

IPC小类

IPC大组

IPC小组

外观分类