专利检索 ap:("Janet M. Baker" OR "Laurence S. Gillick" OR "James K. Baker" OR "Jonathan P. Yamron") AND inv:"Jonathan P. Yamron" 第 1 页

1.

发明授权
Systems and methods for word recognition 失效
标题翻译：词识别的系统和方法

公开(公告)号：US5680511A

公开(公告)日：1997-10-21

申请号：US477287

申请日：1995-06-07

申请人： Janet M. Baker , Laurence S. Gillick , James K. Baker , Jonathan P. Yamron

发明人： Janet M. Baker , Laurence S. Gillick , James K. Baker , Jonathan P. Yamron

IPC分类号： G10L15/18 , G10L9/00

CPC分类号： G10L15/1815

摘要： In one aspect, the invention provides word recognition systems that operate to recognize an unrecognized or ambiguous word that occurs within a passage of words. The system can offer several words as choice words for inserting into the passage to replace the unrecognized word. The system can select the best choice word by using the choice word to extract from a reference source, sample passages of text that relate to the choice word. For example, the system can select the dictionary passage that defines the choice word. The system then compares the selected passage to the current passage, and generates a score that indicates the likelihood that the choice word would occur within that passage of text. The system can select the choice word with the best score to substitute into the passage. The passage of words being analyzed can be any word sequence including an utterance, a portion of handwritten text, a portion of typewritten text or other such sequence of words, numbers and characters. Alternative embodiments of the present invention are disclosed which function to retrieve documents from a library as a function of context.

摘要翻译： 在一个方面，本发明提供了操作以识别在单词通过内出现的未识别或不明确的单词的单词识别系统。该系统可以提供多个单词作为选择单词，用于插入到段落中以替换未被识别的单词。系统可以通过使用选择单词从参考源中提取出最佳选择单词，与选择单词相关的文本的样本段落。例如，系统可以选择定义选择字的字典通道。然后，系统将所选择的段落与当前段落进行比较，并生成一个分数，指示选择单词在文本段落内发生的可能性。系统可以选择具有最佳分数的选择词来代替段落。正在分析的单词的通过可以是包括发音，手写文本的一部分，打字文本的一部分或其他这样的单词，数字和字符序列的任何单词序列。公开了本发明的替代实施例，其功能是根据上下文从库中检索文档。

2.

发明授权
Text segmentation and identification of topic using language models 失效
标题翻译：使用语言模型的文本分割和主题识别

公开(公告)号：US6052657A

公开(公告)日：2000-04-18

申请号：US978487

申请日：1997-11-25

申请人： Jonathan P. Yamron , Paul G. Bamberg , James Barnett , Laurence S. Gillick , Paul A. van Mulbregt

发明人： Jonathan P. Yamron , Paul G. Bamberg , James Barnett , Laurence S. Gillick , Paul A. van Mulbregt

IPC分类号： G06F17/30 , G06F17/27 , G10L15/00

CPC分类号： G06F17/30616

摘要： System for segmenting text and identifying segment topics that match a user-specified topic. Topic tracking system creates a set of topic models from training text containing topic boundaries using a clustering algorithm. User supplies topic text. System creates a topic model of the topic text and adds the topic model to the set of topic models. User-supplied test text is segmented according to the set of topic models. Segments relating to the same topic as the topic text are selected.

摘要翻译： 用于分割文本并识别符合用户指定主题的段主题的系统。主题跟踪系统使用聚类算法从训练包含主题边界的文本创建一组主题模型。用户提供主题文本。系统创建主题文本的主题模型，并将主题模型添加到主题模型集合中。用户提供的测试文本根据主题模型的集合进行分段。选择与主题文本相同主题的细分。

3.

发明授权
Training and using pronunciation guessers in speech recognition 有权
标题翻译：在语音识别中训练和使用发音猜测器

公开(公告)号：US07467087B1

公开(公告)日：2008-12-16

申请号：US10684135

申请日：2003-10-10

申请人： Laurence S. Gillick , Steven A. Wegmann , Jonathan P. Yamron

发明人： Laurence S. Gillick , Steven A. Wegmann , Jonathan P. Yamron

IPC分类号： G10L13/00 , G10L15/00 , G10L15/06

CPC分类号： G10L15/063 , G10L15/26 , G10L2015/025

摘要： The error rate of a pronunciation guesser that guesses the phonetic spelling of words used in speech recognition is improved by causing its training to weigh letter-to-phoneme mappings used as data in such training as a function of the frequency of the words in which such mappings occur. Preferably the ratio of the weight to word frequency increases as word frequencies decreases. Acoustic phoneme models for use in speech recognition with phonetic spellings generated by a pronunciation guesser that makes errors are trained against word models whose phonetic spellings have been generated by a pronunciation guesser that makes similar errors. As a result, the acoustic models represent blends of phoneme sounds that reflect the spelling errors made by the pronunciation guessers. Speech recognition enabled systems are made by storing in them both a pronunciation guesser and a corresponding set of such blended acoustic models.

摘要翻译： 猜测语音识别中使用的单词的拼音拼写的发音猜测器的错误率通过使其训练来衡量用作这种训练中的数据的字母到音素映射，作为其中这样的单词的频率的函数映射发生。优选地，权重与字频率的比率随着字频率的降低而增加。用于语音识别的声学音素模型，由发音猜测器产生的语音拼写用于产生错误的声音拼音针对由发音猜测器产生类似错误的语音拼写的单词模型。结果，声学模型表示声音发音的混合，反映了发音猜测者的拼写错误。通过在其中存储发音猜测器和相应的一组这样的混合声学模型来进行支持语音识别的系统。

4.

发明授权
Multilingual speech recognition 有权
标题翻译：多语言语音识别

公开(公告)号：US08065144B1

公开(公告)日：2011-11-22

申请号：US12699172

申请日：2010-02-03

申请人： Laurence S. Gillick , Thomas E. Lynch , Michael J. Newman , Daniel L. Roth , Steven A. Wegmann , Jonathan P. Yamron

发明人： Laurence S. Gillick , Thomas E. Lynch , Michael J. Newman , Daniel L. Roth , Steven A. Wegmann , Jonathan P. Yamron

IPC分类号： G10L15/06 , G10L15/28

CPC分类号： G10L15/005

摘要： A method for speech recognition. The method uses a single pronunciation estimator to train acoustic phoneme models and recognize utterances from multiple languages. The method includes accepting text spellings of training words in a plurality of sets of training words, each set corresponding to a different one of a plurality of languages. The method also includes, for each of the sets of training words in the plurality, receiving pronunciations for the training words in the set, the pronunciations being characteristic of native speakers of the language of the set, the pronunciations also being in terms of subword units at least some of which are common to two or more of the languages. The method also includes training a single pronunciation estimator using data comprising the text spellings and the pronunciations of the training words.

摘要翻译： 一种语音识别方法。该方法使用单个发音估计器来训练声音音素模型并识别来自多种语言的语音。该方法包括接受多组训练词中训练词的文本拼写，每组训练单词对应于多种语言中的不同语言。该方法还包括对于多个训练词集合中的每一组，接收组中的训练单词的发音，发音是该组语言的母语者的特征，发音还以子单位其中至少有一些是两种或多种语言的共同之处。该方法还包括使用包括文本拼写和训练词的发音的数据训练单个发音估计器。

5.

发明授权
Multilingual speech recognition 有权
标题翻译：多语言语音识别

公开(公告)号：US07716050B2

公开(公告)日：2010-05-11

申请号：US10716027

申请日：2003-11-17

申请人： Laurence S. Gillick , Thomas E. Lynch , Michael J. Newman , Daniel L. Roth , Steven A. Wegmann , Jonathan P. Yamron

发明人： Laurence S. Gillick , Thomas E. Lynch , Michael J. Newman , Daniel L. Roth , Steven A. Wegmann , Jonathan P. Yamron

IPC分类号： G10L15/00

CPC分类号： G10L15/005

摘要： A method for speech recognition. The method uses a single pronunciation estimator to train acoustic phoneme models and recognize utterances from multiple languages. The method includes accepting text spellings of training words in a plurality of sets of training words, each set corresponding to a different one of a plurality of languages. The method also includes, for each of the sets of training words in the plurality, receiving pronunciations for the training words in the set, the pronunciations being characteristic of native speakers of the language of the set, the pronunciations also being in terms of subword units at least some of which are common to two or more of the languages. The method also includes training a single pronunciation estimator using data comprising the text spellings and the pronunciations of the training words.

摘要翻译： 一种语音识别方法。该方法使用单个发音估计器来训练声音音素模型并识别来自多种语言的语音。该方法包括接受多组训练词中训练词的文本拼写，每组训练单词对应于多种语言中的不同语言。该方法还包括对于多个训练词集合中的每一组，接收组中的训练单词的发音，发音是该组语言的母语者的特征，发音还以子单位其中至少有一些是两种或多种语言的共同之处。该方法还包括使用包括文本拼写和训练词的发音的数据训练单个发音估计器。

6.

发明授权
Expanding an effective vocabulary of a speech recognition system 有权
标题翻译：扩展语音识别系统的有效词汇

公开(公告)号：US07120582B1

公开(公告)日：2006-10-10

申请号：US09390370

申请日：1999-09-07

申请人： Jonathan H. Young , Haakon L. Chevalier , Laurence S. Gillick , Toffee A. Albina , Marlboro B. Moore, III , Paul E. Rensing , Jonathan P. Yamron

发明人： Jonathan H. Young , Haakon L. Chevalier , Laurence S. Gillick , Toffee A. Albina , Marlboro B. Moore, III , Paul E. Rensing , Jonathan P. Yamron

IPC分类号： G10L15/00 , G10L15/06

CPC分类号： G10L15/063 , G10L2015/0633 , G10L2015/0635 , G10L2015/0636

摘要： The invention provides techniques for creating and using fragmented word models to increase the effective size of an active vocabulary of a speech recognition system. The active vocabulary represents all words and word fragments that the speech recognition system is able to recognize. Each word may be represented by a combination of acoustic models. As such, the active vocabulary represents the combinations of acoustic models that the speech recognition system may compare to a user's speech to identify acoustic models that best match the user's speech. The effective size of the active vocabulary may be increased by dividing words into constituent components or fragments (for example, prefixes, suffixes, separators, infixes, and roots) and including each component as a separate entry in the active vocabulary. Thus, for example, a list of words and their plural forms (for example, “book, books, cook, cooks, hook, hooks, look and looks”) may be represented in the active vocabulary using the words (for example, “book, cook, hook and look”) and an entry representing the suffix that makes the words plural (for example, “+s”, where the “+” preceding the “s” indicates that “+s” is a suffix). For a large list of words, and ignoring the entry associated with the suffix, this technique may reduce the number of vocabulary entries needed to represent the list of words considerably.

摘要翻译： 本发明提供了用于创建和使用分割词模型以增加语音识别系统的活跃词汇表的有效大小的技术。活动词汇表示语音识别系统能够识别的所有单词和单词片段。每个单词可以由声学模型的组合来表示。因此，活动词汇表示声学模型的组合，语音识别系统可以与用户的语音进行比较，以识别与用户的语音最匹配的声学模型。活动词汇表的有效大小可以通过将单词划分成组成组件或片段（例如，前缀，后缀，分隔符，中缀和根）并将每个组件作为活动词汇表中的单独条目来增加。因此，例如，可以在活动词汇表中使用单词（例如，“书籍，书籍，烹饪，烹饪，钩子，钩子，外观和外观”）的单词列表及其复数形式书签，烹饪，钩子和外观“）和表示使单词复数的后缀的条目（例如，”+ s“，其中”+“之前的”+“表示”+ s“是后缀）。对于大量单词列表，忽略与后缀相关联的条目，这种技术可能会大大减少用于表示单词列表所需的词汇表数量。

搜索结果

国家/区域

专利有效性

申请日

公布(公告)日

申请人

申请人所在国/区域

发明人

IPC

IPC部

IPC大类

IPC小类

IPC大组

IPC小组

外观分类