Systems and methods for word recognition
    1.
    发明授权
    Systems and methods for word recognition 失效
    词识别的系统和方法

    公开(公告)号:US5680511A

    公开(公告)日:1997-10-21

    申请号:US477287

    申请日:1995-06-07

    IPC分类号: G10L15/18 G10L9/00

    CPC分类号: G10L15/1815

    摘要: In one aspect, the invention provides word recognition systems that operate to recognize an unrecognized or ambiguous word that occurs within a passage of words. The system can offer several words as choice words for inserting into the passage to replace the unrecognized word. The system can select the best choice word by using the choice word to extract from a reference source, sample passages of text that relate to the choice word. For example, the system can select the dictionary passage that defines the choice word. The system then compares the selected passage to the current passage, and generates a score that indicates the likelihood that the choice word would occur within that passage of text. The system can select the choice word with the best score to substitute into the passage. The passage of words being analyzed can be any word sequence including an utterance, a portion of handwritten text, a portion of typewritten text or other such sequence of words, numbers and characters. Alternative embodiments of the present invention are disclosed which function to retrieve documents from a library as a function of context.

    摘要翻译: 在一个方面,本发明提供了操作以识别在单词通过内出现的未识别或不明确的单词的单词识别系统。 该系统可以提供多个单词作为选择单词,用于插入到段落中以替换未被识别的单词。 系统可以通过使用选择单词从参考源中提取出最佳选择单词,与选择单词相关的文本的样本段落。 例如,系统可以选择定义选择字的字典通道。 然后,系统将所选择的段落与当前段落进行比较,并生成一个分数,指示选择单词在文本段落内发生的可能性。 系统可以选择具有最佳分数的选择词来代替段落。 正在分析的单词的通过可以是包括发音,手写文本的一部分,打字文本的一部分或其他这样的单词,数字和字符序列的任何单词序列。 公开了本发明的替代实施例,其功能是根据上下文从库中检索文档。

    Training and using pronunciation guessers in speech recognition
    3.
    发明授权
    Training and using pronunciation guessers in speech recognition 有权
    在语音识别中训练和使用发音猜测器

    公开(公告)号:US07467087B1

    公开(公告)日:2008-12-16

    申请号:US10684135

    申请日:2003-10-10

    IPC分类号: G10L13/00 G10L15/00 G10L15/06

    摘要: The error rate of a pronunciation guesser that guesses the phonetic spelling of words used in speech recognition is improved by causing its training to weigh letter-to-phoneme mappings used as data in such training as a function of the frequency of the words in which such mappings occur. Preferably the ratio of the weight to word frequency increases as word frequencies decreases. Acoustic phoneme models for use in speech recognition with phonetic spellings generated by a pronunciation guesser that makes errors are trained against word models whose phonetic spellings have been generated by a pronunciation guesser that makes similar errors. As a result, the acoustic models represent blends of phoneme sounds that reflect the spelling errors made by the pronunciation guessers. Speech recognition enabled systems are made by storing in them both a pronunciation guesser and a corresponding set of such blended acoustic models.

    摘要翻译: 猜测语音识别中使用的单词的拼音拼写的发音猜测器的错误率通过使其训练来衡量用作这种训练中的数据的字母到音素映射,作为其中这样的单词的频率的函数 映射发生。 优选地,权重与字频率的比率随着字频率的降低而增加。 用于语音识别的声学音素模型,由发音猜测器产生的语音拼写用于产生错误的声音拼音针对由发音猜测器产生类似错误的语音拼写的单词模型。 结果,声学模型表示声音发音的混合,反映了发音猜测者的拼写错误。 通过在其中存储发音猜测器和相应的一组这样的混合声学模型来进行支持语音识别的系统。

    Multilingual speech recognition
    4.
    发明授权
    Multilingual speech recognition 有权
    多语言语音识别

    公开(公告)号:US08065144B1

    公开(公告)日:2011-11-22

    申请号:US12699172

    申请日:2010-02-03

    IPC分类号: G10L15/06 G10L15/28

    CPC分类号: G10L15/005

    摘要: A method for speech recognition. The method uses a single pronunciation estimator to train acoustic phoneme models and recognize utterances from multiple languages. The method includes accepting text spellings of training words in a plurality of sets of training words, each set corresponding to a different one of a plurality of languages. The method also includes, for each of the sets of training words in the plurality, receiving pronunciations for the training words in the set, the pronunciations being characteristic of native speakers of the language of the set, the pronunciations also being in terms of subword units at least some of which are common to two or more of the languages. The method also includes training a single pronunciation estimator using data comprising the text spellings and the pronunciations of the training words.

    摘要翻译: 一种语音识别方法。 该方法使用单个发音估计器来训练声音音素模型并识别来自多种语言的语音。 该方法包括接受多组训练词中训练词的文本拼写,每组训练单词对应于多种语言中的不同语言。 该方法还包括对于多个训练词集合中的每一组,接收组中的训练单词的发音,发音是该组语言的母语者的特征,发音还以子单位 其中至少有一些是两种或多种语言的共同之处。 该方法还包括使用包括文本拼写和训练词的发音的数据训练单个发音估计器。

    Multilingual speech recognition
    5.
    发明授权
    Multilingual speech recognition 有权
    多语言语音识别

    公开(公告)号:US07716050B2

    公开(公告)日:2010-05-11

    申请号:US10716027

    申请日:2003-11-17

    IPC分类号: G10L15/00

    CPC分类号: G10L15/005

    摘要: A method for speech recognition. The method uses a single pronunciation estimator to train acoustic phoneme models and recognize utterances from multiple languages. The method includes accepting text spellings of training words in a plurality of sets of training words, each set corresponding to a different one of a plurality of languages. The method also includes, for each of the sets of training words in the plurality, receiving pronunciations for the training words in the set, the pronunciations being characteristic of native speakers of the language of the set, the pronunciations also being in terms of subword units at least some of which are common to two or more of the languages. The method also includes training a single pronunciation estimator using data comprising the text spellings and the pronunciations of the training words.

    摘要翻译: 一种语音识别方法。 该方法使用单个发音估计器来训练声音音素模型并识别来自多种语言的语音。 该方法包括接受多组训练词中训练词的文本拼写,每组训练单词对应于多种语言中的不同语言。 该方法还包括对于多个训练词集合中的每一组,接收组中的训练单词的发音,发音是该组语言的母语者的特征,发音还以子单位 其中至少有一些是两种或多种语言的共同之处。 该方法还包括使用包括文本拼写和训练词的发音的数据训练单个发音估计器。

    Expanding an effective vocabulary of a speech recognition system
    6.
    发明授权
    Expanding an effective vocabulary of a speech recognition system 有权
    扩展语音识别系统的有效词汇

    公开(公告)号:US07120582B1

    公开(公告)日:2006-10-10

    申请号:US09390370

    申请日:1999-09-07

    IPC分类号: G10L15/00 G10L15/06

    摘要: The invention provides techniques for creating and using fragmented word models to increase the effective size of an active vocabulary of a speech recognition system. The active vocabulary represents all words and word fragments that the speech recognition system is able to recognize. Each word may be represented by a combination of acoustic models. As such, the active vocabulary represents the combinations of acoustic models that the speech recognition system may compare to a user's speech to identify acoustic models that best match the user's speech. The effective size of the active vocabulary may be increased by dividing words into constituent components or fragments (for example, prefixes, suffixes, separators, infixes, and roots) and including each component as a separate entry in the active vocabulary. Thus, for example, a list of words and their plural forms (for example, “book, books, cook, cooks, hook, hooks, look and looks”) may be represented in the active vocabulary using the words (for example, “book, cook, hook and look”) and an entry representing the suffix that makes the words plural (for example, “+s”, where the “+” preceding the “s” indicates that “+s” is a suffix). For a large list of words, and ignoring the entry associated with the suffix, this technique may reduce the number of vocabulary entries needed to represent the list of words considerably.

    摘要翻译: 本发明提供了用于创建和使用分割词模型以增加语音识别系统的活跃词汇表的有效大小的技术。 活动词汇表示语音识别系统能够识别的所有单词和单词片段。 每个单词可以由声学模型的组合来表示。 因此,活动词汇表示声学模型的组合,语音识别系统可以与用户的语音进行比较,以识别与用户的语音最匹配的声学模型。 活动词汇表的有效大小可以通过将单词划分成组成组件或片段(例如,前缀,后缀,分隔符,中缀和根)并将每个组件作为活动词汇表中的单独条目来增加。 因此,例如,可以在活动词汇表中使用单词(例如,“书籍,书籍,烹饪,烹饪,钩子,钩子,外观和外观”)的单词列表及其复数形式 书签,烹饪,钩子和外观“)和表示使单词复数的后缀的条目(例如,”+ s“,其中”+“之前的”+“表示”+ s“是后缀)。 对于大量单词列表,忽略与后缀相关联的条目,这种技术可能会大大减少用于表示单词列表所需的词汇表数量。