-
公开(公告)号:US07577569B2
公开(公告)日:2009-08-18
申请号:US10949991
申请日:2004-09-24
IPC分类号: G10L13/08
CPC分类号: G10L13/08 , G10L15/187 , G10L15/19
摘要: Text-to-speech (TTS) generation is used in conjunction with large vocabulary speech recognition to say words selected by the speech recognition. The software for performing the large vocabulary speech recognition can share speech modeling data with the TTS software. TTS or recorded audio can be used to automatically say both recognized text and the names of recognized commands after their recognition. The TTS can automatically repeats text recognized by the speech recognition after each of a succession of end of utterance detections. A user can move a cursor back or forward in recognized text, and the TTS can speak one or more words at the cursor location after each such move. The speech recognition can be used to produces a choice list of possible recognition candidates and the TTS can be used to provide spoken output of one or more of the candidates on the choice list.
摘要翻译: 文本到语音(TTS)生成与大词汇语音识别结合使用来说出由语音识别选择的单词。 用于执行大词汇语音识别的软件可以与TTS软件共享语音建模数据。 TTS或录制音频可以用于在识别后自动说出识别的文本和识别的命令的名称。 TTS可以在每次连续的话语检测结束后自动重复通过语音识别识别的文本。 用户可以在识别的文本中向后或向前移动光标,并且在每次这样的移动之后,TTS可以在光标位置说一个或多个单词。 语音识别可用于产生可能的识别候选者的选择列表,并且TTS可以用于在选择列表上提供一个或多个候选者的口语输出。
-
公开(公告)号:US07313526B2
公开(公告)日:2007-12-25
申请号:US10950092
申请日:2004-09-24
摘要: The present invention relates to speech recognition using selectable recognition modes. This includes innovations such as: large vocabulary speech recognition programming that supplies recognized words to external program as they are recognized, and allows a user to select between large vocabulary recognition of an utterance with and without language context from the prior utterance independently of state of the external program; allowing a user to select between continuous and discrete speech recognition that use substantially the same vocabulary; allowing a user to select between continuous and discrete large-vocabulary speech recognition modes; allowing a user to select between at least two different alphabetic entry speech recognition modes; and allowing a user to select from among four or more of the following recognitions modes when creating text: a large-vocabulary mode, an alphabetic entry mode, a number entry mode, and a punctuation entry mode.
摘要翻译: 本发明涉及使用可选择识别模式的语音识别。 这包括创新,例如:大量词汇语音识别程序,在识别出外部程序时,将识别的词提供给外部程序,并允许用户在与先前的语言无关的语言语境的大量词汇识别与非语言语境之间进行选择 外部程序; 允许用户在使用基本相同词汇的连续和离散语音识别之间进行选择; 允许用户在连续和离散的大词汇语音识别模式之间进行选择; 允许用户在至少两个不同的字母进入语音识别模式之间进行选择; 并且允许用户在创建文本时从四种或更多种以下识别模式中进行选择:大词汇模式,字母输入模式,数字输入模式和标点输入模式。
-
3.
公开(公告)号:US07225130B2
公开(公告)日:2007-05-29
申请号:US10227653
申请日:2002-09-06
摘要: The present invention relates to: speech recognition using selectable recognition modes; using choice lists in large-vocabulary speech recognition; enabling users to select word transformations; speech recognition that automatically turns recognition off in one or more specified ways; phone key control of large-vocabulary speech recognition; speech recognition using phone key alphabetic filtering and spelling: speech recognition that enables a user to perform re-utterance recognition; the combination of speech recognition and text-to-speech (TTS) generation; the combination of speech recognition with handwriting and/or character recognition; and the combination of large-vocabulary speech recognition with audio recording and playback.
摘要翻译: 本发明涉及:使用可选择识别模式的语音识别; 在大词汇语音识别中使用选择列表; 使用户能够选择字变换; 以一种或多种指定方式自动转移识别的语音识别; 电话键控大词汇语音识别; 使用手机密钥字母过滤和拼写的语音识别:语音识别,使得用户能够执行重新发音识别; 语音识别和文本到语音(TTS)生成的组合; 语音识别与手写和/或字符识别的组合; 以及大词汇语音识别与音频录制和播放的组合。
-
-