Voice over short message service
    21.
    Type: Granted patent
    Status: In force

    Publication No.: US08081993B2

    Publication date: 2011-12-20

    Application No.: US12146892

    Filing date: 2008-06-26

    Applicant: Daniel L. Roth

    Inventor: Daniel L. Roth

    Abstract: A method of operating a mobile communication device, the method involving: over a wireless messaging channel receiving a text message that contains a non-text representation of an utterance; extracting the non-text representation from the text message; synthesizing an audio representation of the spoken utterance from the non-text representation; and playing the synthesized audio representation through an audio output device on the mobile communication device.
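The receive-side steps in this abstract (extract the non-text representation from a text message, then synthesize audio from it) can be illustrated with a toy sketch. Everything concrete here is an assumption made for illustration and not taken from the patent: the `//VSMS:` marker, the base64 encoding, the byte-pair payload layout, and the sine-tone "synthesis". Playback through an audio device is omitted.

```python
import base64
import math

MARKER = "//VSMS:"   # hypothetical tag marking an embedded voice payload
SAMPLE_RATE = 8000   # assumed output sample rate in Hz

def extract_payload(text_message):
    """Return the non-text bytes embedded in a text message, or None."""
    start = text_message.find(MARKER)
    if start < 0:
        return None
    return base64.b64decode(text_message[start + len(MARKER):])

def synthesize(payload):
    """Toy synthesis: each byte pair is (frequency / 10 Hz, duration / 10 ms)."""
    samples = []
    for i in range(0, len(payload) - 1, 2):
        freq_hz = payload[i] * 10
        n = payload[i + 1] * SAMPLE_RATE // 100
        samples.extend(
            math.sin(2 * math.pi * freq_hz * t / SAMPLE_RATE) for t in range(n)
        )
    return samples

# Sender side (also hypothetical): pack two tones into an ordinary text message.
message = "Hi " + MARKER + base64.b64encode(bytes([44, 5, 22, 10])).decode("ascii")
audio = synthesize(extract_payload(message))
```

A real system would presumably carry compact speech-codec or recognizer-derived parameters rather than tone descriptions; the sketch only shows the extract-then-synthesize flow.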

    EXTENDABLE VOICE COMMANDS
    22.
    Type: Patent application
    Status: In force

    Publication No.: US20110294476A1

    Publication date: 2011-12-01

    Application No.: US13206008

    Filing date: 2011-08-09

    Abstract: A mobile device, such as a cellular telephone, includes a voice interface that includes one part that may not be specific to a particular carrier, and a second part that provides an interface to services that are specific to a carrier or to service or information providers that are not necessarily available with all carriers. A voice command interface provides easy access to the carrier services. The set of carrier services is optionally extendible by the carrier.

    Combined speech recognition and text-to-speech generation
    23.
    Type: Granted patent
    Status: In force

    Publication No.: US07577569B2

    Publication date: 2009-08-18

    Application No.: US10949991

    Filing date: 2004-09-24

    CPC classification number: G10L13/08 G10L15/187 G10L15/19

    Abstract: Text-to-speech (TTS) generation is used in conjunction with large vocabulary speech recognition to say words selected by the speech recognition. The software for performing the large vocabulary speech recognition can share speech modeling data with the TTS software. TTS or recorded audio can be used to automatically say both recognized text and the names of recognized commands after their recognition. The TTS can automatically repeat text recognized by the speech recognition after each of a succession of end of utterance detections. A user can move a cursor back or forward in recognized text, and the TTS can speak one or more words at the cursor location after each such move. The speech recognition can be used to produce a choice list of possible recognition candidates and the TTS can be used to provide spoken output of one or more of the candidates on the choice list.

    Combined speech recognition and sound recording
    24.
    Type: Granted patent
    Status: In force

    Publication No.: US07505911B2

    Publication date: 2009-03-17

    Application No.: US11005568

    Filing date: 2004-12-05

    CPC classification number: G10L15/22 G10L15/26 G10L2015/225

    Abstract: A handheld device with both large-vocabulary speech recognition and audio recording allows users to switch between at least two of the following three modes: (1) recording audio without corresponding speech recognition; (2) recording with speech recognition; and (3) speech recognition without audio recording. A handheld device with both large-vocabulary speech recognition and audio recording enables a user to select a portion of previously recorded sound and have speech recognition performed upon it. A system enables a user to search for a text label associated with portions of unrecognized recorded sound by uttering the label's words. A large-vocabulary system allows users to switch between playing back recorded audio and speech recognition with a single input, with successive audio playbacks automatically starting slightly before the end of prior playback. Also disclosed is a cell phone that allows both large-vocabulary speech recognition and audio recording and playback.

    COLLECTION AND USE OF SIDE INFORMATION IN VOICE-MEDIATED MOBILE SEARCH
    25.
    Type: Patent application
    Status: Under examination (published)

    Publication No.: US20080154870A1

    Publication date: 2008-06-26

    Application No.: US11673995

    Filing date: 2007-02-12

    CPC classification number: H04M1/72522 G06F16/9535 G10L15/08

    Abstract: Methods and systems for providing voice-mediated search capability to a mobile communications device involve receiving a signal from the mobile device that includes a representation of a spoken search request from a user of the mobile device, using speech recognition software to convert the search request into a text search request, extracting side information contained implicitly within the received signal, using the extracted side information to assign the user to a category, sending the text search request and the user category to content providers, receiving from the content providers content that is responsive to the text search request and the user category, and sending to the mobile device search results that are based on content from content providers. The methods and systems further involve sending searches and user categories to advertising providers, and sending advertisements returned by the advertising providers to the mobile device along with the search results.

    VOICE SEARCH-ENABLED MOBILE DEVICE
    26.
    Type: Patent application
    Status: Under examination (published)

    Publication No.: US20080153465A1

    Publication date: 2008-06-26

    Application No.: US11673341

    Filing date: 2007-02-09

    Abstract: Methods and devices for providing a user of a mobile communications device with mobile voice-mediated search capability. The methods and devices involve receiving an utterance from a user of the mobile device, the utterance including a search request; using the speech recognition functionality to recognize that the utterance includes a search request; as a result of recognizing that the utterance includes a search request, establishing a wireless data connection to a remote server; sending a representation of the search request to the remote server over the wireless data connection; receiving search results that are responsive to the search request; and presenting the search results on the mobile device.

    Speech recognition using selectable recognition modes
    27.
    Type: Granted patent
    Status: In force

    Publication No.: US07313526B2

    Publication date: 2007-12-25

    Application No.: US10950092

    Filing date: 2004-09-24

    CPC classification number: G10L15/22 G10L15/19

    Abstract: The present invention relates to speech recognition using selectable recognition modes. This includes innovations such as: large vocabulary speech recognition programming that supplies recognized words to an external program as they are recognized, and allows a user to select between large vocabulary recognition of an utterance with and without language context from the prior utterance independently of the state of the external program; allowing a user to select between continuous and discrete speech recognition that use substantially the same vocabulary; allowing a user to select between continuous and discrete large-vocabulary speech recognition modes; allowing a user to select between at least two different alphabetic entry speech recognition modes; and allowing a user to select from among four or more of the following recognition modes when creating text: a large-vocabulary mode, an alphabetic entry mode, a number entry mode, and a punctuation entry mode.

    Methods, systems, and programming for performing speech recognition
    28.
    Type: Granted patent
    Status: In force

    Publication No.: US07225130B2

    Publication date: 2007-05-29

    Application No.: US10227653

    Filing date: 2002-09-06

    CPC classification number: G10L15/19 G10L15/22

    Abstract: The present invention relates to: speech recognition using selectable recognition modes; using choice lists in large-vocabulary speech recognition; enabling users to select word transformations; speech recognition that automatically turns recognition off in one or more specified ways; phone key control of large-vocabulary speech recognition; speech recognition using phone key alphabetic filtering and spelling; speech recognition that enables a user to perform re-utterance recognition; the combination of speech recognition and text-to-speech (TTS) generation; the combination of speech recognition with handwriting and/or character recognition; and the combination of large-vocabulary speech recognition with audio recording and playback.

    Training speech recognition word models from word samples synthesized by Monte Carlo techniques
    29.
    Type: Granted patent
    Status: In force

    Publication No.: US07133827B1

    Publication date: 2006-11-07

    Application No.: US10361154

    Filing date: 2003-02-06

    CPC classification number: G10L15/063

    Abstract: A new word model is trained from synthetic word samples derived by Monte Carlo techniques from one or more prior word models. The prior word model can be a phonetic word model and the new word model can be a non-phonetic, whole-word, word model. The prior word model can be trained from data that has undergone a first channel normalization and the synthesized word samples from which the new word model is trained can undergo a different channel normalization similar to that to be used in a given speech recognition context. The prior word model can have a first model structure and the new word model can have a second, different, model structure. These differences in model structure can include, for example, differences of model topology; differences of model complexity; and differences in the type of basis function used in a description of such probability distributions.
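The core idea in this abstract, sampling synthetic word examples from a prior model and training a differently structured model on them, can be sketched in a few lines. This is a minimal illustration under stated assumptions, not the patented method: the prior model is a hypothetical chain of 1-D Gaussian states with fixed durations, and the new model is fitted by simple uniform segmentation into a different number of states.

```python
import random
import statistics

# Hypothetical prior word model: one (mean, std, n_frames) triple per state.
PRIOR_MODEL = [(0.0, 1.0, 4), (5.0, 1.0, 4), (-3.0, 0.5, 4)]

def sample_word(prior, rng):
    """Monte Carlo step: draw one synthetic word sample (a list of 1-D frames)."""
    frames = []
    for mean, std, n_frames in prior:
        frames.extend(rng.gauss(mean, std) for _ in range(n_frames))
    return frames

def train_whole_word_model(samples, n_states):
    """Fit a new model with n_states by uniformly segmenting each sample."""
    buckets = [[] for _ in range(n_states)]
    for frames in samples:
        for i, value in enumerate(frames):
            buckets[i * n_states // len(frames)].append(value)
    # One (mean, std) pair per state of the new model.
    return [(statistics.fmean(b), statistics.stdev(b)) for b in buckets]

rng = random.Random(0)  # fixed seed so the run is repeatable
samples = [sample_word(PRIOR_MODEL, rng) for _ in range(500)]
new_model = train_whole_word_model(samples, n_states=6)  # different topology
```

Here the new model has six states where the prior had three, which stands in for the "second, different, model structure" mentioned in the abstract; channel normalization of the synthesized samples is not modeled.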

    Speech recognition using automatic recognition turn off
    30.
    Type: Granted patent
    Status: In force

    Publication No.: US07716058B2

    Publication date: 2010-05-11

    Application No.: US10949972

    Filing date: 2004-09-24

    CPC classification number: G10L15/22 G10L15/19

    Abstract: Large vocabulary speech recognition can automatically turn recognition off in one or more ways. A user command can turn on recognition that is automatically turned off after the next end of utterance. A plurality of buttons can each be associated with a different speech mode and the touch of a given button can turn on, and then automatically turn off, the given button's associated speech recognition mode. These selectable modes can include large vocabulary and alphabetic entry modes, or continuous and discrete modes. A first user input can start recognition that allows a sequence of vocabulary words to be recognized and a second user input can start recognition that turns off after one word has been recognized. A first user input can start recognition that allows a sequence of utterances to be recognized and a second user input can start recognition that allows only a single utterance to be recognized.
