SELECTIVE ENABLEMENT OF SPEECH RECOGNITION GRAMMARS
    1.
    发明申请
    SELECTIVE ENABLEMENT OF SPEECH RECOGNITION GRAMMARS 有权
    选择性认知语音识别格式

    公开(公告)号:US20080189111A1

    公开(公告)日:2008-08-07

    申请号:US12042968

    申请日:2008-03-05

    IPC分类号: G10L15/18

    CPC分类号: G10L15/30 G10L15/19

    摘要: A method for processing speech audio in a network connected client device can include selecting a speech grammar for use in a speech recognition system in the network connected client device; characterizing the selected speech grammar; and, based on the characterization, determining whether to process the speech grammar locally in the network connected client device, or remotely in a speech server in the network. In one aspect of the invention, the selecting step can include establishing a communications session with a speech server; and, querying the speech server for a speech grammar over the established communications session. Additionally, the selecting step can further include registering the speech grammar in the speech recognition system.

    摘要翻译: 一种用于在网络连接的客户端设备中处理语音的方法可以包括:选择在网络连接的客户端设备中的语音识别系统中使用的语音语法; 表征所选语音语法; 并且基于表征,确定是否在网络连接的客户端设备中本地处理语音语法,或者在网络中的语音服务器中进行远程处理。 在本发明的一个方面,选择步骤可以包括建立与语音服务器的通信会话; 并且通过所建立的通信会话向语音服务器询问语音语法。 此外,选择步骤还可以包括在语音识别系统中登记语音语法。

    Selective enablement of speech recognition grammars
    2.
    发明授权
    Selective enablement of speech recognition grammars 有权
    语音识别语法的选择性启用

    公开(公告)号:US07610204B2

    公开(公告)日:2009-10-27

    申请号:US12042968

    申请日:2008-03-05

    IPC分类号: G10L11/00

    CPC分类号: G10L15/30 G10L15/19

    摘要: A method for processing speech audio in a network connected client device can include selecting a speech grammar for use in a speech recognition system in the network connected client device; characterizing the selected speech grammar; and, based on the characterization, determining whether to process the speech grammar locally in the network connected client device, or remotely in a speech server in the network. In one aspect of the invention, the selecting step can include establishing a communications session with a speech server; and, querying the speech server for a speech grammar over the established communications session. Additionally, the selecting step can further include registering the speech grammar in the speech recognition system.

    摘要翻译: 一种用于在网络连接的客户端设备中处理语音的方法可以包括:选择在网络连接的客户端设备中的语音识别系统中使用的语音语法; 表征所选语音语法; 并且基于表征,确定是否在网络连接的客户端设备中本地处理语音语法,或者在网络中的语音服务器中进行远程处理。 在本发明的一个方面,选择步骤可以包括建立与语音服务器的通信会话; 并且通过所建立的通信会话向语音服务器询问语音语法。 此外,选择步骤还可以包括在语音识别系统中登记语音语法。

    Overriding default speech processing behavior using a default focus receiver
    3.
    发明申请
    Overriding default speech processing behavior using a default focus receiver 有权
    使用默认焦点接收器覆盖默认语音处理行为

    公开(公告)号:US20070038462A1

    公开(公告)日:2007-02-15

    申请号:US11201003

    申请日:2005-08-10

    IPC分类号: G10L21/00

    CPC分类号: G10L15/28

    摘要: A method for implementing speech focus in a speech processing system can include the step of establishing a default focus receiver as a first entity to request speech focus of a speech processing system having multiple applications that share speech resources based upon speech focus. An event occurrence can be detected. An event handler of the default speech receiver can previously define behavior for the event occurrence and where default system behavior can be implemented within the speech processing system for the event occurrence. The default system behavior can be utilized when speech focus is not assigned during the event occurrence. Responsive to the event occurrence, at least one programmatic action can be performed in accordance with machine readable instructions of the event handler. The default system behavior is not implemented responsive to the event occurrence.

    摘要翻译: 用于在语音处理系统中实现语音焦点的方法可以包括建立默认焦点接收机作为第一实体的步骤,以基于语音焦点来共享具有共享语音资源的多个应用的​​语音处理系统的语音焦点。 可以检测到事件发生。 默认语音接收器的事件处理程序可以预先定义事件发生的行为,并且在事件发生的语音处理系统内可以实现默认系统行为。 在事件发生期间未分配语音焦点时,可以使用默认系统行为。 响应于事件发生,可以根据事件处理程序的机器可读指令执行至少一个编程动作。 默认系统行为不能响应于事件发生而实现。

    SELECTIVE ENABLEMENT OF SPEECH RECOGNITION GRAMMARS
    4.
    发明申请
    SELECTIVE ENABLEMENT OF SPEECH RECOGNITION GRAMMARS 有权
    选择性认知语音识别格式

    公开(公告)号:US20100049521A1

    公开(公告)日:2010-02-25

    申请号:US12605704

    申请日:2009-10-26

    IPC分类号: G10L15/18

    CPC分类号: G10L15/30 G10L15/19

    摘要: A method for processing speech audio in a network connected client device can include selecting a speech grammar for use in a speech recognition system in the network connected client device; characterizing the selected speech grammar; and, based on the characterization, determining whether to process the speech grammar locally in the network connected client device, or remotely in a speech server in the network. In one aspect of the invention, the selecting step can include establishing a communications session with a speech server; and, querying the speech server for a speech grammar over the established communications session. Additionally, the selecting step can further include registering the speech grammar in the speech recognition system.

    摘要翻译: 一种用于在网络连接的客户端设备中处理语音的方法可以包括:选择在网络连接的客户端设备中的语音识别系统中使用的语音语法; 表征所选语音语法; 并且基于表征,确定是否在网络连接的客户端设备中本地处理语音语法,或者在网络中的语音服务器中进行远程处理。 在本发明的一个方面,选择步骤可以包括建立与语音服务器的通信会话; 并且通过所建立的通信会话向语音服务器询问语音语法。 此外,选择步骤还可以包括在语音识别系统中登记语音语法。

    Supporting multiple speech enabled user interface consoles within a motor vehicle
    5.
    发明申请
    Supporting multiple speech enabled user interface consoles within a motor vehicle 有权
    支持机动车辆中多语音使能的用户界面控制台

    公开(公告)号:US20070038461A1

    公开(公告)日:2007-02-15

    申请号:US11200811

    申请日:2005-08-10

    IPC分类号: G10L21/00

    CPC分类号: G10L15/30

    摘要: An in-vehicle system that shares speech processing resources among multiple applications located within a vehicle. The system can include one or more software applications, each associated with different functionally independent in-vehicle consoles. Each application can have a console specific user interface. The system can also include a single in-vehicle speech processing system implemented separately from the in-vehicle consoles. The speech processing system can execute speech processing tasks responsive to requests received from the applications. That is, the in-vehicle speech processing system can provide speech processing capabilities for the applications. The provided speech processing capabilities can include text-to-speech capabilities and speech recognition capabilities.

    摘要翻译: 一种在位于车辆内的多个应用中共享语音处理资源的车载系统。 该系统可以包括一个或多个软件应用程序,每个软件应用程序与不同的功能上独立的车载控制台相关联。 每个应用程序都可以有一个控制台专用的用户界面。 该系统还可以包括与车载控制台分开实现的单个车载语音处理系统。 语音处理系统可以响应于从应用接收到的请求来执行语音处理任务。 也就是说,车载语音处理系统可以为应用提供语音处理能力。 所提供的语音处理能力可以包括文本到语音能力和语音识别能力。

    Method and system for improved speech recognition by degrading utterance pronunciations
    6.
    发明申请
    Method and system for improved speech recognition by degrading utterance pronunciations 有权
    通过降低语音发音来改善语音识别的方法和系统

    公开(公告)号:US20070038454A1

    公开(公告)日:2007-02-15

    申请号:US11200810

    申请日:2005-08-10

    IPC分类号: G10L13/08

    CPC分类号: G10L15/063

    摘要: A speech recognition system (10) or method (20) can include a speech input device and a processor (14) coupled to the speech input. The processor can be programmed to identify (22) a plurality of words that are members of confusable pairs of words where each pair includes a target word and a substituted word. The processor can degrade (24) a pronunciation of the substituted word to provide a worse pronunciation of the substituted word. The processor can further compare (28) the pronunciation of the target word with the worse pronunciation to the substituted word. The processor can be further programmed to reduce (26) confusion between the substituted word and other words in a recognition grammar of the speech recognition engine and can also narrow the scope within which the substituted word is recognized.

    摘要翻译: 语音识别系统(10)或方法(20)可以包括语音输入设备和耦合到语音输入的处理器(14)。 处理器可以被编程为识别(22)多个单词,其是每个对包括目标单词和替代单词的可混淆词组的成员。 处理器可以降低(24)取代字的发音,以提供取代词的更差的发音。 处理器可以进一步将目标词的发音与较差的发音(28)进行比较(28)到替换的单词。 处理器可被进一步编程以减少(26)取代词与语音识别引擎的识别语法中的其他单词之间的混淆,并且还可以缩小识别取代词的范围。

    Connecting and optimizing audio input devices
    7.
    发明授权
    Connecting and optimizing audio input devices 有权
    连接和优化音频输入设备

    公开(公告)号:US06492999B1

    公开(公告)日:2002-12-10

    申请号:US09257671

    申请日:1999-02-25

    IPC分类号: G09G500

    CPC分类号: G06F3/16 H04R5/00

    摘要: A method for connecting and optimizing audio input devices, comprises the steps of: determining an audio input type; generating a first GUI display screen for prompting and enabling user selection of an audio input device; generating a second GUI display screen for prompting and enabling user connection of the audio input device; testing the connected audio input device; configuring audio settings of the connected audio input device; and, storing for later retrieval an association of the connected audio input device and the configured audio settings. The audio settings are configured and the association is stored only if the testing step is successful. The second GUI display screen can include a device specific image and device specific instructions.

    摘要翻译: 一种用于连接和优化音频输入设备的方法,包括以下步骤:确定音频输入类型; 生成第一GUI显示屏幕,用于提示和启用用户对音频输入设备的选择; 生成第二GUI显示屏幕,用于提示和使得用户连接音频输入设备; 测试连接的音频输入设备; 配置连接的音频输入设备的音频设置; 并且存储用于稍后检索所连接的音频输入设备和配置的音频设置的关联。 配置音频设置,只有测试步骤成功,才能存储关联。 第二GUI显示屏幕可以包括设备特定图像和设备特定指令。

    Maintaining input device identity
    9.
    发明授权
    Maintaining input device identity 有权
    维护输入设备标识

    公开(公告)号:US06275805B1

    公开(公告)日:2001-08-14

    申请号:US09257673

    申请日:1999-02-25

    IPC分类号: G10L1100

    CPC分类号: G10L15/07 G10L15/26

    摘要: A method for maintaining input device identity in a speech application, comprising the steps of: storing a plurality of enrollments, each of the enrollments representing a speech file of training data associated with at least one of a specific audio input device and a specific audio environment for a specific user; generating a graphical user interface (GUI) display screen for prompting and enabling user selection of at least one of an audio input device and an audio environment; and, retrieving one of the enrollments responsive to the user selection, for use in a dictation or transcription session.

    摘要翻译: 一种用于在语音应用中维持输入设备身份的方法,包括以下步骤:存储多个注册,每个登记表示与特定音频输入设备和特定音频环境中的至少一个相关联的训练数据的语音文件 对于特定用户; 生成图形用户界面(GUI)显示屏幕,用于提示和启用用户选择音频输入设备和音频环境中的至少一个; 以及响应于用户选择检索其中一个注册,以用于听写或录音会话。