Overriding default speech processing behavior using a default focus receiver
    2.
    发明授权
    Overriding default speech processing behavior using a default focus receiver 有权
    使用默认焦点接收器覆盖默认语音处理行为

    公开(公告)号:US07848928B2

    公开(公告)日:2010-12-07

    申请号:US11201003

    申请日:2005-08-10

    IPC分类号: G10L20/00

    CPC分类号: G10L15/28

    摘要: A method for implementing speech focus in a speech processing system can include the step of establishing a default focus receiver as a first entity to request speech focus of a speech processing system having multiple applications that share speech resources based upon speech focus. An event occurrence can be detected. An event handler of the default speech receiver can previously define behavior for the event occurrence and where default system behavior can be implemented within the speech processing system for the event occurrence. The default system behavior can be utilized when speech focus is not assigned during the event occurrence. Responsive to the event occurrence, at least one programmatic action can be performed in accordance with machine readable instructions of the event handler. The default system behavior is not implemented responsive to the event occurrence.

    摘要翻译: 用于在语音处理系统中实现语音焦点的方法可以包括建立默认焦点接收机作为第一实体的步骤,以基于语音焦点来共享具有共享语音资源的多个应用的​​语音处理系统的语音焦点。 可以检测到事件发生。 默认语音接收器的事件处理程序可以预先定义事件发生的行为,并且在事件发生的语音处理系统内可以实现默认系统行为。 在事件发生期间未分配语音焦点时,可以使用默认系统行为。 响应于事件发生,可以根据事件处理程序的机器可读指令执行至少一个编程动作。 默认系统行为不能响应于事件发生而实现。

    Method of managing a speech cache
    3.
    发明授权
    Method of managing a speech cache 有权
    管理语音缓存的方法

    公开(公告)号:US06741963B1

    公开(公告)日:2004-05-25

    申请号:US09598603

    申请日:2000-06-21

    IPC分类号: G10L1500

    CPC分类号: G10L15/285 G10L15/22

    摘要: A method and a system for improving recall of speech data in a computer speech system can include a plurality of speech cache management steps including providing a speech cache, receiving a speech system input and identifying a speech event in the received speech system input, the speech event comprising speech data. Subsequently, the speech data can be compared to pre-determined speech cache entry criteria; and, if the speech data meets one of the pre-determined entry criteria, at least one entry can be added to the speech cache, the at least one entry corresponding to the speech data. Additionally, the speech data can be compared to pre-determined speech cache exit criteria; and, if the speech data meets one of the pre-determined exit criteria, at least one entry can be purged from the speech cache, the at least one entry corresponding to the speech data. The entry criteria can include frequently used speech data, recently used speech data, and important speech data. Similarly, the exit criteria can include least frequently used speech data associated with each entry in the speech cache, least recently used speech data associated with each entry in the speech cache and least important speed data associated with each entry in the speech cache.

    摘要翻译: 一种用于改善计算机语音系统中的语音数据的回忆的方法和系统可以包括多个语音高速缓存管理步骤,包括提供语音高速缓存,接收语音系统输入和识别接收到的语音系统输入中的语音事件,语音 事件包括语音数据。 随后,语音数据可以与预定的语音高速缓存入口标准进行比较; 以及如果所述语音数据满足所述预定条目标准之一,则至少一个条目可以被添加到所述语音高速缓存,所述至少一个条目对应于所述语音数据。 另外,语音数据可以与预定语音高速缓存退出标准进行比较; 并且如果语音数据满足预定的退出准则之一,则可以从语音高速缓存中清除至少一个条目,该对应于该语音数据的至少一个条目。 入门标准可以包括经常使用的语音数据,最近使用的语音数据和重要的语音数据。 类似地,退出标准可以包括与语音高速缓存中的每个条目相关联的最少频繁使用的语音数据,与语音高速缓存中的每个条目相关联的最近最少使用的语音数据以及与语音高速缓存中的每个条目相关联的最小重要速度数据。

    Method and system for improved speech recognition by degrading utterance pronunciations
    4.
    发明授权
    Method and system for improved speech recognition by degrading utterance pronunciations 有权
    通过降低语音发音来改善语音识别的方法和系统

    公开(公告)号:US07983914B2

    公开(公告)日:2011-07-19

    申请号:US11200810

    申请日:2005-08-10

    IPC分类号: G10L15/04

    CPC分类号: G10L15/063

    摘要: A speech recognition system or method can include a speech input device and a processor coupled to the speech input device. The processor can be programmed to identify a plurality of words that are members of confusable pairs of words where each pair includes a target word and a substituted word. The processor can degrade a pronunciation of the substituted word to provide a worse pronunciation of the substituted word. The processor can further compare the pronunciation of the target word with the worse pronunciation to the substituted word. The processor can be further programmed to reduce confusion between the substituted word and other words in a recognition grammar of the speech recognition engine and can also narrow the scope within which the substituted word is recognized.

    摘要翻译: 语音识别系统或方法可以包括语音输入设备和耦合到语音输入设备的处理器。 处理器可以被编程为识别作为可混淆对的成对的多个单词,其中每对单词包括目标单词和替换单词。 处理器可以降低取代词的发音,以提供取代词的更差的发音。 处理器可以进一步将目标词的发音与替代词的较差发音进行比较。 该处理器可被进一步编程以减少语音识别引擎的识别语法中的替代单词和其他单词之间的混淆,并且还可以缩小识别替代单词的范围。

    Selective enablement of speech recognition grammars

    公开(公告)号:US09196252B2

    公开(公告)日:2015-11-24

    申请号:US12605704

    申请日:2009-10-26

    IPC分类号: G10L15/30 G10L15/19

    CPC分类号: G10L15/30 G10L15/19

    摘要: A method for processing speech audio in a network connected client device can include selecting a speech grammar for use in a speech recognition system in the network connected client device; characterizing the selected speech grammar; and, based on the characterization, determining whether to process the speech grammar locally in the network connected client device, or remotely in a speech server in the network. In one aspect of the invention, the selecting step can include establishing a communications session with a speech server; and, querying the speech server for a speech grammar over the established communications session. Additionally, the selecting step can further include registering the speech grammar in the speech recognition system.

    Selective enablement of speech recognition grammars
    6.
    发明授权
    Selective enablement of speech recognition grammars 有权
    语音识别语法的选择性启用

    公开(公告)号:US07366673B2

    公开(公告)日:2008-04-29

    申请号:US09882472

    申请日:2001-06-15

    IPC分类号: G10L11/00

    CPC分类号: G10L15/30 G10L15/19

    摘要: A method for processing speech audio in a network connected client device can include selecting a speech grammar for use in a speech recognition system in the network connected client device; characterizing the selected speech grammar; and, based on the characterization, determining whether to process the speech grammar locally in the network connected client device, or remotely in a speech server in the network. Selecting can include establishing a communications session with a speech server; and, querying the speech server for a speech grammar over the established communications session. Selecting can further include registering the speech grammar in the recognition system.

    摘要翻译: 一种用于在网络连接的客户端设备中处理语音的方法可以包括:选择在网络连接的客户端设备中的语音识别系统中使用的语音语法; 表征所选语音语法; 并且基于表征,确定是否在网络连接的客户端设备中本地处理语音语法,或者在网络中的语音服务器中进行远程处理。 选择可以包括建立与语音服务器的通信会话; 并且通过所建立的通信会话向语音服务器询问语音语法。 选择还可以包括在识别系统中注册语音语法。

    Audio device characterization for accurate predictable volume control

    公开(公告)号:US06999591B2

    公开(公告)日:2006-02-14

    申请号:US09794784

    申请日:2001-02-27

    IPC分类号: H04R29/00

    CPC分类号: H03G3/3089

    摘要: An automatic gain control method in accordance with the inventive arrangements can include the following steps. Initially, an audio signal can be provided to an audio device which has a range of permissible signal level settings and a signal level controller for establishing a particular signal level setting. In addition, an actual signal level can be measured for the audio signal at an established signal level setting. The measured actual signal level further can be stored in a volume map along with the corresponding established signal level setting. Following the storage of the measured actual signal level in the volume map, a different signal level setting can be established using the signal level controller. Subsequently, the actual signal level can be re-measured and the re-measured actual signal level and corresponding established different signal level setting can be stored in the volume map. Finally, the volume map can be used during an audio processing session to determine a signal level setting for the audio device, wherein the signal level setting corresponds to a desired actual audio signal level. In one aspect of the present invention, the method can also include detecting a hysteresis condition in the volume map.

    Dynamically adjusting speech grammar weights based on usage
    8.
    发明授权
    Dynamically adjusting speech grammar weights based on usage 有权
    根据使用情况动态调整语音语法权重

    公开(公告)号:US08131548B2

    公开(公告)日:2012-03-06

    申请号:US11369092

    申请日:2006-03-06

    IPC分类号: G10L15/18 G10L15/00 G10L11/00

    摘要: A speech processing method can automatically and dynamically adjust speech grammar weights at runtime based upon usage data. Each of the speech grammar weights can be associated with an available speech command contained within a speech grammar to which the speech grammar weights apply. The usage data can indicate a relative frequency with which each of the available speech commands is utilized.

    摘要翻译: 语音处理方法可以根据使用数据在运行时自动动态调整语音语法权重。 每个语音语法权重可以与语音语法权重所适用的语音语法中包含的可用语音命令相关联。 使用数据可以指示利用每个可用语音命令的相对频率。

    Multi-action voice macro method
    9.
    发明授权
    Multi-action voice macro method 失效
    多动作语音宏方法

    公开(公告)号:US5873064A

    公开(公告)日:1999-02-16

    申请号:US746426

    申请日:1996-11-08

    CPC分类号: G06F9/45512 G06F3/16

    摘要: Method for implementing a multi-action voice macro (140) for a voice recognition navigator program (102) on a computer system. The method involves analyzing a target application program (22) to determine a plurality of target application states (24). Each of the target application states (24) is comprised of a plurality of window objects. The target application states are arranged in the form of one or more sub-context trees, with each of the sub-context trees comprised of a plurality of sub-context objects (50, 52, 54, 56, 58, 60, 62, 64, 66, 68). A set of user inputs is determined to which each of the window objects will be responsive. Each user input is assigned a corresponding voice macro (140) which simulates the user inputs in response to a spoken utterance. The voice macro (140) includes a link field (148), which identifies at least one linked macro to be executed by the navigator program (102) when a specific vocabulary phrase for the voice macro (140) is spoken by a user.

    摘要翻译: 一种用于在计算机系统上实现语音识别导航程序(102)的多动作语音宏(140)的方法。 该方法包括分析目标应用程序(22)以确定多个目标应用状态(24)。 目标应用状态(24)中的每一个由多个窗口对象组成。 目标应用状态以一个或多个子上下文树的形式排列,其中每个子上下文树由多个子上下文对象组成(50,52,54,56,58,60,62,64,64,64,64,64,64,64,64,62,64,64,64,64,62,64,64,64,64,64,64,64,64,64,62 64,66,68)。 确定一组用户输入,每个窗口对象将响应于这些用户输入。 为每个用户输入分配相应的语音宏(140),其响应于口语发音模拟用户输入。 语音宏(140)包括链接字段(148),当由用户说出语音宏(140)的特定词汇短语时,识别由导航程序(102)执行的至少一个链接的宏。

    Internal window object tree method for representing graphical user
interface applications for speech navigation
    10.
    发明授权
    Internal window object tree method for representing graphical user interface applications for speech navigation 失效
    用于表示用于语音导航的图形用户界面应用的内部窗口对象树方法

    公开(公告)号:US5864819A

    公开(公告)日:1999-01-26

    申请号:US745282

    申请日:1996-11-08

    CPC分类号: G06F3/16 G06F8/38

    摘要: Method for representing a target software application program to a voice recognition navigator program on a computer system. The method requires analyzing an application program to determine a plurality of application states. Each of the application states is defined as a set of window objects within the application for performing a specific user task. According to the invention, each of the application states is preferably represented by a sub-context tree, comprised of a plurality of sub-context objects. The tree allows the navigator to associate decoded spoken commands to specific window objects.

    摘要翻译: 用于将目标软件应用程序表示到计算机系统上的语音识别导航程序的方法。 该方法需要分析应用程序以确定多个应用状态。 每个应用程序状态被定义为用于执行特定用户任务的应用程序中的一组窗口对象。 根据本发明,每个应用状态优选地由包括多个子上下文对象的子上下文树表示。 树允许导航器将解码的口头命令与特定的窗口对象相关联。