Utterance processing for network-based speech recognition utilizing a client-side cache
    1.
    发明授权
    Utterance processing for network-based speech recognition utilizing a client-side cache 有权
    使用客户端缓存的基于网络的语音识别的语音处理

    公开(公告)号:US08224644B2

    公开(公告)日:2012-07-17

    申请号:US12337810

    申请日:2008-12-18

    IPC分类号: G10L15/00

    摘要: Embodiments are provided for utilizing a client-side cache for utterance processing to facilitate network based speech recognition. An utterance comprising a query is received in a client computing device. The query is sent from the client to a network server for results processing. The utterance is processed to determine a speech profile. A cache lookup is performed based on the speech profile to determine whether results data for the query is stored in the cache. If the results data is stored in the cache, then a query is sent to cancel the results processing on the network server and the cached results data is displayed on the client computing device.

    摘要翻译: 提供实施例用于利用客户端缓存进行话语处理以促进基于网络的语音识别。 在客户端计算设备中接收到包含查询的话语。 查询从客户端发送到网络服务器进行结果处理。 处理话语以确定语音简档。 基于语音简档执行高速缓存查找,以确定查询的结果数据是否存储在高速缓存中。 如果结果数据存储在缓存中,则发送查询以取消网络服务器上的结果处理,并且缓存的结果数据显示在客户端计算设备上。

    Utterance Processing For Network-Based Speech Recognition Utilizing A Client-Side Cache
    2.
    发明申请
    Utterance Processing For Network-Based Speech Recognition Utilizing A Client-Side Cache 有权
    用于基于网络的语音识别利用客户端缓存的方法处理

    公开(公告)号:US20100161328A1

    公开(公告)日:2010-06-24

    申请号:US12337810

    申请日:2008-12-18

    IPC分类号: G10L15/00

    摘要: Embodiments are provided for utilizing a client-side cache for utterance processing to facilitate network based speech recognition. An utterance comprising a query is received in a client computing device. The query is sent from the client to a network server for results processing. The utterance is processed to determine a speech profile. A cache lookup is performed based on the speech profile to determine whether results data for the query is stored in the cache. If the results data is stored in the cache, then a query is sent to cancel the results processing on the network server and the cached results data is displayed on the client computing device.

    摘要翻译: 提供实施例用于利用客户端缓存进行话语处理以促进基于网络的语音识别。 在客户端计算设备中接收到包含查询的话语。 查询从客户端发送到网络服务器进行结果处理。 处理话语以确定语音简档。 基于语音简档执行高速缓存查找,以确定查询的结果数据是否存储在高速缓存中。 如果结果数据存储在缓存中,则发送查询以取消网络服务器上的结果处理,并且缓存的结果数据显示在客户端计算设备上。

    Combination and federation of local and remote speech recognition
    3.
    发明授权
    Combination and federation of local and remote speech recognition 有权
    本地和远程语音识别的组合和联合

    公开(公告)号:US08892439B2

    公开(公告)日:2014-11-18

    申请号:US12503191

    申请日:2009-07-15

    IPC分类号: G10L15/18 G10L15/30

    CPC分类号: G10L15/30 G10L2015/221

    摘要: Techniques to provide automatic speech recognition at a local device are described. An apparatus may include an audio input to receive audio data indicating a task. The apparatus may further include a local recognizer component to receive the audio data, to pass the audio data to a remote recognizer while receiving the audio data, and to recognize speech from the audio data. The apparatus may further include a federation component operative to receive one or more recognition results from the local recognizer and/or the remote recognizer, and to federate a plurality of recognition results to produce a most likely result. The apparatus may further include an application to perform the task indicated by the most likely result. Other embodiments are described and claimed.

    摘要翻译: 描述在本地设备处提供自动语音识别的技术。 装置可以包括用于接收指示任务的音频数据的音频输入。 该装置还可以包括接收音频数据的局部识别器组件,以便在接收音频数据的同时将音频数据传送到远程识别器,并从音频数据识别语音。 该装置还可以包括联合组件,用于从本地识别器和/或远程识别器接收一个或多个识别结果,并联合多个识别结果以产生最可能的结果。 该装置还可以包括用于执行由最可能的结果指示的任务的应用。 描述和要求保护其他实施例。

    Speech Recognition Disambiguation on Mobile Devices
    5.
    发明申请
    Speech Recognition Disambiguation on Mobile Devices 有权
    移动设备语音识别消歧

    公开(公告)号:US20090234647A1

    公开(公告)日:2009-09-17

    申请号:US12049243

    申请日:2008-03-14

    IPC分类号: G10L15/26

    CPC分类号: G10L15/30

    摘要: A method, program storage device and mobile device provide speech disambiguation. Audio for speech recognition processing is transmitted by the mobile device. Results representing alternates identified to match the transmitted audio are received. The alternates are displayed in a disambiguation dialog screen for making corrections to the alternates. Corrections are made to the alternates using the disambiguation dialog screen until a correct result is displayed. The correct result is selected. Content associated with the selected correct result is received in parallel with the receiving of the results representing alternates identified to match the transmitted audio.

    摘要翻译: 一种方法,程序存储设备和移动设备提供语音消歧。 用于语音识别处理的音频由移动设备发送。 接收到表示与发送音频相匹配的候补的结果。 替代物显示在消歧对话屏幕中,用于对替代物进行更正。 使用消歧对话框屏幕对替代品进行更正,直到显示正确的结果。 选择正确的结果。 与所选择的正确结果相关联的内容与接收到表示被识别为匹配所发送的音频的替代的结果并行地接收。

    Centralized method and system for clarifying voice commands
    7.
    发明授权
    Centralized method and system for clarifying voice commands 有权
    用于澄清语音命令的集中方法和系统

    公开(公告)号:US08942985B2

    公开(公告)日:2015-01-27

    申请号:US10990345

    申请日:2004-11-16

    IPC分类号: G10L21/00 G10L25/00 G06F3/16

    摘要: A method and system for facilitating centralized interaction with a user includes providing a recognized voice command to a plurality of application modules. A plurality of interpretations of the voice command are generated by at least one of the plurality of application modules. A centralized interface module visually renders the plurality of interpretations of the voice command on a centralized display. An indication of selection of an interpretation is received from the user.

    摘要翻译: 用于促进与用户的集中交互的方法和系统包括向多个应用模块提供识别的语音命令。 语音命令的多个解释由多个应用模块中的至少一个生成。 集中式界面模块在视觉上呈现了集中显示上的语音命令的多种解释。 从用户接收到对解释的选择的指示。

    Recognizing multiple semantic items from single utterance
    8.
    发明授权
    Recognizing multiple semantic items from single utterance 有权
    从单一语音识别多个语义项

    公开(公告)号:US08725492B2

    公开(公告)日:2014-05-13

    申请号:US12042460

    申请日:2008-03-05

    IPC分类号: G06F17/28

    CPC分类号: G10L15/1815

    摘要: Semantically distinct items are extracted from a single utterance by repeatedly recognizing the same utterance using constraints provided by semantic items already recognized. User feedback for selection or correction of partially recognized utterance may be used in a hierarchical, multi-modal, or single step manner. An accuracy of recognition is preserved while the less structured and more natural single utterance recognition form is allowed to be used.

    摘要翻译: 通过使用已经识别的语义项提供的约束重复地识别相同的话语,从单个话语中提取语义上不同的项目。 用于部分识别的话语的选择或校正的用户反馈可以以分层,多模式或单步的方式使用。 识别的准确性得到保留,而较少结构化和更自然的单个话语识别形式被允许使用。

    COMBINATION AND FEDERATION OF LOCAL AND REMOTE SPEECH RECOGNITION
    9.
    发明申请
    COMBINATION AND FEDERATION OF LOCAL AND REMOTE SPEECH RECOGNITION 有权
    当地和远程语音识别的组合与联合

    公开(公告)号:US20110015928A1

    公开(公告)日:2011-01-20

    申请号:US12503191

    申请日:2009-07-15

    IPC分类号: G10L15/18 G10L15/00

    CPC分类号: G10L15/30 G10L2015/221

    摘要: Techniques to provide automatic speech recognition at a local device are described. An apparatus may include an audio input to receive audio data indicating a task. The apparatus may further include a local recognizer component to receive the audio data, to pass the audio data to a remote recognizer while receiving the audio data, and to recognize speech from the audio data. The apparatus may further include a federation component operative to receive one or more recognition results from the local recognizer and/or the remote recognizer, and to federate a plurality of recognition results to produce a most likely result. The apparatus may further include an application to perform the task indicated by the most likely result. Other embodiments are described and claimed.

    摘要翻译: 描述在本地设备处提供自动语音识别的技术。 装置可以包括用于接收指示任务的音频数据的音频输入。 该装置还可以包括接收音频数据的局部识别器组件,以便在接收音频数据的同时将音频数据传送到远程识别器,并从音频数据识别语音。 该装置还可以包括联合组件,用于从本地识别器和/或远程识别器接收一个或多个识别结果,并联合多个识别结果以产生最可能的结果。 该装置还可以包括用于执行由最可能的结果指示的任务的应用。 描述和要求保护其他实施例。