In-band signaling in interactive communications
    1.
    Granted patent (in force)

    Publication No.: US08532269B2

    Publication date: 2013-09-10

    Application No.: US12354799

    Filing date: 2009-01-16

    IPC classification: H04M1/64 G10L11/00

    Abstract: Architecture that employs a combination of in-band signaling (e.g., DTMF) with speech recognition to deliver usability improvements. The in-band signaling allows the user to indicate to the system when a barge-in operation is occurring and/or when to start listening to subsequent speech input and optionally, when to stop listening for further speech input. The in-band signaling can be utilized during a telephone call and using wireline and wireless telephones. Moreover, the architecture can be incorporated at the platform level requiring little, if any, application changes to support the new mode of operation.
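    The gating behavior this abstract describes can be pictured with a small state machine. The sketch below is only an illustration under stated assumptions, not the patented implementation: the class name `GatedListener`, the key assignments (`#` to barge in and start listening, `*` to stop), and the use of plain strings as audio frames are all hypothetical.

```python
from dataclasses import dataclass, field

# Assumed key assignments; the abstract names DTMF but no specific keys.
START_KEY = "#"
STOP_KEY = "*"


@dataclass
class GatedListener:
    """Toy model: DTMF keys decide when speech frames reach the recognizer."""
    prompt_playing: bool = True
    listening: bool = False
    captured: list = field(default_factory=list)
    utterances: list = field(default_factory=list)

    def on_dtmf(self, key: str) -> None:
        if key == START_KEY:
            # Barge-in: cut the prompt and start collecting speech.
            self.prompt_playing = False
            self.listening = True
            self.captured = []
        elif key == STOP_KEY and self.listening:
            # Optional stop signal: close the utterance and hand it off.
            self.listening = False
            self.utterances.append(self.captured)
            self.captured = []

    def on_audio(self, frame: str) -> None:
        # Frames outside a start/stop window are ignored.
        if self.listening:
            self.captured.append(frame)
```

    Because the gating sits in front of the recognizer, an application that only consumes `utterances` would not need to change, which is one way to read the abstract's point about platform-level integration.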


    IN-BAND SIGNALING IN INTERACTIVE COMMUNICATIONS
    2.
    Patent application (in force)

    Publication No.: US20100183126A1

    Publication date: 2010-07-22

    Application No.: US12354799

    Filing date: 2009-01-16

    IPC classification: H04M1/64 G10L21/02

    Abstract: Architecture that employs a combination of in-band signaling (e.g., DTMF) with speech recognition to deliver usability improvements. The in-band signaling allows the user to indicate to the system when a barge-in operation is occurring and/or when to start listening to subsequent speech input and optionally, when to stop listening for further speech input. The in-band signaling can be utilized during a telephone call and using wireline and wireless telephones. Moreover, the architecture can be incorporated at the platform level requiring little, if any, application changes to support the new mode of operation.


    Speech Recognition Disambiguation on Mobile Devices
    3.
    Patent application (in force)

    Publication No.: US20090234647A1

    Publication date: 2009-09-17

    Application No.: US12049243

    Filing date: 2008-03-14

    IPC classification: G10L15/26

    CPC classification: G10L15/30

    Abstract: A method, program storage device and mobile device provide speech disambiguation. Audio for speech recognition processing is transmitted by the mobile device. Results representing alternates identified to match the transmitted audio are received. The alternates are displayed in a disambiguation dialog screen for making corrections to the alternates. Corrections are made to the alternates using the disambiguation dialog screen until a correct result is displayed. The correct result is selected. Content associated with the selected correct result is received in parallel with the receiving of the results representing alternates identified to match the transmitted audio.


    Speech recognition disambiguation on mobile devices
    4.
    Granted patent (in force)

    Publication No.: US08224656B2

    Publication date: 2012-07-17

    Application No.: US12049243

    Filing date: 2008-03-14

    IPC classification: G10L21/06

    CPC classification: G10L15/30

    Abstract: A method, program storage device and mobile device provide speech disambiguation. Audio for speech recognition processing is transmitted by the mobile device. Results representing alternates identified to match the transmitted audio are received. The alternates are displayed in a disambiguation dialog screen for making corrections to the alternates. Corrections are made to the alternates using the disambiguation dialog screen until a correct result is displayed. The correct result is selected. Content associated with the selected correct result is received in parallel with the receiving of the results representing alternates identified to match the transmitted audio.


    Combination and federation of local and remote speech recognition
    5.
    Granted patent (in force)

    Publication No.: US08892439B2

    Publication date: 2014-11-18

    Application No.: US12503191

    Filing date: 2009-07-15

    IPC classification: G10L15/18 G10L15/30

    CPC classification: G10L15/30 G10L2015/221

    Abstract: Techniques to provide automatic speech recognition at a local device are described. An apparatus may include an audio input to receive audio data indicating a task. The apparatus may further include a local recognizer component to receive the audio data, to pass the audio data to a remote recognizer while receiving the audio data, and to recognize speech from the audio data. The apparatus may further include a federation component operative to receive one or more recognition results from the local recognizer and/or the remote recognizer, and to federate a plurality of recognition results to produce a most likely result. The apparatus may further include an application to perform the task indicated by the most likely result. Other embodiments are described and claimed.
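    As a rough illustration of the federation step, the sketch below merges results from a local and a remote recognizer and returns the most likely one. The abstract does not say how results are compared, so everything here is an assumption: the `Result` type, the idea that confidence scores from both recognizers are directly comparable, and the tie-break in favor of the local result are all hypothetical.

```python
from typing import Iterable, NamedTuple, Optional


class Result(NamedTuple):
    text: str          # recognized phrase
    confidence: float  # assumed comparable across recognizers
    source: str        # "local" or "remote"


def federate(results: Iterable[Result]) -> Optional[Result]:
    """Pick the most likely result; on equal confidence prefer the local one.

    Returns None when neither recognizer produced a result.
    """
    return max(
        results,
        key=lambda r: (r.confidence, r.source == "local"),
        default=None,
    )
```

    For example, `federate([Result("call mom", 0.6, "local"), Result("call tom", 0.8, "remote")])` selects the remote result because its score is higher. A real system would probably need to calibrate scores across recognizers before comparing them.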


    Centralized method and system for clarifying voice commands
    8.
    Granted patent (in force)

    Publication No.: US08942985B2

    Publication date: 2015-01-27

    Application No.: US10990345

    Filing date: 2004-11-16

    IPC classification: G10L21/00 G10L25/00 G06F3/16

    Abstract: A method and system for facilitating centralized interaction with a user includes providing a recognized voice command to a plurality of application modules. A plurality of interpretations of the voice command are generated by at least one of the plurality of application modules. A centralized interface module visually renders the plurality of interpretations of the voice command on a centralized display. An indication of selection of an interpretation is received from the user.


    Recognizing multiple semantic items from single utterance
    9.
    Granted patent (in force)

    Publication No.: US08725492B2

    Publication date: 2014-05-13

    Application No.: US12042460

    Filing date: 2008-03-05

    IPC classification: G06F17/28

    CPC classification: G10L15/1815

    Abstract: Semantically distinct items are extracted from a single utterance by repeatedly recognizing the same utterance using constraints provided by semantic items already recognized. User feedback for selection or correction of partially recognized utterance may be used in a hierarchical, multi-modal, or single step manner. An accuracy of recognition is preserved while the less structured and more natural single utterance recognition form is allowed to be used.
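    The repeated, constrained recognition described in this abstract can be sketched with a toy example. This is only an illustration under assumptions: text stands in for audio, substring matching stands in for a real recognizer pass, and the city and street data, `recognize`, and `extract_items` are all hypothetical names invented for the sketch.

```python
# Hypothetical grammar data: which streets are valid in which city.
STREETS_BY_CITY = {
    "seattle": {"pine street", "pike street"},
    "portland": {"pine street", "burnside street"},
}


def recognize(utterance, vocabulary):
    """Toy recognizer pass: return the first vocabulary phrase (in sorted
    order) that occurs in the utterance, or None if nothing matches."""
    for phrase in sorted(vocabulary):
        if phrase in utterance:
            return phrase
    return None


def extract_items(utterance):
    """Recognize the same utterance twice; the second pass is constrained
    by the item found in the first pass."""
    # Pass 1: find the city using the full list of cities.
    city = recognize(utterance, STREETS_BY_CITY.keys())
    if city is None:
        return None
    # Pass 2: re-run on the same utterance, restricted to that city's streets.
    street = recognize(utterance, STREETS_BY_CITY[city])
    return {"city": city, "street": street}
```

    The second pass searches a much smaller vocabulary than an unconstrained street recognizer would, which is the intuition behind preserving accuracy while still accepting a single natural utterance.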


    COMBINATION AND FEDERATION OF LOCAL AND REMOTE SPEECH RECOGNITION
    10.
    Patent application (in force)

    Publication No.: US20110015928A1

    Publication date: 2011-01-20

    Application No.: US12503191

    Filing date: 2009-07-15

    IPC classification: G10L15/18 G10L15/00

    CPC classification: G10L15/30 G10L2015/221

    Abstract: Techniques to provide automatic speech recognition at a local device are described. An apparatus may include an audio input to receive audio data indicating a task. The apparatus may further include a local recognizer component to receive the audio data, to pass the audio data to a remote recognizer while receiving the audio data, and to recognize speech from the audio data. The apparatus may further include a federation component operative to receive one or more recognition results from the local recognizer and/or the remote recognizer, and to federate a plurality of recognition results to produce a most likely result. The apparatus may further include an application to perform the task indicated by the most likely result. Other embodiments are described and claimed.
