-
公开(公告)号:US08532269B2
公开(公告)日:2013-09-10
申请号:US12354799
申请日:2009-01-16
摘要: Architecture that employs a combination of in-band signaling (e.g., DTMF) with speech recognition to deliver usability improvements. The in-band signaling allows the user to indicate to the system when a barge-in operation is occurring and/or when to start listening to subsequent speech input and optionally, when to stop listening for further speech input. The in-band signaling can be utilized during a telephone call and using wireline and wireless telephones. Moreover, the architecture can be incorporated at the platform level requiring little, if any, application changes to support the new mode of operation.
摘要翻译: 采用带内信令(例如,DTMF)与语音识别的组合来提供可用性改进的架构。 带内信令允许用户在发生插入操作和/或什么时候开始收听随后的语音输入时向系统指示,并且可选地,何时停止收听进一步的语音输入。 可以在电话呼叫期间使用带内信令,并使用有线和无线电话。 此外,架构可以并入平台级别,需要很少(如果有的话)应用程序更改以支持新的操作模式。
-
公开(公告)号:US20100183126A1
公开(公告)日:2010-07-22
申请号:US12354799
申请日:2009-01-16
摘要: Architecture that employs a combination of in-band signaling (e.g., DTMF) with speech recognition to deliver usability improvements. The in-band signaling allows the user to indicate to the system when a barge-in operation is occurring and/or when to start listening to subsequent speech input and optionally, when to stop listening for further speech input. The in-band signaling can be utilized during a telephone call and using wireline and wireless telephones. Moreover, the architecture can be incorporated at the platform level requiring little, if any, application changes to support the new mode of operation.
摘要翻译: 采用带内信令(例如,DTMF)与语音识别的组合来提供可用性改进的架构。 带内信令允许用户在发生插入操作和/或什么时候开始收听随后的语音输入时向系统指示,并且可选地,何时停止收听进一步的语音输入。 可以在电话呼叫期间使用带内信令,并使用有线和无线电话。 此外,架构可以并入平台级别,需要很少(如果有的话)应用程序更改以支持新的操作模式。
-
公开(公告)号:US20090234647A1
公开(公告)日:2009-09-17
申请号:US12049243
申请日:2008-03-14
IPC分类号: G10L15/26
CPC分类号: G10L15/30
摘要: A method, program storage device and mobile device provide speech disambiguation. Audio for speech recognition processing is transmitted by the mobile device. Results representing alternates identified to match the transmitted audio are received. The alternates are displayed in a disambiguation dialog screen for making corrections to the alternates. Corrections are made to the alternates using the disambiguation dialog screen until a correct result is displayed. The correct result is selected. Content associated with the selected correct result is received in parallel with the receiving of the results representing alternates identified to match the transmitted audio.
摘要翻译: 一种方法,程序存储设备和移动设备提供语音消歧。 用于语音识别处理的音频由移动设备发送。 接收到表示与发送音频相匹配的候补的结果。 替代物显示在消歧对话屏幕中,用于对替代物进行更正。 使用消歧对话框屏幕对替代品进行更正,直到显示正确的结果。 选择正确的结果。 与所选择的正确结果相关联的内容与接收到表示被识别为匹配所发送的音频的替代的结果并行地接收。
-
公开(公告)号:US08224656B2
公开(公告)日:2012-07-17
申请号:US12049243
申请日:2008-03-14
IPC分类号: G10L21/06
CPC分类号: G10L15/30
摘要: A method, program storage device and mobile device provide speech disambiguation. Audio for speech recognition processing is transmitted by the mobile device. Results representing alternates identified to match the transmitted audio are received. The alternates are displayed in a disambiguation dialog screen for making corrections to the alternates. Corrections are made to the alternates using the disambiguation dialog screen until a correct result is displayed. The correct result is selected. Content associated with the selected correct result is received in parallel with the receiving of the results representing alternates identified to match the transmitted audio.
摘要翻译: 一种方法,程序存储设备和移动设备提供语音消歧。 用于语音识别处理的音频由移动设备发送。 接收到表示与发送音频相匹配的候补的结果。 替代物显示在消歧对话屏幕中,用于对替代物进行更正。 使用消歧对话框屏幕对替代品进行更正,直到显示正确的结果。 选择正确的结果。 与所选择的正确结果相关联的内容与接收到表示被识别为匹配所发送的音频的替代的结果并行地接收。
-
公开(公告)号:US08892439B2
公开(公告)日:2014-11-18
申请号:US12503191
申请日:2009-07-15
CPC分类号: G10L15/30 , G10L2015/221
摘要: Techniques to provide automatic speech recognition at a local device are described. An apparatus may include an audio input to receive audio data indicating a task. The apparatus may further include a local recognizer component to receive the audio data, to pass the audio data to a remote recognizer while receiving the audio data, and to recognize speech from the audio data. The apparatus may further include a federation component operative to receive one or more recognition results from the local recognizer and/or the remote recognizer, and to federate a plurality of recognition results to produce a most likely result. The apparatus may further include an application to perform the task indicated by the most likely result. Other embodiments are described and claimed.
摘要翻译: 描述在本地设备处提供自动语音识别的技术。 装置可以包括用于接收指示任务的音频数据的音频输入。 该装置还可以包括接收音频数据的局部识别器组件,以便在接收音频数据的同时将音频数据传送到远程识别器,并从音频数据识别语音。 该装置还可以包括联合组件,用于从本地识别器和/或远程识别器接收一个或多个识别结果,并联合多个识别结果以产生最可能的结果。 该装置还可以包括用于执行由最可能的结果指示的任务的应用。 描述和要求保护其他实施例。
-
公开(公告)号:US08473295B2
公开(公告)日:2013-06-25
申请号:US11255329
申请日:2005-10-21
申请人: David Mowatt , Robert E. Dewar , Robert L. Chambers , Felix Gerard Torquil Ifor Andrew , Oliver Scholz
发明人: David Mowatt , Robert E. Dewar , Robert L. Chambers , Felix Gerard Torquil Ifor Andrew , Oliver Scholz
CPC分类号: G10L15/22
摘要: Upon selection of a displayed word, a list of alternatives for the selected word is displayed. Each alternative in the list has an associated symbol. A speech signal is then decoded to identify a list of possible words and the list of possible words is displayed with each possible word having an associated symbol.
摘要翻译: 在选择所显示的单词时,显示所选择的单词的替换列表。 列表中的每个替代项都有一个关联的符号。 然后解码语音信号以识别可能的词的列表,并且显示可能的词的列表,其中每个可能的单词具有相关联的符号。
-
公开(公告)号:US07457821B2
公开(公告)日:2008-11-25
申请号:US11180147
申请日:2005-07-13
IPC分类号: G06F17/00
CPC分类号: H04L67/2804 , G06F9/4488 , G06F9/465 , G10L15/197 , G10L15/26 , G10L15/28 , H04L29/06027 , H04L67/2819 , H04L67/2842 , Y10S707/99932 , Y10S707/99933 , Y10S707/99942 , Y10S707/99943 , Y10S707/99944 , Y10S707/99945
摘要: The present invention provides a method and computer-readable medium for searching for programming objects on a computer system. Under one aspect of the invention, optional search attributes are used to order a list of references to found programming objects. Under a second aspect of the invention, object attributes that are stored outside of a static attribute storage area are inspected during the search for programming objects. Under a third aspect of the invention, different sets of object data are allowed to reference the same programming object class, and different objects of a single programming object class may be initialized in different ways so that they exhibit different attributes.
摘要翻译: 本发明提供了一种用于在计算机系统上搜索编程对象的方法和计算机可读介质。 在本发明的一个方面,可选的搜索属性用于对所发现的编程对象的引用的列表进行排序。 在本发明的第二方面,在搜索编程对象期间检查存储在静态属性存储区域之外的对象属性。 在本发明的第三方面,允许不同的对象数据集引用相同的编程对象类,并且可以以不同的方式初始化单个编程对象类的不同对象,使得它们呈现不同的属性。
-
公开(公告)号:US08942985B2
公开(公告)日:2015-01-27
申请号:US10990345
申请日:2004-11-16
CPC分类号: G10L15/22 , G06F3/167 , G10L15/18 , G10L2015/223 , G10L2015/228
摘要: A method and system for facilitating centralized interaction with a user includes providing a recognized voice command to a plurality of application modules. A plurality of interpretations of the voice command are generated by at least one of the plurality of application modules. A centralized interface module visually renders the plurality of interpretations of the voice command on a centralized display. An indication of selection of an interpretation is received from the user.
摘要翻译: 用于促进与用户的集中交互的方法和系统包括向多个应用模块提供识别的语音命令。 语音命令的多个解释由多个应用模块中的至少一个生成。 集中式界面模块在视觉上呈现了集中显示上的语音命令的多种解释。 从用户接收到对解释的选择的指示。
-
公开(公告)号:US08725492B2
公开(公告)日:2014-05-13
申请号:US12042460
申请日:2008-03-05
IPC分类号: G06F17/28
CPC分类号: G10L15/1815
摘要: Semantically distinct items are extracted from a single utterance by repeatedly recognizing the same utterance using constraints provided by semantic items already recognized. User feedback for selection or correction of partially recognized utterance may be used in a hierarchical, multi-modal, or single step manner. An accuracy of recognition is preserved while the less structured and more natural single utterance recognition form is allowed to be used.
摘要翻译: 通过使用已经识别的语义项提供的约束重复地识别相同的话语,从单个话语中提取语义上不同的项目。 用于部分识别的话语的选择或校正的用户反馈可以以分层,多模式或单步的方式使用。 识别的准确性得到保留,而较少结构化和更自然的单个话语识别形式被允许使用。
-
公开(公告)号:US20110015928A1
公开(公告)日:2011-01-20
申请号:US12503191
申请日:2009-07-15
CPC分类号: G10L15/30 , G10L2015/221
摘要: Techniques to provide automatic speech recognition at a local device are described. An apparatus may include an audio input to receive audio data indicating a task. The apparatus may further include a local recognizer component to receive the audio data, to pass the audio data to a remote recognizer while receiving the audio data, and to recognize speech from the audio data. The apparatus may further include a federation component operative to receive one or more recognition results from the local recognizer and/or the remote recognizer, and to federate a plurality of recognition results to produce a most likely result. The apparatus may further include an application to perform the task indicated by the most likely result. Other embodiments are described and claimed.
摘要翻译: 描述在本地设备处提供自动语音识别的技术。 装置可以包括用于接收指示任务的音频数据的音频输入。 该装置还可以包括接收音频数据的局部识别器组件,以便在接收音频数据的同时将音频数据传送到远程识别器,并从音频数据识别语音。 该装置还可以包括联合组件,用于从本地识别器和/或远程识别器接收一个或多个识别结果,并联合多个识别结果以产生最可能的结果。 该装置还可以包括用于执行由最可能的结果指示的任务的应用。 描述和要求保护其他实施例。
-
-
-
-
-
-
-
-
-