Invoking tapered prompts in a multimodal application
    21.
    发明授权
    Invoking tapered prompts in a multimodal application 有权
    在多模式应用程序中调用渐变提示

    公开(公告)号:US08744861B2

    公开(公告)日:2014-06-03

    申请号:US13410103

    申请日:2012-03-01

    IPC分类号: G10L21/00 G10L25/00

    摘要: Methods, apparatus, and computer program products are described for invoking tapered prompts in a multimodal application implemented with a multimodal browser and a multimodal application operating on a multimodal device supporting multiple modes of user interaction with the multimodal application, the modes of user interaction including a voice mode and one or more non-voice modes. Embodiments include identifying, by a multimodal browser, a prompt element in a multimodal application; identifying, by the multimodal browser, one or more attributes associated with the prompt element; and playing a speech prompt according to the one or more attributes associated with the prompt element.

    摘要翻译: 描述了用于在多模式浏览器和多模式应用程序实现的多模式应用程序中调用渐变提示的方法,装置和计算机程序产品,该多模式应用程序在多模式设备上运行,该多模式应用程序支持与多模式应用程序的多种用户交互模式,用户交互模式包括 语音模式和一个或多个非语音模式。 实施例包括通过多模式浏览器识别多模式应用中的提示元素; 通过多模式浏览器识别与提示元素相关联的一个或多个属性; 以及根据与所述提示元素相关联的一个或多个属性播放语音提示。

    INVOKING TAPERED PROMPTS IN A MULTIMODAL APPLICATION
    22.
    发明申请
    INVOKING TAPERED PROMPTS IN A MULTIMODAL APPLICATION 有权
    在多模式应用程序中调用带宽提取

    公开(公告)号:US20120166201A1

    公开(公告)日:2012-06-28

    申请号:US13410103

    申请日:2012-03-01

    IPC分类号: G10L11/00

    摘要: Methods, apparatus, and computer program products are described for invoking tapered prompts in a multimodal application implemented with a multimodal browser and a multimodal application operating on a multimodal device supporting multiple modes of user interaction with the multimodal application, the modes of user interaction including a voice mode and one or more non-voice modes. Embodiments include identifying, by a multimodal browser, a prompt element in a multimodal application; identifying, by the multimodal browser, one or more attributes associated with the prompt element; and playing a speech prompt according to the one or more attributes associated with the prompt element.

    摘要翻译: 描述了用于在多模式浏览器和多模式应用程序实现的多模式应用程序中调用渐变提示的方法,装置和计算机程序产品,该多模式应用程序在多模式设备上运行,该多模式应用程序支持与多模式应用程序的多种用户交互模式,用户交互模式包括 语音模式和一个或多个非语音模式。 实施例包括通过多模式浏览器识别多模式应用中的提示元素; 通过多模式浏览器识别与提示元素相关联的一个或多个属性; 以及根据与所述提示元素相关联的一个或多个属性播放语音提示。

    Enabling global grammars for a particular multimodal application
    23.
    发明授权
    Enabling global grammars for a particular multimodal application 有权
    启用特定多模式应用程序的全局语法

    公开(公告)号:US07809575B2

    公开(公告)日:2010-10-05

    申请号:US11679279

    申请日:2007-02-27

    IPC分类号: G10L21/00 G10L11/00 G10L15/18

    CPC分类号: G10L15/19

    摘要: Methods, apparatus, and computer program products are described for enabling global grammars for a particular multimodal application according to the present invention by loading a multimodal web page; determining whether the loaded multimodal web page is one of a plurality of multimodal web pages of the particular multimodal application. If the loaded multimodal web page is one of the plurality of multimodal web pages of the particular multimodal application, enabling global grammars typically includes loading any currently unloaded global grammars of the particular multimodal application identified in the multimodal web page and maintaining any previously loaded global grammars. If the loaded multimodal web page is not one of the plurality of multimodal web pages of the particular multimodal application, enabling global grammars typically includes unloading any currently loaded global grammars.

    摘要翻译: 描述了方法,装置和计算机程序产品,用于通过加载多模式网页来实现根据本发明的特定多模式应用的全局语法; 确定加载的多模式网页是否是特定多模式应用的多个多模式网页之一。 如果加载的多模式网页是特定多模式应用程序的多个多模式网页之一,则启用全局语法通常包括加载在多模式网页中标识的特定多模式应用程序的任何当前未加载的全局语法,并维护任何先前加载的全局语法 。 如果加载的多模式网页不是特定多模式应用程序的多个多模式网页之一,则启用全局语法通常包括卸载任何当前加载的全局语法。

    Altering Behavior Of A Multimodal Application Based On Location
    24.
    发明申请
    Altering Behavior Of A Multimodal Application Based On Location 有权
    改变基于位置的多模态应用的行为

    公开(公告)号:US20080208593A1

    公开(公告)日:2008-08-28

    申请号:US11679301

    申请日:2007-02-27

    IPC分类号: G10L21/00

    CPC分类号: G10L15/22 G10L15/24

    摘要: Methods, apparatus, and products are disclosed for altering behavior of a multimodal application based on location. The multimodal application operates on a multimodal device supporting multiple modes of user interaction with the multimodal application, including a voice mode and one or more non-voice modes. The voice mode of user interaction with the multimodal application is supported by a voice interpreter. Altering behavior of a multimodal application based on location includes: receiving a location change notification in the voice interpreter from a device location manager, the device location manager operatively coupled to a position detection component of the multimodal device, the location change notification specifying a current location of the multimodal device; updating, by the voice interpreter, location-based environment parameters for the voice interpreter in dependence upon the current location of the multimodal device; and interpreting, by the voice interpreter, the multimodal application in dependence upon the location-based environment parameters.

    摘要翻译: 公开了基于位置改变多模式应用的行为的方法,装置和产品。 多模式应用程序在多模式设备上运行,支持与多模式应用程序的多种用户交互模式,包括语音模式和一种或多种非语音模式。 与多模式应用程序的用户交互的语音模式由语音解释器支持。 基于位置改变多模式应用的行为包括:从设备位置管理器在语音解释器中接收位置改变通知,该设备位置管理器可操作地耦合到多模态设备的位置检测组件,位置变化通知指定当前位置 的多模式设备; 语音解释器根据多模式设备的当前位置更新语音解释器的基于位置的环境参数; 并且由语音解释器根据基于位置的环境参数来解释多模式应用。

    Pausing A VoiceXML Dialog Of A Multimodal Application
    25.
    发明申请
    Pausing A VoiceXML Dialog Of A Multimodal Application 有权
    暂停多模式应用程序的VoiceXML对话框

    公开(公告)号:US20080208584A1

    公开(公告)日:2008-08-28

    申请号:US11679236

    申请日:2007-02-27

    IPC分类号: G10L13/00 G10L11/00

    摘要: Pausing a VoiceXML dialog of a multimodal application, including generating by the multimodal application a pause event; responsive to the pause event, temporarily pausing the dialogue by the VoiceXML interpreter; generating by the multimodal application a resume event; and responsive to the resume event, resuming the dialog. Embodiments are implemented with the multimodal application operating on a multimodal device supporting multiple modes of interaction including a voice mode and one or more non-voice modes, the multimodal application is operatively coupled to a VoiceXML interpreter, and the VoiceXML interpreter is interpreting the VoiceXML dialog to be paused.

    摘要翻译: 暂停多模式应用程序的VoiceXML对话框,包括由多模态应用程序生成暂停事件; 响应暂停事件,VoiceXML解释器临时暂停对话; 由多模式应用程序生成一个简历事件; 并响应resume事件,恢复对话。 实施例是通过在多模式设备上操作的多模式应用来实现的,该多模式设备支持包括语音模式和一种或多种非语音模式的多种交互模式,多模式应用可操作地耦合到VoiceXML解释器,并且VoiceXML解释器正在解释VoiceXML对话 暂停

    SYSTEM AND METHODS FOR PROMPTING USER SPEECH IN MULTIMODAL DEVICES
    26.
    发明申请
    SYSTEM AND METHODS FOR PROMPTING USER SPEECH IN MULTIMODAL DEVICES 有权
    用于在多模式设备中提供用户演讲的系统和方法

    公开(公告)号:US20080162143A1

    公开(公告)日:2008-07-03

    申请号:US11616682

    申请日:2006-12-27

    IPC分类号: G10L21/00

    摘要: A method for prompting user input for a multimodal interface including the steps of providing a multimodal interface to a user, where the interface includes a visual interface having a plurality of input regions, each having at least one input field; selecting an input region and processing a multi-token speech input provided by the user, where the processed speech input includes at least one value for at least one input field of the selected input region; and storing at least one value in at least one input field.

    摘要翻译: 一种用于提示用户输入多模式接口的方法,包括向用户提供多模式接口的步骤,其中该接口包括具有多个输入区域的视觉接口,每个输入区域具有至少一个输入区域; 选择输入区域并处理由用户提供的多令牌语音输入,其中处理的语音输入包括用于所选输入区域的至少一个输入字段的至少一个值; 以及在至少一个输入字段中存储至少一个值。

    Creating a mixed-initiative grammar from directed dialog grammars
    27.
    发明授权
    Creating a mixed-initiative grammar from directed dialog grammars 有权
    从定向对话语法创建混合主动语法

    公开(公告)号:US08229745B2

    公开(公告)日:2012-07-24

    申请号:US11163522

    申请日:2005-10-21

    CPC分类号: G10L15/193 G10L15/183

    摘要: A method of building a mixed-initiative grammar can include receiving one or more conjoin phrases, wherein each conjoin phrase is associated with a selected one of the plurality of directed dialog grammars, and receiving a user input specifying a selected grammar generation technique. The mixed-initiative grammar can be automatically generated, in accordance with the selected grammar generation technique, such that the mixed-initiative grammar specifies an allowable ordering of sets when interpreting a user spoken utterance and whether duplicative phrases are allowable within the user spoken utterance.

    摘要翻译: 构建混合主动语法的方法可以包括接收一个或多个连词短语,其中每个连词短语与多个定向对话语法中的所选择的一个相关联,并且接收指定所选语法生成技术的用户输入。 可以根据所选择的语法生成技术自动地生成混合主动语法,使得混合主动语法在解释用户说话话语时指定集合的​​允许排序以及在用户口语中是否允许重复短语。

    Partially filling mixed-initiative forms from utterances having sub-threshold confidence scores based upon word-level confidence data
    28.
    发明授权
    Partially filling mixed-initiative forms from utterances having sub-threshold confidence scores based upon word-level confidence data 有权
    从基于词级置信度数据的具有子阈值置信度得分的话语部分填充混合主动形式

    公开(公告)号:US07870000B2

    公开(公告)日:2011-01-11

    申请号:US11692741

    申请日:2007-03-28

    IPC分类号: G10L15/16

    CPC分类号: G10L15/22 G10L15/193

    摘要: The present disclosure relates to prompting for a spoken response that provides input for multiple elements. A single spoken utterance including content for multiple elements can be received, where each element is mapped to a data field. The spoken utterance can be speech-to-text converted to derive values for each of the multiple elements. An utterance level confidence score can be determined, which can fall below an associated certainty threshold. Element-level confidence scores for each of the derived elements can then be ascertained. A first set of the multiple elements can have element-level confidence scores above an associated certainty threshold and a second set can have scores below. Values can be stored in data fields mapped to the first set. A prompt for input for the second set can be played.

    摘要翻译: 本公开涉及提示提供多个元素的输入的口头响应。 可以接收包括多个元素的内容的单个语音话语,其中每个元素被映射到数据字段。 讲话语音可以是语音到文本转换,以导出每个多个元素的值。 可以确定话语等级置信度得分,其可以低于相关的确定性阈值。 然后可以确定每个派生元素的元素级置信度得分。 多个元素的第一组可以具有高于相关确定性阈值的元素级置信度得分,而第二组可以具有下面的得分。 值可以存储在映射到第一组的数据字段中。 可以播放第二组的输入提示。

    SYSTEMS AND METHODS FOR PROMPTING USER SPEECH IN MULTIMODAL DEVICES
    29.
    发明申请
    SYSTEMS AND METHODS FOR PROMPTING USER SPEECH IN MULTIMODAL DEVICES 审中-公开
    用于在多模式设备中提供用户演讲的系统和方法

    公开(公告)号:US20130227417A1

    公开(公告)日:2013-08-29

    申请号:US13847974

    申请日:2013-03-20

    IPC分类号: G06F3/16

    摘要: A method for prompting user input for a multimodal interface including the steps of providing a multimodal interface to a user, where the interface includes a visual interface having a plurality of input regions, each having at least one input field; selecting an input region and processing a multi-token speech input provided by the user, where the processed speech input includes at least one value for at least one input field of the selected input region; and storing at least one value in at least one input field.

    摘要翻译: 一种用于提示用户输入多模式接口的方法,包括向用户提供多模式接口的步骤,其中该接口包括具有多个输入区域的视觉接口,每个输入区域具有至少一个输入区域; 选择输入区域并处理由用户提供的多令牌语音输入,其中处理的语音输入包括用于所选输入区域的至少一个输入字段的至少一个值; 以及在至少一个输入字段中存储至少一个值。

    System and methods for prompting user speech in multimodal devices
    30.
    发明授权
    System and methods for prompting user speech in multimodal devices 有权
    在多模式设备中提示用户演讲的系统和方法

    公开(公告)号:US08417529B2

    公开(公告)日:2013-04-09

    申请号:US11616682

    申请日:2006-12-27

    IPC分类号: G10L21/00

    摘要: A method for prompting user input for a multimodal interface including the steps of providing a multimodal interface to a user, where the interface includes a visual interface having a plurality of input regions, each having at least one input field; selecting an input region and processing a multi-token speech input provided by the user, where the processed speech input includes at least one value for at least one input field of the selected input region; and storing at least one value in at least one input field.

    摘要翻译: 一种用于提示用户输入多模式接口的方法,包括向用户提供多模式接口的步骤,其中该接口包括具有多个输入区域的视觉接口,每个输入区域具有至少一个输入区域; 选择输入区域并处理由用户提供的多令牌语音输入,其中处理的语音输入包括用于所选输入区域的至少一个输入字段的至少一个值; 以及在至少一个输入字段中存储至少一个值。