Ordering recognition results produced by an automatic speech recognition engine for a multimodal application
    1.
    Granted patent
    Status: in force

    Publication No.: US07840409B2

    Publication date: 2010-11-23

    Application No.: US11679284

    Filing date: 2007-02-27

    IPC classification: G10L21/06

    Abstract: Ordering recognition results produced by an automatic speech recognition (‘ASR’) engine for a multimodal application implemented with a grammar of the multimodal application in the ASR engine, with the multimodal application operating in a multimodal browser on a multimodal device supporting multiple modes of interaction including a voice mode and one or more non-voice modes, the multimodal application operatively coupled to the ASR engine through a VoiceXML interpreter, includes: receiving, in the VoiceXML interpreter from the multimodal application, a voice utterance; determining, by the VoiceXML interpreter using the ASR engine, a plurality of recognition results in dependence upon the voice utterance and the grammar; determining, by the VoiceXML interpreter according to semantic interpretation scripts of the grammar, a weight for each recognition result; and sorting, by the VoiceXML interpreter, the plurality of recognition results in dependence upon the weight for each recognition result.
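The weighting and sorting steps in this abstract can be pictured with a minimal Python sketch. It is not the patented implementation: the `RecognitionResult` class, the `order_results` function, and the `weigh` callback (a stand-in for the grammar's semantic interpretation scripts) are all hypothetical names, and sorting highest weight first is an assumption.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class RecognitionResult:
    text: str            # phrase matched against the grammar
    weight: float = 0.0  # filled in by the weighting step


def order_results(
    results: List[RecognitionResult],
    weigh: Callable[[RecognitionResult], float],
) -> List[RecognitionResult]:
    """Assign a weight to each result, then sort by weight (highest first)."""
    for result in results:
        result.weight = weigh(result)
    return sorted(results, key=lambda r: r.weight, reverse=True)


# Hypothetical weighting rule standing in for a semantic interpretation script.
preferred = {"boston": 2.0, "austin": 1.0}
ranked = order_results(
    [
        RecognitionResult("austin"),
        RecognitionResult("boston"),
        RecognitionResult("houston"),
    ],
    lambda r: preferred.get(r.text, 0.0),
)
print([r.text for r in ranked])
```

Passing the weighting rule in as a callback keeps the sort independent of how weights are computed, which mirrors the abstract's separation between the weighting step and the sorting step.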


    Altering behavior of a multimodal application based on location
    2.
    Granted patent
    Status: in force

    Publication No.: US09208783B2

    Publication date: 2015-12-08

    Application No.: US11679301

    Filing date: 2007-02-27

    CPC classification: G10L15/22 G10L15/24

    Abstract: Methods, apparatus, and products are disclosed for altering behavior of a multimodal application based on location. The multimodal application operates on a multimodal device supporting multiple modes of user interaction with the multimodal application, including a voice mode and one or more non-voice modes. The voice mode of user interaction with the multimodal application is supported by a voice interpreter. Altering behavior of a multimodal application based on location includes: receiving a location change notification in the voice interpreter from a device location manager, the device location manager operatively coupled to a position detection component of the multimodal device, the location change notification specifying a current location of the multimodal device; updating, by the voice interpreter, location-based environment parameters for the voice interpreter in dependence upon the current location of the multimodal device; and interpreting, by the voice interpreter, the multimodal application in dependence upon the location-based environment parameters.
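As a rough illustration of the notify, update, and interpret sequence described in this abstract, the Python sketch below models a voice interpreter whose environment parameters change with location. The `VoiceInterpreter` class, the `language` parameter, the region names, and the prompt table are all assumptions made for the example; the abstract does not specify which parameters are location-based.

```python
from dataclasses import dataclass, field
from typing import Dict


@dataclass
class VoiceInterpreter:
    # Hypothetical table: location name -> environment parameters for it.
    region_params: Dict[str, Dict[str, str]]
    environment: Dict[str, str] = field(default_factory=dict)

    def on_location_change(self, current_location: str) -> None:
        """Handle a location change notification by updating the environment."""
        self.environment = dict(self.region_params.get(current_location, {}))

    def interpret(self, prompts: Dict[str, str]) -> str:
        """Pick the prompt matching the current location-based language."""
        language = self.environment.get("language", "en")
        return prompts.get(language, prompts["en"])


interpreter = VoiceInterpreter({"paris": {"language": "fr"}})
prompts = {"en": "Hello", "fr": "Bonjour"}

interpreter.on_location_change("paris")   # notification from a location manager
greeting_in_paris = interpreter.interpret(prompts)

interpreter.on_location_change("unknown")  # no parameters known for this place
greeting_elsewhere = interpreter.interpret(prompts)
print(greeting_in_paris, greeting_elsewhere)
```

In this sketch an unknown location clears the environment, so interpretation falls back to the default language; a real interpreter might instead keep the previous parameters.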


    Ordering Recognition Results Produced By An Automatic Speech Recognition Engine For A Multimodal Application
    3.
    Patent application
    Status: in force

    Publication No.: US20080208585A1

    Publication date: 2008-08-28

    Application No.: US11679284

    Filing date: 2007-02-27

    IPC classification: G10L21/00

    Abstract: Ordering recognition results produced by an automatic speech recognition (‘ASR’) engine for a multimodal application implemented with a grammar of the multimodal application in the ASR engine, with the multimodal application operating in a multimodal browser on a multimodal device supporting multiple modes of interaction including a voice mode and one or more non-voice modes, the multimodal application operatively coupled to the ASR engine through a VoiceXML interpreter, includes: receiving, in the VoiceXML interpreter from the multimodal application, a voice utterance; determining, by the VoiceXML interpreter using the ASR engine, a plurality of recognition results in dependence upon the voice utterance and the grammar; determining, by the VoiceXML interpreter according to semantic interpretation scripts of the grammar, a weight for each recognition result; and sorting, by the VoiceXML interpreter, the plurality of recognition results in dependence upon the weight for each recognition result.


    Altering Behavior Of A Multimodal Application Based On Location
    4.
    Patent application
    Status: in force

    Publication No.: US20080208593A1

    Publication date: 2008-08-28

    Application No.: US11679301

    Filing date: 2007-02-27

    IPC classification: G10L21/00

    CPC classification: G10L15/22 G10L15/24

    Abstract: Methods, apparatus, and products are disclosed for altering behavior of a multimodal application based on location. The multimodal application operates on a multimodal device supporting multiple modes of user interaction with the multimodal application, including a voice mode and one or more non-voice modes. The voice mode of user interaction with the multimodal application is supported by a voice interpreter. Altering behavior of a multimodal application based on location includes: receiving a location change notification in the voice interpreter from a device location manager, the device location manager operatively coupled to a position detection component of the multimodal device, the location change notification specifying a current location of the multimodal device; updating, by the voice interpreter, location-based environment parameters for the voice interpreter in dependence upon the current location of the multimodal device; and interpreting, by the voice interpreter, the multimodal application in dependence upon the location-based environment parameters.


    Enabling speech recognition grammars in web page frames
    5.
    Granted patent
    Status: in force

    Publication No.: US08073692B2

    Publication date: 2011-12-06

    Application No.: US12917741

    Filing date: 2010-11-02

    IPC classification: G10L15/22 G06F17/20

    Abstract: Enabling grammars in web page frames, including receiving, in a multimodal application on a multimodal device, a frameset document, where the frameset document includes markup defining web page frames; obtaining by the multimodal application content documents for display in each of the web page frames, where the content documents include navigable markup elements; generating by the multimodal application, for each navigable markup element in each content document, a segment of markup defining a speech recognition grammar, including inserting in each such grammar markup identifying content to be displayed when words in the grammar are matched and markup identifying a frame where the content is to be displayed; and enabling by the multimodal application all the generated grammars for speech recognition.
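The grammar-generation step in this abstract can be sketched in Python under some simplifying assumptions: navigable markup elements are taken to be HTML `<a>` links, each generated grammar is represented as a plain dictionary rather than grammar markup, and the target frame is read from the link's `target` attribute with the containing frame as a fallback. The names `LinkCollector` and `generate_grammars` are hypothetical.

```python
from html.parser import HTMLParser
from typing import Dict, List, Optional, Tuple


class LinkCollector(HTMLParser):
    """Collect (href, target, link text) for every <a href=...> element."""

    def __init__(self) -> None:
        super().__init__()
        self.links: List[Tuple[str, Optional[str], str]] = []
        self._href: Optional[str] = None
        self._target: Optional[str] = None
        self._text: List[str] = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            attributes = dict(attrs)
            self._href = attributes.get("href")
            self._target = attributes.get("target")
            self._text = []

    def handle_data(self, data):
        if self._href is not None:
            self._text.append(data)

    def handle_endtag(self, tag):
        if tag == "a" and self._href is not None:
            text = "".join(self._text).strip()
            self.links.append((self._href, self._target, text))
            self._href = None


def generate_grammars(frames: Dict[str, str]) -> List[dict]:
    """Build one grammar record per link in each frame's content document."""
    grammars = []
    for frame_name, html in frames.items():
        collector = LinkCollector()
        collector.feed(html)
        collector.close()
        for href, target, text in collector.links:
            grammars.append({
                "words": text.lower(),           # words the user may speak
                "content": href,                 # content to display on a match
                "frame": target or frame_name,   # frame that shows the content
            })
    return grammars


frames = {
    "nav": '<a href="news.html" target="main">News</a>',
    "main": '<p><a href="top.html">Top Story</a></p>',
}
grammars = generate_grammars(frames)
print(grammars)
```

Enabling the grammars would then amount to registering every record in `grammars` with the speech recognizer, a step left out here because it depends on the recognizer's own interface.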


    Method and system of building a grammar rule with baseforms generated dynamically from user utterances
    6.
    Granted patent
    Status: in force

    Publication No.: US07962343B2

    Publication date: 2011-06-14

    Application No.: US12276036

    Filing date: 2008-11-21

    IPC classification: G10L21/00

    CPC classification: G10L15/187 G10L2015/0631

    Abstract: A method (200) of building a grammar with baseforms generated dynamically from user utterances can include the steps of recording (205) a user utterance, generating (210) a baseform using the user utterance, creating or adding to (215) a grammar rule using the baseform, and binding (230) the grammar rule in a grammar document of a voice extensible markup language program. Generating a baseform can optionally include introducing a new element to VoiceXML with attributes that enable generating the baseform from a referenced recording such as the user utterance. In one embodiment, the method can be used to create (235) a phonebook and a grammar to access the phonebook by repeatedly visiting a form containing the grammar rule with attributes that enable generating the baseform from the referenced recording.
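The record, generate, and add-to-rule flow for the phonebook embodiment might look roughly like the Python sketch below. Baseform generation itself (turning audio into a phonetic spelling) is outside the scope of the sketch, so it is injected as a `to_baseform` callable; that callable, the `GrammarRule` class, and `add_phonebook_entry` are hypothetical names, and the fake generator used in the usage lines simply decodes bytes for demonstration.

```python
from typing import Callable, Dict, List


class GrammarRule:
    """A named rule whose alternatives pair a baseform with a return value."""

    def __init__(self, name: str) -> None:
        self.name = name
        self.alternatives: List[Dict[str, str]] = []

    def add(self, baseform: str, value: str) -> None:
        self.alternatives.append({"baseform": baseform, "value": value})


def add_phonebook_entry(
    rule: GrammarRule,
    recording: bytes,
    number: str,
    to_baseform: Callable[[bytes], str],
) -> GrammarRule:
    """Generate a baseform from a recorded utterance and add it to the rule."""
    baseform = to_baseform(recording)
    rule.add(baseform, number)
    return rule


def fake_to_baseform(recording: bytes) -> str:
    # Stand-in only: a real system would derive phonemes from the audio.
    return recording.decode("ascii")


phonebook_rule = GrammarRule("phonebook")
# Each call models one visit to the form that records a name and a number.
add_phonebook_entry(phonebook_rule, b"JH AA N", "555-0100", fake_to_baseform)
add_phonebook_entry(phonebook_rule, b"M EH R IY", "555-0101", fake_to_baseform)
print(phonebook_rule.alternatives)
```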


    ENABLING GRAMMARS IN WEB PAGE FRAME
    7.
    Patent application
    Status: in force

    Publication No.: US20110047452A1

    Publication date: 2011-02-24

    Application No.: US12917741

    Filing date: 2010-11-02

    IPC classification: G06F17/28 G06F17/27 G06F17/00

    Abstract: Enabling grammars in web page frames, including receiving, in a multimodal application on a multimodal device, a frameset document, where the frameset document includes markup defining web page frames; obtaining by the multimodal application content documents for display in each of the web page frames, where the content documents include navigable markup elements; generating by the multimodal application, for each navigable markup element in each content document, a segment of markup defining a speech recognition grammar, including inserting in each such grammar markup identifying content to be displayed when words in the grammar are matched and markup identifying a frame where the content is to be displayed; and enabling by the multimodal application all the generated grammars for speech recognition.


    ENABLING GLOBAL GRAMMARS FOR A PARTICULAR MULTIMODAL APPLICATION
    8.
    Patent application
    Status: in force

    Publication No.: US20100324889A1

    Publication date: 2010-12-23

    Application No.: US12873149

    Filing date: 2010-08-31

    IPC classification: G06F17/27 G10L11/00

    CPC classification: G10L15/19

    Abstract: Methods, apparatus, and computer program products are described for enabling global grammars for a particular multimodal application according to the present invention by loading a multimodal web page and determining whether the loaded multimodal web page is one of a plurality of multimodal web pages of the particular multimodal application. If the loaded multimodal web page is one of the plurality of multimodal web pages of the particular multimodal application, enabling global grammars typically includes loading any currently unloaded global grammars of the particular multimodal application identified in the multimodal web page and maintaining any previously loaded global grammars. If the loaded multimodal web page is not one of the plurality of multimodal web pages of the particular multimodal application, enabling global grammars typically includes unloading any currently loaded global grammars.
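The load, maintain, and unload decision in this abstract reduces to a small piece of set logic, sketched here in Python. The function name `enable_global_grammars`, the use of URLs to identify pages, and the use of string names to identify grammars are assumptions for illustration only.

```python
from typing import Iterable, Set


def enable_global_grammars(
    page_url: str,
    page_grammars: Iterable[str],
    app_pages: Set[str],
    loaded: Set[str],
) -> Set[str]:
    """Return the set of global grammars loaded after visiting a page.

    If the page belongs to the application, grammars it identifies are added
    and previously loaded ones are kept. Otherwise everything is unloaded.
    """
    if page_url in app_pages:
        return set(loaded) | set(page_grammars)
    return set()


app_pages = {"app/home.html", "app/search.html"}

loaded = enable_global_grammars("app/home.html", ["help", "exit"], app_pages, set())
loaded = enable_global_grammars("app/search.html", ["exit", "back"], app_pages, loaded)
after_leaving = enable_global_grammars("other/site.html", [], app_pages, loaded)
print(sorted(loaded), sorted(after_leaving))
```

Returning a new set instead of mutating `loaded` keeps each page visit easy to reason about, at the cost of copying a small set on every page load.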


    METHOD AND SYSTEM OF BUILDING A GRAMMAR RULE WITH BASEFORMS GENERATED DYNAMICALLY FROM USER UTTERANCES
    9.
    Patent application
    Status: in force

    Publication No.: US20090076818A1

    Publication date: 2009-03-19

    Application No.: US12276036

    Filing date: 2008-11-21

    IPC classification: G10L15/18

    CPC classification: G10L15/187 G10L2015/0631

    Abstract: A method (200) of building a grammar with baseforms generated dynamically from user utterances can include the steps of recording (205) a user utterance, generating (210) a baseform using the user utterance, creating or adding to (215) a grammar rule using the baseform, and binding (230) the grammar rule in a grammar document of a voice extensible markup language program. Generating a baseform can optionally include introducing a new element to VoiceXML with attributes that enable generating the baseform from a referenced recording such as the user utterance. In one embodiment, the method can be used to create (235) a phonebook and a grammar to access the phonebook by repeatedly visiting a form containing the grammar rule with attributes that enable generating the baseform from the referenced recording.


    Method and system of building a grammar rule with baseforms generated dynamically from user utterances
    10.
    Granted patent
    Status: in force

    Publication No.: US07487085B2

    Publication date: 2009-02-03

    Application No.: US10924520

    Filing date: 2004-08-24

    IPC classification: G01L15/26

    CPC classification: G10L15/187 G10L2015/0631

    Abstract: A method (200) of building a grammar with baseforms generated dynamically from user utterances can include the steps of recording (205) a user utterance, generating (210) a baseform using the user utterance, creating or adding to (215) a grammar rule using the baseform, and binding (230) the grammar rule in a grammar document of a voice extensible markup language program. Generating a baseform can optionally include introducing a new element to VoiceXML with attributes that enable generating the baseform from a referenced recording such as the user utterance. In one embodiment, the method can be used to create (235) a phonebook and a grammar to access the phonebook by repeatedly visiting a form containing the grammar rule with attributes that enable generating the baseform from the referenced recording.
