Enabling speech recognition grammars in web page frames
    1.
    发明授权
    Enabling speech recognition grammars in web page frames 有权
    在网页框架中启用语音识别语法

    公开(公告)号:US08073692B2

    公开(公告)日:2011-12-06

    申请号:US12917741

    申请日:2010-11-02

    IPC分类号: G10L15/22 G06F17/20

    摘要: Enabling grammars in web page frames, including receiving, in a multimodal application on a multimodal device, a frameset document, where the frameset document includes markup defining web page frames; obtaining by the multimodal application content documents for display in each of the web page frames, where the content documents include navigable markup elements; generating by the multimodal application, for each navigable markup element in each content document, a segment of markup defining a speech recognition grammar, including inserting in each such grammar markup identifying content to be displayed when words in the grammar are matched and markup identifying a frame where the content is to be displayed; and enabling by the multimodal application all the generated grammars for speech recognition.

    摘要翻译: 在网页框架中启用语法,包括在多模式设备上的多模式应用程序中接收框架集文档,其中框架集文档包括定义网页框架的标记; 通过多模式应用程序内容文档获取以在每个网页帧中显示,其中内容文档包括可导航标记元素; 由多模式应用为每个内容文档中的每个可导航标记元素生成定义语音识别语法的标记段,包括在每个这样的语法标记中插入标识要在语法中的词匹配时要显示的内容,并且标识标识帧 要显示的内容; 并通过多模式应用程序实现所有生成的语法用于语音识别。

    Method and system of building a grammar rule with baseforms generated dynamically from user utterances
    2.
    发明授权
    Method and system of building a grammar rule with baseforms generated dynamically from user utterances 有权
    使用从用户话语动态生成的基本形式构建语法规则的方法和系统

    公开(公告)号:US07962343B2

    公开(公告)日:2011-06-14

    申请号:US12276036

    申请日:2008-11-21

    IPC分类号: G10L21/00

    CPC分类号: G10L15/187 G10L2015/0631

    摘要: A method (200) of building a grammar with baseforms generated dynamically from user utterances can include the steps of recording (205) a user utterance, generating (210) a baseform using the user utterance, creating or adding to (215) a grammar rule using the baseform, and binding (230) the grammar rule in a grammar document of a voice extensible markup language program. Generating a baseform can optionally include introducing a new element to VoiceXML with attributes that enable generating the baseform from a referenced recording such as the user utterance. In one embodiment, the method can be used to create (235) a phonebook and a grammar to access the phonebook by repeatedly visiting a form containing the grammar rule with attributes that enable generating the baseform from the referenced recording.

    摘要翻译: 用用户话语动态生成基本形式的语法的方法(200)可包括以下步骤:(205)用户话语,使用用户话语产生(210)基形,创建或添加(215)语法规则 使用基本形式,并在语音可扩展标记语言程序的语法文档中绑定(230)语法规则。 生成基本形式可以选择性地包括向VoiceXML引入新元素,该属性使得能够从引用的记录(例如用户话语)生成基本形式。 在一个实施例中,该方法可以用于通过重复地访问包含语法规则的表单来创建(235)电话簿和语法来访问电话簿,该属性可以使得能够从引用的记录生成基本形式。

    ENABLING GRAMMARS IN WEB PAGE FRAME
    3.
    发明申请
    ENABLING GRAMMARS IN WEB PAGE FRAME 有权
    在网页框架中启用GRAMMARS

    公开(公告)号:US20110047452A1

    公开(公告)日:2011-02-24

    申请号:US12917741

    申请日:2010-11-02

    IPC分类号: G06F17/28 G06F17/27 G06F17/00

    摘要: Enabling grammars in web page frames, including receiving, in a multimodal application on a multimodal device, a frameset document, where the frameset document includes markup defining web page frames; obtaining by the multimodal application content documents for display in each of the web page frames, where the content documents include navigable markup elements; generating by the multimodal application, for each navigable markup element in each content document, a segment of markup defining a speech recognition grammar, including inserting in each such grammar markup identifying content to be displayed when words in the grammar are matched and markup identifying a frame where the content is to be displayed; and enabling by the multimodal application all the generated grammars for speech recognition.

    摘要翻译: 在网页框架中启用语法,包括在多模式设备上的多模式应用程序中接收框架集文档,其中框架集文档包括定义网页框架的标记; 通过多模式应用程序内容文档获取以在每个网页帧中显示,其中内容文档包括可导航标记元素; 由多模式应用为每个内容文档中的每个可导航标记元素生成定义语音识别语法的标记段,包括在每个这样的语法标记中插入标识要在语法中的词匹配时要显示的内容,并且标识标识帧 要显示的内容; 并通过多模式应用程序实现所有生成的语法用于语音识别。

    ENABLING GLOBAL GRAMMARS FOR A PARTICULAR MULTIMODAL APPLICATION
    4.
    发明申请
    ENABLING GLOBAL GRAMMARS FOR A PARTICULAR MULTIMODAL APPLICATION 有权
    为特定的多模式应用启用全球GRAMMARS

    公开(公告)号:US20100324889A1

    公开(公告)日:2010-12-23

    申请号:US12873149

    申请日:2010-08-31

    IPC分类号: G06F17/27 G10L11/00

    CPC分类号: G10L15/19

    摘要: Methods, apparatus, and computer program products are described for enabling global grammars for a particular multimodal application according to the present invention by loading a multimodal web page; determining whether the loaded multimodal web page is one of a plurality of multimodal web pages of the particular multimodal application. If the loaded multimodal web page is one of the plurality of multimodal web pages of the particular multimodal application, enabling global grammars typically includes loading any currently unloaded global grammars of the particular multimodal application identified in the multimodal web page and maintaining any previously loaded global grammars. If the loaded multimodal web page is not one of the plurality of multimodal web pages of the particular multimodal application, enabling global grammars typically includes unloading any currently loaded global grammars.

    摘要翻译: 描述了方法,装置和计算机程序产品,用于通过加载多模式网页来实现根据本发明的特定多模式应用的全局语法; 确定加载的多模式网页是否是特定多模式应用的多个多模式网页之一。 如果加载的多模式网页是特定多模式应用程序的多个多模式网页之一,则启用全局语法通常包括加载在多模式网页中标识的特定多模式应用程序的任何当前未加载的全局语法,并维护任何先前加载的全局语法 。 如果加载的多模式网页不是特定多模式应用程序的多个多模式网页之一,则启用全局语法通常包括卸载任何当前加载的全局语法。

    METHOD AND SYSTEM OF BUILDING A GRAMMAR RULE WITH BASEFORMS GENERATED DYNAMICALLY FROM USER UTTERANCES
    5.
    发明申请
    METHOD AND SYSTEM OF BUILDING A GRAMMAR RULE WITH BASEFORMS GENERATED DYNAMICALLY FROM USER UTTERANCES 有权
    基于用户生成动态生成基数的GRAMMAR规则的方法和系统

    公开(公告)号:US20090076818A1

    公开(公告)日:2009-03-19

    申请号:US12276036

    申请日:2008-11-21

    IPC分类号: G10L15/18

    CPC分类号: G10L15/187 G10L2015/0631

    摘要: A method (200) of building a grammar with baseforms generated dynamically from user utterances can include the steps of recording (205) a user utterance, generating (210) a baseform using the user utterance, creating or adding to (215) a grammar rule using the baseform, and binding (230) the grammar rule in a grammar document of a voice extensible markup language program. Generating a baseform can optionally include introducing a new element to VoiceXML with attributes that enable generating the baseform from a referenced recording such as the user utterance. In one embodiment, the method can be used to create (235) a phonebook and a grammar to access the phonebook by repeatedly visiting a form containing the grammar rule with attributes that enable generating the baseform from the referenced recording.

    摘要翻译: 用用户话语动态生成基本形式的语法的方法(200)可包括以下步骤:(205)用户话语,使用用户话语产生(210)基形,创建或添加(215)语法规则 使用基本形式,并在语音可扩展标记语言程序的语法文档中绑定(230)语法规则。 生成基本形式可以选择性地包括向VoiceXML引入新元素,该属性使得能够从引用的记录(例如用户话语)生成基本形式。 在一个实施例中,该方法可以用于通过重复地访问包含语法规则的表单来创建(235)电话簿和语法来访问电话簿,该属性可以使得能够从引用的记录生成基本形式。

    Method and system of building a grammar rule with baseforms generated dynamically from user utterances
    6.
    发明授权
    Method and system of building a grammar rule with baseforms generated dynamically from user utterances 有权
    使用从用户话语动态生成的基本形式构建语法规则的方法和系统

    公开(公告)号:US07487085B2

    公开(公告)日:2009-02-03

    申请号:US10924520

    申请日:2004-08-24

    IPC分类号: G01L15/26

    CPC分类号: G10L15/187 G10L2015/0631

    摘要: A method (200) of building a grammar with baseforms generated dynamically from user utterances can include the steps of recording (205) a user utterance, generating (210) a baseform using the user utterance, creating or adding to (215) a grammar rule using the baseform, and binding (230) the grammar rule in a grammar document of a voice extensible markup language program. Generating a baseform can optionally include introducing a new element to VoiceXML with attributes that enable generating the baseform from a referenced recording such as the user utterance. In one embodiment, the method can be used to create (235) a phonebook and a grammar to access the phonebook by repeatedly visiting a form containing the grammar rule with attributes that enable generating the baseform from the referenced recording.

    摘要翻译: 用用户话语动态生成基本形式的语法的方法(200)可包括以下步骤:(205)用户话语,使用用户话语产生(210)基形,创建或添加(215)语法规则 使用基本形式,并在语音可扩展标记语言程序的语法文档中绑定(230)语法规则。 生成基本形式可以选择性地包括向VoiceXML引入新元素,该属性使得能够从引用的记录(例如用户话语)生成基本形式。 在一个实施例中,该方法可以用于通过重复地访问包含语法规则的表单来创建(235)电话簿和语法来访问电话簿,该属性可以使得能够从引用的记录生成基本形式。

    Enabling Global Grammars For A Particular Multimodal Application
    7.
    发明申请
    Enabling Global Grammars For A Particular Multimodal Application 有权
    为特定的多模式应用程序启用全局语法

    公开(公告)号:US20080208591A1

    公开(公告)日:2008-08-28

    申请号:US11679279

    申请日:2007-02-27

    IPC分类号: G10L21/00

    CPC分类号: G10L15/19

    摘要: Methods, apparatus, and computer program products are described for enabling global grammars for a particular multimodal application according to the present invention by loading a multimodal web page; determining whether the loaded multimodal web page is one of a plurality of multimodal web pages of the particular multimodal application. If the loaded multimodal web page is one of the plurality of multimodal web pages of the particular multimodal application, enabling global grammars typically includes loading any currently unloaded global grammars of the particular multimodal application identified in the multimodal web page and maintaining any previously loaded global grammars. If the loaded multimodal web page is not one of the plurality of multimodal web pages of the particular multimodal application, enabling global grammars typically includes unloading any currently loaded global grammars.

    摘要翻译: 描述了方法,装置和计算机程序产品,用于通过加载多模式网页来实现根据本发明的特定多模式应用的全局语法; 确定加载的多模式网页是否是特定多模式应用的多个多模式网页之一。 如果加载的多模式网页是特定多模式应用程序的多个多模式网页之一,则启用全局语法通常包括加载在多模式网页中标识的特定多模式应用程序的任何当前未加载的全局语法,并维护任何先前加载的全局语法 。 如果加载的多模式网页不是特定多模式应用程序的多个多模式网页之一,则使全局语法通常包括卸载任何当前加载的全局语法。

    Speech-enabled content navigation and control of a distributed multimodal browser
    8.
    发明授权
    Speech-enabled content navigation and control of a distributed multimodal browser 有权
    语音启用的内容导航和分布式多模式浏览器的控制

    公开(公告)号:US08862475B2

    公开(公告)日:2014-10-14

    申请号:US11734445

    申请日:2007-04-12

    摘要: Speech-enabled content navigation and control of a distributed multimodal browser is disclosed, the browser providing an execution environment for a multimodal application, the browser including a graphical user agent (‘GUA’) and a voice user agent (‘VUA’), the GUA operating on a multimodal device, the VUA operating on a voice server, that includes: transmitting, by the GUA, a link message to the VUA, the link message specifying voice commands that control the browser and an event corresponding to each voice command; receiving, by the GUA, a voice utterance from a user, the voice utterance specifying a particular voice command; transmitting, by the GUA, the voice utterance to the VUA for speech recognition by the VUA; receiving, by the GUA, an event message from the VUA, the event message specifying a particular event corresponding to the particular voice command; and controlling, by the GUA, the browser in dependence upon the particular event.

    摘要翻译: 公开了一种分布式多模式浏览器的语音启用内容导航和控制,浏览器为多模式应用提供执行环境,浏览器包括图形用户代理(“GUA”)和语音用户代理(“VUA”), GUA在多模式设备上操作,VUA在语音服务器上操作,其包括:由GUA向VUA发送链接消息,指定控制浏览器的语音命令的链接消息和与每个语音命令相对应的事件; 由GUA接收来自用户的语音发音,指定特定语音命令的语音话语; 通过GUA向VUA发送语音识别语音识别语音; 由GUA接收来自VUA的事件消息,事件消息指定与特定语音命令对应的特定事件; 并由GUA根据特定事件控制浏览器。

    Automatic speech recognition with a selection list
    9.
    发明授权
    Automatic speech recognition with a selection list 有权
    具有选择列表的自动语音识别

    公开(公告)号:US08612230B2

    公开(公告)日:2013-12-17

    申请号:US11619209

    申请日:2007-01-03

    IPC分类号: G10L21/00

    摘要: Methods, apparatus, and computer program products are described for automatic speech recognition (‘ASR’) that include accepting by the multimodal application speech input and visual input for selecting or deselecting items in a selection list, the speech input enabled by a speech recognition grammar; providing, from the multimodal application to the grammar interpreter, the speech input and the speech recognition grammar; receiving, by the multimodal application from the grammar interpreter, interpretation results including matched words from the grammar that correspond to items in the selection list and a semantic interpretation token that specifies whether to select or deselect items in the selection list; and determining, by the multimodal application in dependence upon the value of the semantic interpretation token, whether to select or deselect items in the selection list that correspond to the matched words.

    摘要翻译: 描述用于自动语音识别(“ASR”)的方法,装置和计算机程序产品,其包括通过多模式应用语音输入的接受和用于在选择列表中选择或取消选择项目的可视输入,由语音识别语法启用的语音输入 ; 从多模式应用程序提供语法解释器,语音输入和语音识别语法; 通过多模式应用从语法解释器接收包括对应于选择列表中的项目的语法的匹配词的解释结果以及指定是否选择或取消选择列表中的项目的语义解释令牌; 以及根据所述语义解释令牌的值由所述多模式应用程序确定是否选择或取消选择列表中对应于所述匹配词的项目。

    Creating a mixed-initiative grammar from directed dialog grammars
    10.
    发明授权
    Creating a mixed-initiative grammar from directed dialog grammars 有权
    从定向对话语法创建混合主动语法

    公开(公告)号:US08229745B2

    公开(公告)日:2012-07-24

    申请号:US11163522

    申请日:2005-10-21

    CPC分类号: G10L15/193 G10L15/183

    摘要: A method of building a mixed-initiative grammar can include receiving one or more conjoin phrases, wherein each conjoin phrase is associated with a selected one of the plurality of directed dialog grammars, and receiving a user input specifying a selected grammar generation technique. The mixed-initiative grammar can be automatically generated, in accordance with the selected grammar generation technique, such that the mixed-initiative grammar specifies an allowable ordering of sets when interpreting a user spoken utterance and whether duplicative phrases are allowable within the user spoken utterance.

    摘要翻译: 构建混合主动语法的方法可以包括接收一个或多个连词短语,其中每个连词短语与多个定向对话语法中的所选择的一个相关联,并且接收指定所选语法生成技术的用户输入。 可以根据所选择的语法生成技术自动地生成混合主动语法,使得混合主动语法在解释用户说话话语时指定集合的​​允许排序以及在用户口语中是否允许重复短语。