Dynamic switching between local and remote speech rendering
    1.
    发明授权
    Dynamic switching between local and remote speech rendering 有权
    本地和远程语音呈现之间的动态切换

    公开(公告)号:US08024194B2

    公开(公告)日:2011-09-20

    申请号:US11007830

    申请日:2004-12-08

    IPC分类号: G10L21/00 G10L13/00 G10L15/00

    摘要: A multimodal browser for rendering a multimodal document on an end system defining a host can include a visual browser component for rendering visual content, if any, of the multimodal document, and a voice browser component for rendering voice-based content, if any, of the multimodal document. The voice browser component can determine which of a plurality of speech processing configuration is used by the host in rendering the voice-based content. The determination can be based upon the resources of the host running the application. The determination also can be based upon a processing instruction contained in the application.

    摘要翻译: 用于在定义主机的终端系统上呈现多模式文档的多模式浏览器可以包括用于呈现多模式文档的视觉内容(如果有的话)的视觉浏览器组件,以及用于呈现基于语音的内容(如果有的话)的语音浏览器组件 多模式文件。 语音浏览器组件可以确定主机在渲染基于语音的内容中使用多个语音处理配置中的哪一个。 确定可以基于运行应用程序的主机的资源。 该确定还可以基于应用中包含的处理指令。

    Pausing a VoiceXML dialog of a multimodal application
    2.
    发明授权
    Pausing a VoiceXML dialog of a multimodal application 有权
    暂停多模式应用程序的VoiceXML对话框

    公开(公告)号:US08713542B2

    公开(公告)日:2014-04-29

    申请号:US11679236

    申请日:2007-02-27

    摘要: Pausing a VoiceXML dialog of a multimodal application, including generating by the multimodal application a pause event; responsive to the pause event, temporarily pausing the dialogue by the VoiceXML interpreter; generating by the multimodal application a resume event; and responsive to the resume event, resuming the dialog. Embodiments are implemented with the multimodal application operating on a multimodal device supporting multiple modes of interaction including a voice mode and one or more non-voice modes, the multimodal application is operatively coupled to a VoiceXML interpreter, and the VoiceXML interpreter is interpreting the VoiceXML dialog to be paused.

    摘要翻译: 暂停多模式应用程序的VoiceXML对话框,包括由多模态应用程序生成暂停事件; 响应暂停事件,VoiceXML解释器临时暂停对话; 由多模式应用程序生成一个简历事件; 并响应resume事件,恢复对话。 实施例是通过在多模式设备上操作的多模式应用来实现的,该多模式设备支持包括语音模式和一种或多种非语音模式的多种交互模式,多模式应用可操作地耦合到VoiceXML解释器,并且VoiceXML解释器正在解释VoiceXML对话 暂停

    Pausing A VoiceXML Dialog Of A Multimodal Application
    3.
    发明申请
    Pausing A VoiceXML Dialog Of A Multimodal Application 有权
    暂停多模式应用程序的VoiceXML对话框

    公开(公告)号:US20080208584A1

    公开(公告)日:2008-08-28

    申请号:US11679236

    申请日:2007-02-27

    IPC分类号: G10L13/00 G10L11/00

    摘要: Pausing a VoiceXML dialog of a multimodal application, including generating by the multimodal application a pause event; responsive to the pause event, temporarily pausing the dialogue by the VoiceXML interpreter; generating by the multimodal application a resume event; and responsive to the resume event, resuming the dialog. Embodiments are implemented with the multimodal application operating on a multimodal device supporting multiple modes of interaction including a voice mode and one or more non-voice modes, the multimodal application is operatively coupled to a VoiceXML interpreter, and the VoiceXML interpreter is interpreting the VoiceXML dialog to be paused.

    摘要翻译: 暂停多模式应用程序的VoiceXML对话框,包括由多模态应用程序生成暂停事件; 响应暂停事件,VoiceXML解释器临时暂停对话; 由多模式应用程序生成一个简历事件; 并响应resume事件,恢复对话。 实施例是通过在多模式设备上操作的多模式应用来实现的,该多模式设备支持包括语音模式和一种或多种非语音模式的多种交互模式,多模式应用可操作地耦合到VoiceXML解释器,并且VoiceXML解释器正在解释VoiceXML对话 暂停

    Dynamic help including available speech commands from content contained within speech grammars
    4.
    发明授权
    Dynamic help including available speech commands from content contained within speech grammars 有权
    动态帮助,包括语音语法中包含的内容的可用语音命令

    公开(公告)号:US08311836B2

    公开(公告)日:2012-11-13

    申请号:US11375417

    申请日:2006-03-13

    IPC分类号: G10L21/00 G10L15/04

    CPC分类号: G06F3/167 G10L2015/228

    摘要: A method for providing help to voice-enabled applications, including multimodal applications, can include a step of identifying at least one speech grammar associated with a voice-enabled application. Help fields can be defined within the speech grammar. The help fields can include available speech commands for the voice enabled application. When the speech grammar is activated for use by the voice-enabled application, the available speech commands can be presented to a user of the voice-enabled application. The presented speech commands can be obtained from the help fields.

    摘要翻译: 用于向包括多模式应用在内的支持语音的应用提供帮助的方法可以包括识别与支持语音的应用相关联的至少一个语音语法的步骤。 在语言语法中可以定义帮助字段。 帮助字段可以包括用于支持语音的应用程序的可用语音命令。 当语音语法激活以供由语音使能的应用使用时,可以将语音命令呈现给支持语音的应用的用户。 所提供的语音命令可以从帮助字段获得。

    Creating a mixed-initiative grammar from directed dialog grammars
    5.
    发明授权
    Creating a mixed-initiative grammar from directed dialog grammars 有权
    从定向对话语法创建混合主动语法

    公开(公告)号:US08229745B2

    公开(公告)日:2012-07-24

    申请号:US11163522

    申请日:2005-10-21

    CPC分类号: G10L15/193 G10L15/183

    摘要: A method of building a mixed-initiative grammar can include receiving one or more conjoin phrases, wherein each conjoin phrase is associated with a selected one of the plurality of directed dialog grammars, and receiving a user input specifying a selected grammar generation technique. The mixed-initiative grammar can be automatically generated, in accordance with the selected grammar generation technique, such that the mixed-initiative grammar specifies an allowable ordering of sets when interpreting a user spoken utterance and whether duplicative phrases are allowable within the user spoken utterance.

    摘要翻译: 构建混合主动语法的方法可以包括接收一个或多个连词短语,其中每个连词短语与多个定向对话语法中的所选择的一个相关联,并且接收指定所选语法生成技术的用户输入。 可以根据所选择的语法生成技术自动地生成混合主动语法,使得混合主动语法在解释用户说话话语时指定集合的​​允许排序以及在用户口语中是否允许重复短语。

    Dynamically generating a vocal help prompt in a multimodal application
    6.
    发明授权
    Dynamically generating a vocal help prompt in a multimodal application 有权
    在多模式应用程序中动态生成声乐帮助提示

    公开(公告)号:US08086463B2

    公开(公告)日:2011-12-27

    申请号:US11530930

    申请日:2006-09-12

    IPC分类号: G10L21/00 G10L21/06

    摘要: Dynamically generating a vocal help prompt in a multimodal application that include detecting a help-triggering event for an input element of a VoiceXML dialog, where the detecting is implemented with a multimodal application operating on a multimodal device supporting multiple modes of interaction including a voice mode and one or more non-voice modes, the multimodal application is operatively coupled to a VoiceXML interpreter, and the multimodal application has no static help text. Dynamically generating a vocal help prompt in a multimodal application according to embodiments of the present invention typically also includes retrieving, by the VoiceXML interpreter from a source of help text, help text for an element of a speech recognition grammar, forming by the VoiceXML interpreter the help text into a vocal help prompt, and presenting by the multimodal application the vocal help prompt through a computer user interface to a user.

    摘要翻译: 在多模式应用中动态地产生声乐帮助提示,包括检测VoiceXML对话框的输入元素的帮助触发事件,其中使用在支持多种交互模式的多模式设备上操作的多模式应用来实现检测,包括语音模式 和一个或多个非语音模式,多模式应用程序可操作地耦合到VoiceXML解释器,并且多模式应用程序没有静态帮助文本。 在根据本发明的实施例的多模式应用中动态地产生声乐帮助提示通常还包括由VoiceXML解释器从帮助文本的源中检索帮助语音识别语法的元素的文本,由VoiceXML解释器形成 帮助文本进入声乐帮助提示,并通过多用途应用程序向用户提供通过计算机用户界面的声乐帮助提示。

    Systems and methods for inputting graphical data into a graphical input field
    7.
    发明授权
    Systems and methods for inputting graphical data into a graphical input field 失效
    将图形数据输入图形输入字段的系统和方法

    公开(公告)号:US08296149B2

    公开(公告)日:2012-10-23

    申请号:US12363580

    申请日:2009-01-30

    IPC分类号: G10L15/22

    CPC分类号: G10L2015/228

    摘要: A system (20) for inputting graphical data into a graphical input field includes a graphical input device (22) for inputting the graphical data into the graphical input field, and a processor-executable voice-form module (28) responsive to an initial presentation of graphical data to the graphical input device. The voice-form module (28) causes a determination of whether the inputting of the graphical data into the graphical input field is complete. A method for inputting graphical data into a graphical input field includes initiating an input of graphical data via a graphical input device into the graphical input field, and actuating a voice-form module in response to initiating the input of graphical data into the graphical input field.

    摘要翻译: 用于将图形数据输入到图形输入字段的系统(20)包括用于将图形数据输入图形输入字段的图形输入装置(22)和响应于初始呈现的处理器可执行语音模块(28) 的图形数据输入到图形输入设备。 声音形式模块(28)确定图形输入字段中的图形数据的输入是否完成。 用于将图形数据输入到图形输入字段的方法包括:通过图形输入装置将图形数据的输入启动到图形输入字段中,以及响应于启动图形数据输入到图形输入字段来启动语音模块模块 。

    DYNAMICALLY GENERATING A VOCAL HELP PROMPT IN A MULTIMODAL APPLICATION
    8.
    发明申请
    DYNAMICALLY GENERATING A VOCAL HELP PROMPT IN A MULTIMODAL APPLICATION 审中-公开
    动态地在多模式应用程序中生成VOCAL帮助提示

    公开(公告)号:US20120065982A1

    公开(公告)日:2012-03-15

    申请号:US13303380

    申请日:2011-11-23

    IPC分类号: G10L21/00

    摘要: Dynamically generating a vocal help prompt in a multimodal application that include detecting a help-triggering event for an input element of a VoiceXML dialog, where the detecting is implemented with a multimodal application operating on a multimodal device supporting multiple modes of interaction including a voice mode and one or more non-voice modes, the multimodal application is operatively coupled to a VoiceXML interpreter, and the multimodal application has no static help text. Dynamically generating a vocal help prompt in a multimodal application according to embodiments of the present invention typically also includes retrieving, by the VoiceXML interpreter from a source of help text, help text for an element of a speech recognition grammar, forming by the VoiceXML interpreter the help text into a vocal help prompt, and presenting by the multimodal application the vocal help prompt through a computer user interface to a user.

    摘要翻译: 在多模式应用中动态地产生声乐帮助提示,包括检测VoiceXML对话框的输入元素的帮助触发事件,其中使用在支持多种交互模式的多模式设备上操作的多模式应用来实现检测,包括语音模式 和一个或多个非语音模式,多模式应用程序可操作地耦合到VoiceXML解释器,并且多模式应用程序没有静态帮助文本。 在根据本发明的实施例的多模式应用中动态地产生声乐帮助提示通常还包括由VoiceXML解释器从帮助文本的源中检索帮助语音识别语法的元素的文本,由VoiceXML解释器形成 帮助文本进入声乐帮助提示,并通过多用途应用程序向用户提供通过计算机用户界面的声乐帮助提示。

    SYSTEMS AND METHODS FOR INPUTTING GRAPHICAL DATA INTO A GRAPHICAL INPUT FIELD
    9.
    发明申请
    SYSTEMS AND METHODS FOR INPUTTING GRAPHICAL DATA INTO A GRAPHICAL INPUT FIELD 失效
    将图形数据输入图形输入场的系统和方法

    公开(公告)号:US20090199101A1

    公开(公告)日:2009-08-06

    申请号:US12363580

    申请日:2009-01-30

    IPC分类号: G06F3/048 G10L11/00 G06F3/16

    CPC分类号: G10L2015/228

    摘要: A system (20) for inputting graphical data into a graphical input field includes a graphical input device (22) for inputting the graphical data into the graphical input field, and a processor-executable voice-form module (28) responsive to an initial presentation of graphical data to the graphical input device. The voice-form module (28) causes a determination of whether the inputting of the graphical data into the graphical input field is complete. A method for inputting graphical data into a graphical input field includes initiating an input of graphical data via a graphical input device into the graphical input field, and actuating a voice-form module in response to initiating the input of graphical data into the graphical input field.

    摘要翻译: 用于将图形数据输入图形输入字段的系统(20)包括用于将图形数据输入图形输入字段的图形输入装置(22),以及响应于初始呈现的处理器可执行语音模块(28) 的图形数据输入到图形输入设备。 声音形式模块(28)确定图形输入字段中的图形数据的输入是否完成。 用于将图形数据输入到图形输入字段的方法包括:通过图形输入装置将图形数据的输入启动到图形输入字段中,以及响应于启动图形数据输入到图形输入字段来启动语音模块模块 。

    Systems and methods for inputting graphical data into a graphical input field
    10.
    发明授权
    Systems and methods for inputting graphical data into a graphical input field 失效
    将图形数据输入图形输入字段的系统和方法

    公开(公告)号:US07509260B2

    公开(公告)日:2009-03-24

    申请号:US10945119

    申请日:2004-09-20

    IPC分类号: G10L15/22

    CPC分类号: G10L2015/228

    摘要: A method for inputting graphical data into a graphical input field includes initiating an input of graphical data via a graphical input device into the graphical input field, and actuating a voice-form module in response to initiating the input of graphical data. Actuating the voice-form module includes actuating a first voice-form function for capturing an initial value corresponding to the graphical input field and actuating a second voice-form function based upon a final value corresponding to the graphical input field. The first voice-form function initiates a timing function for polling the graphical input field at a predefined interval to determine subsequent values corresponding to the graphical input field in order to determine whether the input of graphical data into the graphical input field is complete. The second voice-form function determines whether the final value corresponding to the graphical input field is contained within a predefined set of valid values.

    摘要翻译: 用于将图形数据输入到图形输入字段的方法包括:通过图形输入装置将图形数据的输入启动到图形输入字段中,以及响应于启动图形数据的输入来启动语音模块模块。 启动语音模块包括启动第一语音形式功能,用于捕获与图形输入字段对应的初始值,并且基于对应于图形输入字段的最终值来启动第二语音形式功能。 第一语音形式功能启动定时功能,用于以预定间隔轮询图形输入字段,以确定对应于图形输入字段的后续值,以便确定图形输入字段中图形数据的输入是否完整。 第二语音形式功能确定对应于图形输入字段的最终值是否包含在预定义的一组有效值内。