Creating a Mixed-Initiative Grammar from Directed Dialog Grammars
    1.
    发明申请
    Creating a Mixed-Initiative Grammar from Directed Dialog Grammars 有权
    从定向对话语法创建混合主动语法

    公开(公告)号:US20070094026A1

    公开(公告)日:2007-04-26

    申请号:US11163522

    申请日:2005-10-21

    IPC分类号: G10L15/18

    CPC分类号: G10L15/193 G10L15/183

    摘要: A method of building a mixed-initiative grammar can include identifying a plurality of directed dialog grammars for inclusion in the mixed-initiative grammar and automatically generating the mixed-initiative grammar, in accordance with a selected grammar generation technique, such that the mixed-initiative grammar specifies the plurality of directed dialog grammars.

    摘要翻译: 构建混合主动语法的方法可以包括根据所选择的语法生成技术来识别用于包含在混合主动语法中的多个定向对话语法并自动生成混合主动语法,使得混合主动语法 语法规定了多个定向对话语法。

    Method of enhancing voice interactions using visual messages
    2.
    发明授权
    Method of enhancing voice interactions using visual messages 有权
    使用视觉消息增强语音交互的方法

    公开(公告)号:US07966188B2

    公开(公告)日:2011-06-21

    申请号:US10441839

    申请日:2003-05-20

    IPC分类号: G10L11/00 G10L21/00

    CPC分类号: G06F3/038 G06F3/167 G10L15/26

    摘要: A method for enhancing voice interactions within a portable multimodal computing device using visual messages. A multimodal interface can be provided that includes an audio interface and a visual interface. A speech input can then be received and a voice recognition task can be performed upon at least a portion of the speech input. At least one message within the multimodal interface can be visually presented, wherein the message is a prompt for the speech input and/or a confirmation of the speech input.

    摘要翻译: 一种使用可视消息在便携式多模式计算设备内增强语音交互的方法。 可以提供包括音频接口和可视界面的多模式接口。 然后可以接收语音输入,并且可以在语音输入的至少一部分上执行语音识别任务。 可以在视觉呈现多模式界面内的至少一个消息,其中消息是用于语音输入的提示和/或语音输入的确认。

    Dynamic help including available speech commands from content contained within speech grammars
    3.
    发明申请
    Dynamic help including available speech commands from content contained within speech grammars 有权
    动态帮助,包括语音语法中包含的内容的可用语音命令

    公开(公告)号:US20070213984A1

    公开(公告)日:2007-09-13

    申请号:US11375417

    申请日:2006-03-13

    IPC分类号: G10L15/18

    CPC分类号: G06F3/167 G10L2015/228

    摘要: A method for providing help to voice-enabled applications, including multimodal applications, can include a step of identifying at least one speech grammar associated with a voice-enabled application. Help fields can be defined within the speech grammar. The help fields can include available speech commands for the voice enabled application. When the speech grammar is activated for use by the voice-enabled application, the available speech commands can be presented to a user of the voice-enabled application. The presented speech commands can be obtained from the help fields.

    摘要翻译: 用于向包括多模式应用在内的支持语音的应用提供帮助的方法可以包括识别与支持语音的应用相关联的至少一个语音语法的步骤。 在语言语法中可以定义帮助字段。 帮助字段可以包括用于支持语音的应用程序的可用语音命令。 当语音语法激活以供由语音使能的应用使用时,可以将语音命令呈现给支持语音的应用的用户。 所提供的语音命令可以从帮助字段获得。

    Method and system for voice-enabled autofill

    公开(公告)号:US20060074652A1

    公开(公告)日:2006-04-06

    申请号:US11199672

    申请日:2005-08-09

    IPC分类号: G10L15/00

    摘要: A computer-implemented method and system are provided for filling a graphic-based form field in response to a speech utterance. The computer-implemented method includes generating a grammar corresponding to the form field, the grammar being based on a user profile and comprising a semantic interpretation string. The method further includes creating an auto-fill event based upon the at least one grammar and responsive to the speech utterance, the auto-fill event causing the filling of the form field with data corresponding to the user profile. The system includes a grammar-generating module for generating a grammar corresponding to the form field, the grammar being based on a user profile and comprising a semantic interpretation string. The system also includes an event module for creating an auto-fill event based upon the at least one grammar and responsive to the speech utterance, the event causing the filling of the form field with data corresponding to the user profile.

    Ordering recognition results produced by an automatic speech recognition engine for a multimodal application
    5.
    发明授权
    Ordering recognition results produced by an automatic speech recognition engine for a multimodal application 有权
    为多模式应用程序的自动语音识别引擎生成的订购识别结果

    公开(公告)号:US07840409B2

    公开(公告)日:2010-11-23

    申请号:US11679284

    申请日:2007-02-27

    IPC分类号: G10L21/06

    摘要: Ordering recognition results produced by an automatic speech recognition (‘ASR’) engine for a multimodal application implemented with a grammar of the multimodal application in the ASR engine, with the multimodal application operating in a multimodal browser on a multimodal device supporting multiple modes of interaction including a voice mode and one or more non-voice modes, the multimodal application operatively coupled to the ASR engine through a VoiceXML interpreter, includes: receiving, in the VoiceXML interpreter from the multimodal application, a voice utterance; determining, by the VoiceXML interpreter using the ASR engine, a plurality of recognition results in dependence upon the voice utterance and the grammar; determining, by the VoiceXML interpreter according to semantic interpretation scripts of the grammar, a weight for each recognition result; and sorting, by the VoiceXML interpreter, the plurality of recognition results in dependence upon the weight for each recognition result.

    摘要翻译: 通过使用ASR引擎中的多模式应用程序的语法实现的多模式应用程序的自动语音识别(“ASR”)引擎进行的订购识别结果,多模式应用程序在支持多种交互模式的多模式设备的多模式浏览器中运行 包括语音模式和一个或多个非语音模式,通过VoiceXML解释器可操作地耦合到ASR引擎的多模式应用包括:在来自多模式应用的VoiceXML解释器中接收语音话语; 通过使用ASR引擎的VoiceXML解释器,根据语音发音和语法来确定多个识别结果; 通过VoiceXML解释器根据语法的语义解释脚本确定每个识别结果的权重; 以及由VoiceXML解释器根据每个识别结果的权重对多个识别结果进行排序。

    Ordering Recognition Results Produced By An Automatic Speech Recognition Engine For A Multimodal Application
    6.
    发明申请
    Ordering Recognition Results Produced By An Automatic Speech Recognition Engine For A Multimodal Application 有权
    由多模式应用程序自动语音识别引擎生成的订购识别结果

    公开(公告)号:US20080208585A1

    公开(公告)日:2008-08-28

    申请号:US11679284

    申请日:2007-02-27

    IPC分类号: G10L21/00

    摘要: Ordering recognition results produced by an automatic speech recognition (‘ASR’) engine for a multimodal application implemented with a grammar of the multimodal application in the ASR engine, with the multimodal application operating in a multimodal browser on a multimodal device supporting multiple modes of interaction including a voice mode and one or more non-voice modes, the multimodal application operatively coupled to the ASR engine through a VoiceXML interpreter, includes: receiving, in the VoiceXML interpreter from the multimodal application, a voice utterance; determining, by the VoiceXML interpreter using the ASR engine, a plurality of recognition results in dependence upon the voice utterance and the grammar; determining, by the VoiceXML interpreter according to semantic interpretation scripts of the grammar, a weight for each recognition result; and sorting, by the VoiceXML interpreter, the plurality of recognition results in dependence upon the weight for each recognition result.

    摘要翻译: 通过使用ASR引擎中的多模式应用程序的语法实现的多模式应用程序的自动语音识别(“ASR”)引擎进行的订购识别结果,多模式应用程序在支持多种交互模式的多模式设备的多模式浏览器中运行 包括语音模式和一个或多个非语音模式,通过VoiceXML解释器可操作地耦合到ASR引擎的多模式应用包括:在来自多模式应用的VoiceXML解释器中接收语音话语; 通过使用ASR引擎的VoiceXML解释器,根据语音发音和语法来确定多个识别结果; 通过VoiceXML解释器根据语法的语义解释脚本确定每个识别结果的权重; 以及由VoiceXML解释器根据每个识别结果的权重对多个识别结果进行排序。

    Method and system for voice-enabled autofill
    7.
    发明申请
    Method and system for voice-enabled autofill 有权
    语音自动填充的方法和系统

    公开(公告)号:US20060064302A1

    公开(公告)日:2006-03-23

    申请号:US10945112

    申请日:2004-09-20

    IPC分类号: G10L15/26

    摘要: A computer-implemented method and system are provided for filling a graphic-based form field in response to a speech utterance. The computer-implemented method includes generating a grammar corresponding to the form field, the grammar being based on a user profile and comprising a semantic interpretation string. The method further includes creating an auto-fill event based upon the at least one grammar and responsive to the speech utterance, the auto-fill event causing the filling of the form field with data corresponding to the user profile. The system includes a grammar-generating module for generating a grammar corresponding to the form field, the grammar being based on a user profile and comprising a semantic interpretation string. The system also includes an event module for creating an auto-fill event based upon the at least one grammar and responsive to the speech utterance, the event causing the filling of the form field with data corresponding to the user profile.

    摘要翻译: 提供了一种计算机实现的方法和系统,用于响应于语音说话填充基于图形的表单字段。 计算机实现的方法包括生成对应于表单域的语法,语法基于用户简档并且包括语义解释字符串。 所述方法还包括基于所述至少一个语法并且响应于所述语音话语来创建自动填充事件,所述自动填充事件导致用与所述用户简档对应的数据填写所述表单域。 该系统包括用于生成对应于表单字段的语法的语法生成模块,该语法基于用户简档并且包括语义解释字符串。 该系统还包括一个事件模块,用于基于该至少一个语法创建一个自动填充事件,并且响应于语音话语,该事件导致用对应于用户简档的数据填写表单域。

    Verifying a user using speaker verification and a multimodal web-based interface
    8.
    发明申请
    Verifying a user using speaker verification and a multimodal web-based interface 有权
    使用讲话者验证和基于多模态的基于Web的界面验证用户

    公开(公告)号:US20060190264A1

    公开(公告)日:2006-08-24

    申请号:US11062731

    申请日:2005-02-22

    IPC分类号: G10L11/00

    摘要: A method of verifying a user identity using a Web-based multimodal interface can include sending, to a remote computing device, a multimodal markup language document that, when rendered by the remote computing device, queries a user for a user identifier and causes audio of the user's voice to be sent to a multimodal, Web-based application. The user identifier and the audio can be received at about a same time from the client device. The audio can be compared with a voice print associated with the user identifier. The user at the remote computing device can be selectively granted access to the system according to a result obtained from the comparing step.

    摘要翻译: 使用基于Web的多模式接口验证用户身份的方法可以包括向远程计算设备发送多模式标记语言文档,该多式联动标记语言文档在由远程计算设备呈现时向用户查询用户标识符并导致 将用户的语音发送到多模式的基于Web的应用程序。 可以在大约相同的时间从客户端设备接收用户标识符和音频。 音频可以与与用户标识符相关联的语音打印进行比较。 可以根据从比较步骤获得的结果,选择性地授予对远程计算设备的用户对系统的访问。

    Automatically adding code to voice enable a GUI component
    9.
    发明申请
    Automatically adding code to voice enable a GUI component 失效
    自动添加代码到语音启用GUI组件

    公开(公告)号:US20060136868A1

    公开(公告)日:2006-06-22

    申请号:US11017314

    申请日:2004-12-20

    IPC分类号: G06F9/44

    CPC分类号: G06F3/0481 G06F9/451

    摘要: A method to facilitate programming of multimodal access. The method can include receiving a user selection of at least one graphical user interface (GUI) component defined in visual markup code, and receiving a user selection of at least one voice component. Responsive to the user selections, voice markup code corresponding to the selected voice component can be automatically generated and linked to the GUI component.

    摘要翻译: 一种促进多模式访问编程的方法。 该方法可以包括接收用户选择在视觉标记代码中定义的至少一个图形用户界面(GUI)组件,以及接收至少一个语音组件的用户选择。 响应于用户选择,可以自动生成与所选择的语音组件相对应的语音标记代码并链接到GUI组件。

    Dynamic switching between local and remote speech rendering
    10.
    发明申请
    Dynamic switching between local and remote speech rendering 有权
    本地和远程语音呈现之间的动态切换

    公开(公告)号:US20060122836A1

    公开(公告)日:2006-06-08

    申请号:US11007830

    申请日:2004-12-08

    IPC分类号: G10L13/08 G10L21/00

    摘要: A multimodal browser for rendering a multimodal document on an end system defining a host can include a visual browser component for rendering visual content, if any, of the multimodal document, and a voice browser component for rendering voice-based content, if any, of the multimodal document. The voice browser component can determine which of a plurality of speech processing configuration is used by the host in rendering the voice-based content. The determination can be based upon the resources of the host running the application. The determination also can be based upon a processing instruction contained in the application.

    摘要翻译: 用于在定义主机的终端系统上呈现多模式文档的多模式浏览器可以包括用于呈现多模式文档的可视内容(如果有的话)的视觉浏览器组件,以及用于渲染基于语音的内容(如果有的话)的语音浏览器组件 多模式文件。 语音浏览器组件可以确定主机在渲染基于语音的内容中使用多个语音处理配置中的哪一个。 确定可以基于运行应用程序的主机的资源。 该确定还可以基于应用中包含的处理指令。