Multimodal Teleconferencing
    11.
    Patent application
    Multimodal Teleconferencing (Granted)

    Publication No.: US20110032845A1

    Publication date: 2011-02-10

    Application No.: US12535923

    Filing date: 2009-08-05

    IPC classification: H04L12/16 G10L15/00

    Abstract: Multimodal teleconferencing including receiving, by a multimodal teleconferencing module, a speech utterance from one of a plurality of participants in the multimodal teleconference; identifying the participant making the speech utterance as a current speaker; retrieving, by the multimodal teleconferencing module from accounts for the current speaker, content for display to the current speaker; retrieving, by the multimodal teleconferencing module from accounts for the current speaker, content for display to one or more other participants in the multimodal teleconference; providing, by the multimodal teleconferencing module to a multimodal teleconferencing client for display to the current speaker, an identification of the speaker and the content retrieved for the speaker; and providing, by the multimodal teleconferencing module to one or more of multimodal teleconferencing clients for display to the other participants, an identification of the current speaker with the content retrieved for the one or more other participants in the multimodal teleconference.

    Dynamically Extending The Speech Prompts Of A Multimodal Application
    12.
    Patent application
    Dynamically Extending The Speech Prompts Of A Multimodal Application (Granted)

    Publication No.: US20100332234A1

    Publication date: 2010-12-30

    Application No.: US12490443

    Filing date: 2009-06-24

    IPC classification: G10L15/22

    Abstract: Dynamically extending the speech prompts of a multimodal application including receiving, by the prompt generation engine, a media file having a metadata container; retrieving, by the prompt generation engine from the metadata container, a speech prompt related to content stored in the media file for inclusion in the multimodal application; and modifying, by the prompt generation engine, the multimodal application to include the speech prompt.

    Speech Capabilities Of A Multimodal Application
    13.
    Patent application
    Speech Capabilities Of A Multimodal Application (Granted)

    Publication No.: US20100299146A1

    Publication date: 2010-11-25

    Application No.: US12468166

    Filing date: 2009-05-19

    IPC classification: G10L15/08

    Abstract: Improving speech capabilities of a multimodal application including receiving, by the multimodal browser, a media file having a metadata container; retrieving, by the multimodal browser, from the metadata container a speech artifact related to content stored in the media file for inclusion in the speech engine available to the multimodal browser; determining whether the speech artifact includes a grammar rule or a pronunciation rule; if the speech artifact includes a grammar rule, modifying, by the multimodal browser, the grammar of the speech engine to include the grammar rule; and if the speech artifact includes a pronunciation rule, modifying, by the multimodal browser, the lexicon of the speech engine to include the pronunciation rule.
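
The branching step in this abstract (grammar rule versus pronunciation rule) can be illustrated with a minimal Python sketch. It is not taken from the patent: the `SpeechEngine` class, the `apply_speech_artifact` function, and the dictionary layout of an artifact are all hypothetical names chosen for illustration, and the grammar and pronunciation strings are made-up sample data.

```python
from dataclasses import dataclass, field


@dataclass
class SpeechEngine:
    """Hypothetical stand-in for the speech engine available to the browser."""
    grammar: list = field(default_factory=list)   # list of grammar rules
    lexicon: dict = field(default_factory=dict)   # word -> pronunciation


def apply_speech_artifact(engine, artifact):
    """Route one artifact read from a media file's metadata container.

    The artifact is assumed to be a dict with a "kind" key; this layout is
    an assumption of the sketch, not something the abstract specifies.
    """
    kind = artifact.get("kind")
    if kind == "grammar_rule":
        # Grammar rules extend the engine's grammar.
        engine.grammar.append(artifact["rule"])
    elif kind == "pronunciation_rule":
        # Pronunciation rules extend the engine's lexicon.
        engine.lexicon[artifact["word"]] = artifact["pronunciation"]
    else:
        raise ValueError(f"unsupported speech artifact: {kind!r}")
    return engine


engine = SpeechEngine()
apply_speech_artifact(engine, {"kind": "grammar_rule", "rule": "$artist = Sade;"})
apply_speech_artifact(
    engine,
    {"kind": "pronunciation_rule", "word": "Sade", "pronunciation": "sh aa d ey"},
)
```

Keeping the grammar and the lexicon as separate containers mirrors the abstract's distinction between the two kinds of artifact; a real engine would expose its own interfaces for both.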

    Speech Enabled Media Sharing In A Multimodal Application
    14.
    Patent application
    Speech Enabled Media Sharing In A Multimodal Application (Granted)

    Publication No.: US20110010180A1

    Publication date: 2011-01-13

    Application No.: US12500029

    Filing date: 2009-07-09

    IPC classification: G10L15/22

    Abstract: Speech enabled media sharing in a multimodal application including parsing, by a multimodal browser, one or more markup documents of a multimodal application; identifying, by the multimodal browser, in the one or more markup documents a web resource for display in the multimodal browser; loading, by the multimodal browser, a web resource sharing grammar that includes keywords for modes of resource sharing and keywords for targets for receipt of web resources; receiving, by the multimodal browser, an utterance matching a keyword for the web resource, a keyword for a mode of resource sharing and a keyword for a target for receipt of the web resource in the web resource sharing grammar thereby identifying the web resource, a mode of resource sharing, and a target for receipt of the web resource; and sending, by the multimodal browser, the web resource to the identified target for the web resource using the identified mode of resource sharing.
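
The matching step described above, in which one utterance identifies a web resource, a sharing mode, and a target, might look roughly like the following Python sketch. It assumes the utterance has already been recognized as text and that the sharing grammar is reduced to three keyword tables; `match_sharing_utterance`, the table names, and the example URL and address are hypothetical and not drawn from the patent.

```python
def match_sharing_utterance(utterance, resources, modes, targets):
    """Return (resource, mode, target) if the utterance names one of each.

    Each argument after the utterance maps a spoken keyword to the value it
    identifies. Returns None when any of the three parts is missing.
    """
    words = utterance.lower().split()

    def first_match(keywords):
        # Scan the utterance left to right for the first known keyword.
        for word in words:
            if word in keywords:
                return keywords[word]
        return None

    found = (first_match(resources), first_match(modes), first_match(targets))
    return found if all(value is not None for value in found) else None


# Hypothetical keyword tables standing in for a web resource sharing grammar.
RESOURCES = {"photo": "https://example.com/photo.jpg"}
MODES = {"email": "email", "text": "sms"}
TARGETS = {"alice": "alice@example.com"}

result = match_sharing_utterance("email the photo to Alice", RESOURCES, MODES, TARGETS)
incomplete = match_sharing_utterance("show the photo", RESOURCES, MODES, TARGETS)
```

Here `result` carries everything a sender would need, while `incomplete` is `None` because the second utterance names neither a sharing mode nor a target.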

    ESTABLISHING A MULTIMODAL ADVERTISING PERSONALITY FOR A SPONSOR OF A MULTIMODAL APPLICATION
    15.
    Patent application
    ESTABLISHING A MULTIMODAL ADVERTISING PERSONALITY FOR A SPONSOR OF A MULTIMODAL APPLICATION (Granted)

    Publication No.: US20110202349A1

    Publication date: 2011-08-18

    Application No.: US13095037

    Filing date: 2011-04-27

    IPC classification: G10L21/00

    CPC classification: G10L21/00 G06Q30/02

    Abstract: Establishing a multimodal advertising personality for a sponsor of a multimodal application, including associating one or more vocal demeanors with a sponsor of a multimodal application and presenting a speech portion of the multimodal application for the sponsor using at least one of the vocal demeanors associated with the sponsor.

    DYNAMICALLY GENERATING A VOCAL HELP PROMPT IN A MULTIMODAL APPLICATION
    16.
    Patent application
    DYNAMICALLY GENERATING A VOCAL HELP PROMPT IN A MULTIMODAL APPLICATION (Pending, published)

    Publication No.: US20120065982A1

    Publication date: 2012-03-15

    Application No.: US13303380

    Filing date: 2011-11-23

    IPC classification: G10L21/00

    Abstract: Dynamically generating a vocal help prompt in a multimodal application that include detecting a help-triggering event for an input element of a VoiceXML dialog, where the detecting is implemented with a multimodal application operating on a multimodal device supporting multiple modes of interaction including a voice mode and one or more non-voice modes, the multimodal application is operatively coupled to a VoiceXML interpreter, and the multimodal application has no static help text. Dynamically generating a vocal help prompt in a multimodal application according to embodiments of the present invention typically also includes retrieving, by the VoiceXML interpreter from a source of help text, help text for an element of a speech recognition grammar, forming by the VoiceXML interpreter the help text into a vocal help prompt, and presenting by the multimodal application the vocal help prompt through a computer user interface to a user.

    SYNCHRONIZING VISUAL AND SPEECH EVENTS IN A MULTIMODAL APPLICATION
    17.
    Patent application
    SYNCHRONIZING VISUAL AND SPEECH EVENTS IN A MULTIMODAL APPLICATION (Granted)

    Publication No.: US20120022875A1

    Publication date: 2012-01-26

    Application No.: US13249717

    Filing date: 2011-09-30

    IPC classification: G10L21/00

    CPC classification: G10L15/1815 G10L2021/105

    Abstract: Exemplary methods, systems, and products are disclosed for synchronizing visual and speech events in a multimodal application, including receiving from a user speech; determining a semantic interpretation of the speech; calling a global application update handler; identifying, by the global application update handler, an additional processing function in dependence upon the semantic interpretation; and executing the additional function. Typical embodiments may include updating a visual element after executing the additional function. Typical embodiments may include updating a voice form after executing the additional function. Typical embodiments also may include updating a state table after updating the voice form. Typical embodiments also may include restarting the voice form after executing the additional function.
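
As a rough illustration of the dispatch described in this abstract, the Python sketch below routes a semantic interpretation through a single global handler that selects and runs an additional processing function, then refreshes a visual element. The interpretation format (a dict with an `action` key), the shopping-cart functions, and the `state` dictionary are assumptions made for this example only; the patent abstract does not specify them.

```python
def add_item(state, interpretation):
    """Hypothetical additional function: put the spoken item in the cart."""
    state["cart"].append(interpretation["item"])


def clear_cart(state, interpretation):
    """Hypothetical additional function: empty the cart."""
    state["cart"].clear()


# Maps the action named by a semantic interpretation to its function.
ADDITIONAL_FUNCTIONS = {"add_item": add_item, "clear": clear_cart}


def global_update_handler(state, interpretation):
    """Identify and execute the additional function for an interpretation.

    Returns True if a function was found and run, False otherwise.
    """
    function = ADDITIONAL_FUNCTIONS.get(interpretation.get("action"))
    if function is None:
        return False
    function(state, interpretation)
    # Update the visual element after executing the additional function.
    state["visual"] = f"{len(state['cart'])} item(s) in cart"
    return True


state = {"cart": [], "visual": ""}
handled = global_update_handler(state, {"action": "add_item", "item": "pizza"})
ignored = global_update_handler(state, {"action": "unknown"})
```

Routing every interpretation through one handler keeps the visual update in a single place, which is one plausible reading of why the abstract calls the handler "global".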

    SYSTEMS AND METHODS FOR INPUTTING GRAPHICAL DATA INTO A GRAPHICAL INPUT FIELD
    18.
    Patent application
    SYSTEMS AND METHODS FOR INPUTTING GRAPHICAL DATA INTO A GRAPHICAL INPUT FIELD (Lapsed)

    Publication No.: US20090199101A1

    Publication date: 2009-08-06

    Application No.: US12363580

    Filing date: 2009-01-30

    IPC classification: G06F3/048 G10L11/00 G06F3/16

    CPC classification: G10L2015/228

    Abstract: A system (20) for inputting graphical data into a graphical input field includes a graphical input device (22) for inputting the graphical data into the graphical input field, and a processor-executable voice-form module (28) responsive to an initial presentation of graphical data to the graphical input device. The voice-form module (28) causes a determination of whether the inputting of the graphical data into the graphical input field is complete. A method for inputting graphical data into a graphical input field includes initiating an input of graphical data via a graphical input device into the graphical input field, and actuating a voice-form module in response to initiating the input of graphical data into the graphical input field.

    CONTEXT-BASED GRAMMARS FOR AUTOMATED SPEECH RECOGNITION
    19.
    Patent application
    CONTEXT-BASED GRAMMARS FOR AUTOMATED SPEECH RECOGNITION (Granted)

    Publication No.: US20130006621A1

    Publication date: 2013-01-03

    Application No.: US13614886

    Filing date: 2012-09-13

    IPC classification: G10L15/00

    CPC classification: G10L2015/228

    Abstract: Methods, apparatus, and computer program products for providing a context-based grammar for automatic speech recognition, including creating by a multimodal application a context, the context comprising words associated with user activity in the multimodal application, and supplementing by the multimodal application a grammar for automatic speech recognition in dependence upon the context.
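
The two steps named in this abstract, creating a context from user activity and supplementing a grammar with it, can be sketched in Python as follows. The sketch assumes a grammar is simply a flat list of recognizable words and that user activity arrives as text strings; `create_context`, `supplement_grammar`, and the sample activity log are hypothetical.

```python
def create_context(activity_log):
    """Collect words from user activity, lowercased and deduplicated in order."""
    context = []
    for entry in activity_log:
        for word in entry.lower().split():
            if word not in context:
                context.append(word)
    return context


def supplement_grammar(grammar, context):
    """Return a new grammar with unseen context words appended."""
    return grammar + [word for word in context if word not in grammar]


grammar = ["play", "stop"]
context = create_context(["Opened album Thriller", "opened playlist"])
extended = supplement_grammar(grammar, context)
```

`supplement_grammar` returns a new list rather than mutating its input, so the base grammar can be reused when the context changes.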

    INVOKING TAPERED PROMPTS IN A MULTIMODAL APPLICATION
    20.
    Patent application
    INVOKING TAPERED PROMPTS IN A MULTIMODAL APPLICATION (Granted)

    Publication No.: US20120166201A1

    Publication date: 2012-06-28

    Application No.: US13410103

    Filing date: 2012-03-01

    IPC classification: G10L11/00

    Abstract: Methods, apparatus, and computer program products are described for invoking tapered prompts in a multimodal application implemented with a multimodal browser and a multimodal application operating on a multimodal device supporting multiple modes of user interaction with the multimodal application, the modes of user interaction including a voice mode and one or more non-voice modes. Embodiments include identifying, by a multimodal browser, a prompt element in a multimodal application; identifying, by the multimodal browser, one or more attributes associated with the prompt element; and playing a speech prompt according to the one or more attributes associated with the prompt element.
