Text grouping for disambiguation in a speech application
    41.
    发明申请
    Text grouping for disambiguation in a speech application 审中-公开
    在语音应用中消除歧义的文本分组

    公开(公告)号:US20060136195A1

    公开(公告)日:2006-06-22

    申请号:US11022466

    申请日:2004-12-22

    CPC classification number: G10L15/187

    Abstract: A method, system and apparatus for text grouping in a disambiguation process. A text grouping method for use in a disambiguation process can include producing a phonetic representation for each entry in a text list, sorting the list according to the phonetic representation, grouping phonetically similar entries in the list, and providing the sorted list with the groupings to the disambiguation process. The producing step can include producing a phonetic representation for each word in the text list. The producing step also can include producing a phonetic representation for each phrase in the text list.

    Abstract translation: 消歧过程中文本分组的方法,系统和装置。 用于消歧过程的文本分组方法可以包括为文本列表中的每个条目生成语音表示,根据语音表示对该列表进行排序,在该列表中分组语音上相似的条目,以及将排序的列表分组到 消歧过程 生成步骤可以包括为文本列表中的每个单词产生语音表示。 制作步骤还可以包括为文本列表中的每个短语产生语音表示。

    Method and system for selecting speech or DTMF interfaces or a mixture of both
    42.
    发明申请
    Method and system for selecting speech or DTMF interfaces or a mixture of both 失效
    用于选择语音或DTMF接口或两者的混合的方法和系统

    公开(公告)号:US20050169440A1

    公开(公告)日:2005-08-04

    申请号:US11026720

    申请日:2004-12-30

    CPC classification number: C23C16/4486 C23C16/4481 C23C16/4485 C23C16/52

    Abstract: A wizard that from a fixed design can create various audio interfaces. The generated interfaces can be speech only, DTMF only, or various mixed speech and DTMF UIs. When specifying both speech and DTMF prompts, a number of combinations of these interfaces could be automatically generated. Robust speech recognition systems can be built by automatically generating a “shadow” DTMF application. The DTMF application will perform the same task as the primary speech application; however the transfer to a DTMF application could be done explicitly by the user, or could be transferred automatically (either a temporary or permanent transition) at a point in the call flow where there was a problem with the speech recognition.

    Abstract translation: 从固定设计的向导可以创建各种音频接口。 生成的接口可以是仅语音,仅限DTMF,或者各种混合语音和DTMF UI。 当指定语音和DTMF提示时,可以自动生成这些接口的多个组合。 可以通过自动生成“影子”DTMF应用来构建强大的语音识别系统。 DTMF应用程序将执行与主要语音应用程序相同的任务; 然而,转移到DTMF应用程序可以由用户明确地完成,或者可以在语音识别出现问题的呼叫流程中的一个点自动传输(暂时或永久转换)。

    Method and system for defining standard catch styles for speech application code generation
    45.
    发明申请
    Method and system for defining standard catch styles for speech application code generation 有权
    用于定义语音应用程序代码生成的标准捕获样式的方法和系统

    公开(公告)号:US20050108015A1

    公开(公告)日:2005-05-19

    申请号:US10715316

    申请日:2003-11-17

    CPC classification number: G10L13/027

    Abstract: A method and system for defining standard catch styles used in generating speech application code for managing catch events, in which a style-selection menu that allows for selection of one or more catch styles is presented. Each catch style represents a system response to a catch event. A catch style can be selected from the style-selection menu. For each selected catch style, the system can prepare a response for each catch event. If the selected catch style requires playing a new audio message in response to a particular catch event, a contextual message can be entered in one or more text fields. The contextual message entered in each text field corresponds to the new audio message that will be played in response to the particular catch event. In certain catch styles, the entered contextual message is different for each catch event, while in other catch styles, the entered contextual message is the same for each catch event. Finally, if the selected catch style does not require playing of a new audio message in response to a particular catch event, the system can replay the system prompt.

    Abstract translation: 一种用于定义用于生成用于管理捕捉事件的语音应用程序代码的标准捕获样式的方法和系统,其中呈现允许选择一个或多个捕捉样式的样式选择菜单。 每个catch样式表示对catch事件的系统响应。 可以从样式选择菜单中选择捕捉样式。 对于每个选定的捕捉样式,系统可以为每个捕获事件准备响应。 如果选择的捕捉样式需要响应于特定的捕获事件播放新的音频消息,则可以在一个或多个文本字段中输入上下文消息。 在每个文本字段中输入的上下文消息对应于将响应于特定捕获事件而播放的新的音频消息。 在某些catch样式中,输入的上下文消息对于每个catch事件是不同的,而在其他catch样式中,输入的上下文消息对于每个catch事件是相同的。 最后,如果所选抓取样式不需要播放响应于特定捕获事件的新音频消息,则系统可以重播系统提示。

    Adjusting a speech engine for a mobile computing device based on background noise
    46.
    发明授权
    Adjusting a speech engine for a mobile computing device based on background noise 有权
    基于背景噪声调整移动计算设备的语音引擎

    公开(公告)号:US09076454B2

    公开(公告)日:2015-07-07

    申请号:US13358097

    申请日:2012-01-25

    CPC classification number: G10L21/0208 G10L15/20

    Abstract: Methods, apparatus, and products are disclosed for adjusting a speech engine for a mobile computing device based on background noise, the mobile computing device operatively coupled to a microphone, that include: sampling, through the microphone, background noise for a plurality of operating environments in which the mobile computing device operates; generating, for each operating environment, a noise model in dependence upon the sampled background noise for that operating environment; and configuring the speech engine for the mobile computing device with the noise model for the operating environment in which the mobile computing device currently operates.

    Abstract translation: 公开了用于基于背景噪声调整用于移动计算设备的语音引擎的方法,装置和产品,该移动计算设备可操作地耦合到麦克风,其包括:通过麦克风对多个操作环境的背景噪声进行采样 其中移动计算设备运行; 根据所述操作环境的采样背景噪声,为每个操作环境产生噪声模型; 以及为移动计算设备当前操作的操作环境的噪声模型配置移动计算设备的语音引擎。

    Speech enabled media sharing in a multimodal application
    47.
    发明授权
    Speech enabled media sharing in a multimodal application 有权
    在多模式应用程序中启用语音启用媒体共享

    公开(公告)号:US08510117B2

    公开(公告)日:2013-08-13

    申请号:US12500029

    申请日:2009-07-09

    CPC classification number: G06F17/30923 G06F17/30861 G10L15/26

    Abstract: Speech enabled media sharing in a multimodal application including parsing, by a multimodal browser, one or more markup documents of a multimodal application; identifying, by the multimodal browser, in the one or more markup documents a web resource for display in the multimodal browser; loading, by the multimodal browser, a web resource sharing grammar that includes keywords for modes of resource sharing and keywords for targets for receipt of web resources; receiving, by the multimodal browser, an utterance matching a keyword for the web resource, a keyword for a mode of resource sharing and a keyword for a target for receipt of the web resource in the web resource sharing grammar thereby identifying the web resource, a mode of resource sharing, and a target for receipt of the web resource; and sending, by the multimodal browser, the web resource to the identified target for the web resource using the identified mode of resource sharing.

    Abstract translation: 在多模式应用程序中启用语音启用媒体共享,包括通过多模式浏览器解析多模式应用程序的一个或多个标记文档; 由多模式浏览器在一个或多个标记文档中识别用于在多模式浏览器中显示的网络资源; 由多模式浏览器加载包括资源共享模式的关键字和用于接收网络资源的目标的关键字的网络资源共享语法; 通过多模式浏览器接收与web资源匹配的关键词,用于资源共享模式的关键字和用于在web资源共享语法中接收web资源的目标的关键字,从而识别web资源, 资源共享模式,以及Web资源接收目标; 以及使用所识别的资源共享模式,将多个模式浏览器将web资源发送到所识别的web资源的目标。

    Methods and system for creating and editing an XML-based speech synthesis document
    48.
    发明授权
    Methods and system for creating and editing an XML-based speech synthesis document 失效
    用于创建和编辑基于XML的语音合成文档的方法和系统

    公开(公告)号:US08265936B2

    公开(公告)日:2012-09-11

    申请号:US12132412

    申请日:2008-06-03

    CPC classification number: G10L13/08 G10L15/26

    Abstract: A method for creating and editing an XML-based speech synthesis document for input to a text-to-speech engine is provided. The method includes recording voice utterances of a user reading a pre-selected text and parsing the recorded voice utterances into individual words and periods of silence. The method also includes recording a synthesized speech output generated by a text-to-speech engine, the synthesized speech output being an audible rendering of the pre-selected text, and parsing the synthesized speech output into individual words and periods of silence. The method further includes annotating the XML-based speech synthesis document based upon a comparison of the recorded voice utterances and the recorded synthesized speech output.

    Abstract translation: 提供了一种用于创建和编辑用于输入到文本到语音引擎的基于XML的语音合成文档的方法。 该方法包括记录读取预先选择的文本的用户的语音话语,并将记录的语音话语解析为单独的单词和静音时段。 该方法还包括记录由文本到语音引擎生成的合成语音输出,合成语音输出是预选文本的可听渲染,以及将合成的语音输出解析为单独的单词和静音时段。 该方法还包括基于记录的语音发音和所记录的合成语音输出的比较来注释基于XML的语音合成文档。

    Performing a safety analysis for user-defined voice commands to ensure that the voice commands do not cause speech recognition ambiguities
    49.
    发明授权
    Performing a safety analysis for user-defined voice commands to ensure that the voice commands do not cause speech recognition ambiguities 有权
    对用户定义的语音命令执行安全分析,以确保语音命令不会导致语音识别模糊

    公开(公告)号:US08234120B2

    公开(公告)日:2012-07-31

    申请号:US11460075

    申请日:2006-07-26

    CPC classification number: G10L15/075

    Abstract: The present invention discloses a solution for assuring user-defined voice commands are unambiguous. The solution can include a step of identifying a user attempt to enter a user-defined voice command into a voice-enabled system. A safety analysis can be performed on the user-defined voice command to determine a likelihood that the user-defined voice command will be confused with preexisting voice commands recognized by the voice-enabled system. When a high likelihood of confusion is determined by the safety analysis, a notification can be presented that the user-defined voice command is subject to confusion. A user can then define a different voice command or can choose to continue to use the potentially confusing command, possibly subject to a system imposed confusion mitigating condition or action.

    Abstract translation: 本发明公开了用于确定用户定义的语音命令的解决方案是明确的。 解决方案可以包括识别用户尝试将用户定义的语音命令输入到启用语音的系统中的步骤。 可以对用户定义的语音命令执行安全性分析,以确定用户定义的语音命令将与由支持语音的系统识别的预先存在的语音命令相混淆的可能性。 当通过安全性分析确定高混淆可能性时,可以呈现用户定义的语音命令受到混淆的通知。 然后,用户可以定义不同的语音命令,或者可以选择继续使用可能混淆的命令,这可能受制于系统的混淆减轻条件或动作。

    Signaling correspondence between a meeting agenda and a meeting discussion
    50.
    发明授权
    Signaling correspondence between a meeting agenda and a meeting discussion 失效
    会议议程和会议讨论之间的信号通信

    公开(公告)号:US08214242B2

    公开(公告)日:2012-07-03

    申请号:US12109227

    申请日:2008-04-24

    CPC classification number: G06Q10/109 G06Q10/1095

    Abstract: Signaling correspondence between a meeting agenda and a meeting discussion includes: receiving a meeting agenda specifying one or more topics for a meeting; analyzing, for each topic, one or more documents to identify topic keywords for that topic; receiving meeting discussions among participants for the meeting; identifying a current topic for the meeting in dependence upon the meeting agenda; determining a correspondence indicator in dependence upon the meeting discussions and the topic keywords for the current topic, the correspondence indicator specifying the correspondence between the meeting agenda and the meeting discussion; and rendering the correspondence indicator to the participants of the meeting.

    Abstract translation: 会议议程和会议讨论之间的信号通信包括:收到会议议程,列出会议的一个或多个主题; 为每个主题分析一个或多个文档以标识该主题的主题关键字; 接受会议与会者的会议讨论; 根据会议议程确定会议目前的议题; 根据会议讨论和当前主题的主题关键词确定一个通信指标,指定会议议程和会议讨论之间的对应关系的对应指标; 并将通信指标提交给会议与会者。

Patent Agency Ranking