Records disambiguation in a multimodal application operating on a multimodal device
    31.
    发明授权
    Records disambiguation in a multimodal application operating on a multimodal device 有权
    记录在多模式设备上运行的多模式应用程序中的歧义

    公开(公告)号:US09349367B2

    公开(公告)日:2016-05-24

    申请号:US12109167

    申请日:2008-04-24

    摘要: Methods, apparatus, and products are disclosed for record disambiguation in a multimodal application operating on a multimodal device, the multimodal device supporting multiple modes of interaction including at least a voice mode and a visual mode, that include: prompting, by the multimodal application, a user to identify a particular record among a plurality of records; receiving, by the multimodal application in response to the prompt, a voice utterance from the user; determining, by the multimodal application, that the voice utterance ambiguously identifies more than one of the plurality of records; generating, by the multimodal application, a user interaction to disambiguate the records ambiguously identified by the voice utterance in dependence upon record attributes of the records ambiguously identified by the voice utterance; and selecting, by the multimodal application for further processing, one of the records ambiguously identified by the voice utterance in dependence upon the user interaction.

    摘要翻译: 公开了用于在多模式设备上操作的多模式应用中的记录消歧的方法,装置和产品,所述多模式设备支持包括至少语音模式和视觉模式的多种交互模式,其包括:由多模式应用提示, 用户识别多个记录中的特定记录; 由多模式应用程序响应于该提示,接收来自用户的语音发声; 由所述多模式应用程序确定所述语音发音含糊地识别所述多​​个记录中的多于一个的记录; 由多模式应用程序产生用户交互,以消除由声音话语模糊识别的记录,依赖于由语音话语模糊识别的记录的记录属性; 以及通过多模式应用程序进行进一步处理,根据用户交互,通过语音话语模糊识别的记录之一。

    Method and system for defining standard catch styles for speech application code generation
    32.
    发明授权
    Method and system for defining standard catch styles for speech application code generation 有权
    用于定义语音应用程序代码生成的标准捕获样式的方法和系统

    公开(公告)号:US08799001B2

    公开(公告)日:2014-08-05

    申请号:US10715316

    申请日:2003-11-17

    IPC分类号: G10L15/22

    CPC分类号: G10L13/027

    摘要: A method and system for defining standard catch styles used in generating speech application code for managing catch events, in which a style-selection menu that allows for selection of one or more catch styles is presented. Each catch style represents a system response to a catch event. A catch style can be selected from the style-selection menu. For each selected catch style, the system can prepare a response for each catch event. If the selected catch style requires playing a new audio message in response to a particular catch event, a contextual message can be entered in one or more text fields. The contextual message entered in each text field corresponds to the new audio message that will be played in response to the particular catch event. In certain catch styles, the entered contextual message is different for each catch event, while in other catch styles, the entered contextual message is the same for each catch event. Finally, if the selected catch style does not require playing of a new audio message in response to a particular catch event, the system can replay the system prompt.

    摘要翻译: 一种用于定义用于生成用于管理捕捉事件的语音应用程序代码的标准捕获样式的方法和系统,其中呈现允许选择一个或多个捕捉样式的样式选择菜单。 每个catch样式表示对catch事件的系统响应。 可以从样式选择菜单中选择捕捉样式。 对于每个选定的捕捉样式,系统可以为每个捕获事件准备响应。 如果选择的捕捉样式需要响应于特定的捕获事件播放新的音频消息,则可以在一个或多个文本字段中输入上下文消息。 在每个文本字段中输入的上下文消息对应于将响应于特定捕获事件而播放的新的音频消息。 在某些catch样式中,输入的上下文消息对于每个catch事件是不同的,而在其他catch样式中,输入的上下文消息对于每个catch事件是相同的。 最后,如果所选抓取样式不需要播放响应于特定捕获事件的新音频消息,则系统可以重播系统提示。

    TESTING A GRAMMAR USED IN SPEECH RECOGNITION FOR RELIABILITY IN A PLURALITY OF OPERATING ENVIRONMENTS HAVING DIFFERENT BACKGROUND NOISE
    33.
    发明申请
    TESTING A GRAMMAR USED IN SPEECH RECOGNITION FOR RELIABILITY IN A PLURALITY OF OPERATING ENVIRONMENTS HAVING DIFFERENT BACKGROUND NOISE 有权
    测试在具有不同背景噪声的多种操作环境中可靠性的语音识别中使用的灰度

    公开(公告)号:US20120053934A1

    公开(公告)日:2012-03-01

    申请号:US13289233

    申请日:2011-11-04

    IPC分类号: G10L15/20

    CPC分类号: G10L15/01

    摘要: Methods, systems, and products for testing a grammar used in speech recognition for reliability in a plurality of operating environments having different background noise that include: receiving recorded background noise for each of the plurality of operating environments; generating a test speech utterance for recognition by a speech recognition engine using a grammar; mixing the test speech utterance with each recorded background noise, resulting in a plurality of mixed test speech utterances, each mixed test speech utterance having different background noise; performing, for each of the mixed test speech utterances, speech recognition using the grammar and the mixed test speech utterance, resulting in speech recognition results for each of the mixed test speech utterances; and evaluating, for each recorded background noise, speech recognition reliability of the grammar in dependence upon the speech recognition results for the mixed test speech utterance having that recorded background noise.

    摘要翻译: 用于在具有不同背景噪声的多个操作环境中测试用于语音识别中的语法的可靠性的方法,系统和产品,包括:为所述多个操作环境中的每一个接收记录的背景噪声; 产生语音识别引擎使用语法进行识别的测试语音语音; 将测试语音发音与每个记录的背景噪声混合,导致多个混合测试语音话语,每个混合测试语音话语具有不同的背景噪声; 对于每个混合测试语音话语,使用语法和混合测试语音话语进行语音识别,导致每个混合测试语音话语的语音识别结果; 并且对于每个记录的背景噪声,根据具有记录的背景噪声的混合测试语音话语的语音识别结果,评估语法的语音识别可靠性。

    Testing a grammar used in speech recognition for reliability in a plurality of operating environments having different background noise
    34.
    发明授权
    Testing a grammar used in speech recognition for reliability in a plurality of operating environments having different background noise 有权
    在具有不同背景噪声的多个操作环境中测试用于语音识别中的语法的可靠性

    公开(公告)号:US08082148B2

    公开(公告)日:2011-12-20

    申请号:US12109204

    申请日:2008-04-24

    IPC分类号: G10L15/20

    CPC分类号: G10L15/01

    摘要: Methods, systems, and products for testing a grammar used in speech recognition for reliability in a plurality of operating environments having different background noise that include: receiving recorded background noise for each of the plurality of operating environments; generating a test speech utterance for recognition by a speech recognition engine using a grammar; mixing the test speech utterance with each recorded background noise, resulting in a plurality of mixed test speech utterances, each mixed test speech utterance having different background noise; performing, for each of the mixed test speech utterances, speech recognition using the grammar and the mixed test speech utterance, resulting in speech recognition results for each of the mixed test speech utterances; and evaluating, for each recorded background noise, speech recognition reliability of the grammar in dependence upon the speech recognition results for the mixed test speech utterance having that recorded background noise.

    摘要翻译: 用于在具有不同背景噪声的多个操作环境中测试用于语音识别中的语法的可靠性的方法,系统和产品,包括:为所述多个操作环境中的每一个接收记录的背景噪声; 产生语音识别引擎使用语法进行识别的测试语音语音; 将测试语音发音与每个记录的背景噪声混合,导致多个混合测试语音话语,每个混合测试语音话语具有不同的背景噪声; 对于每个混合测试语音话语,使用语法和混合测试语音话语进行语音识别,导致每个混合测试语音话语的语音识别结果; 并且对于每个记录的背景噪声,根据具有记录的背景噪声的混合测试语音话语的语音识别结果来评估语法的语音识别可靠性。

    Method and system for switching between prototype and real code production in a graphical call flow builder
    35.
    发明授权
    Method and system for switching between prototype and real code production in a graphical call flow builder 有权
    用于在图形调用流构建器中切换原型和实际代码生成的方法和系统

    公开(公告)号:US07797676B2

    公开(公告)日:2010-09-14

    申请号:US10827852

    申请日:2004-04-20

    CPC分类号: G06F8/34 G06Q10/06

    摘要: A method and system for automated code generation in a call flow builder (10) can include a display coupled to a processor. The processor can be programmed to select a real code (database connection) or a prototype code using a graphical interface (20) to provide a selected code and develop a call flow using the selected code. The processor can be programmed to select the prototype code as the selected code, test the call flow in a local development environment and further enable the switching of the selected code from the prototype to the real code to complete a database connection. The processor can be further programmed to enable specification of a default or range of values. Additionally, the processor can be programmed to use a database connection code that replaces a prototype assignment of values to variables when the real code is the selected code.

    摘要翻译: 在呼叫流程构建器(10)中用于自动代码生成的方法和系统可以包括耦合到处理器的显示器。 处理器可以被编程为使用图形界面(20)选择真实代码(数据库连接)或原型代码,以提供所选择的代码并使用所选择的代码开发呼叫流程。 处理器可以编程为选择原型代码作为所选代码,测试本地开发环境中的调用流程,并进一步使所选代码从原型切换到实际代码以完成数据库连接。 处理器可以被进一步编程以使得能够指定默认值或范围值。 此外,当实际代码是所选择的代码时,处理器可以被编程为使用数据库连接代码来代替值的原型分配给变量。

    Method and system for automatic generation and testing of voice applications
    36.
    发明授权
    Method and system for automatic generation and testing of voice applications 有权
    自动生成和测试语音应用的方法和系统

    公开(公告)号:US07787598B2

    公开(公告)日:2010-08-31

    申请号:US11170120

    申请日:2005-06-29

    IPC分类号: H04M1/24 H04M3/08 H04M3/22

    CPC分类号: H04M3/323 H04M3/493 H04Q1/45

    摘要: A method (100) and system (30) to enable automatic generation and testing of voice applications includes generating (102) a test driver application (TDA) (32) and generating (104) a modified original voice application (34) to be tested by the TDA within a call flow builder (10). The modified application can include or generate (106) “test hooks” or more particularly DTMF tones and DTMF grammars that can be used to synchronize the modified original voice application with the TDA. The TDA can test (110) all possible paths of the modified original voice application. Note the TDA and the modified original voice application can be generated and/or tested (112) in a test environment within the call flow builder or a telephony environment. The TDA can be automatically generated (108) to exercise all possible flows where the DTMF tones define the current state and location of the modified application.

    摘要翻译: 实现语音应用的自动生成和测试的方法(100)和系统(30)包括生成(102)测试驱动器应用(TDA)(32)并生成(104)待测试的修改的原始语音应用(34) 由TDA在呼叫流程构建器(10)内。 经修改的应用程序可以包括或生成(106)“测试挂钩”,或更具体地可以用于将修改的原始语音应用与TDA同步的DTMF音和DTMF语法。 TDA可以测试(110)修改的原始语音应用程序的所有可能路径。 请注意,TDA和修改的原始语音应用程序可以在呼叫流程构建器或电话环境中的测试环境中生成和/或测试(112)。 TDA可以被自动生成(108)来运行所有可能的流,其中DTMF音定义了修改后的应用的当前状态和位置。

    Testing A Grammar Used In Speech Recognition For Reliability In A Plurality Of Operating Environments Having Different Background Noise
    37.
    发明申请
    Testing A Grammar Used In Speech Recognition For Reliability In A Plurality Of Operating Environments Having Different Background Noise 有权
    在具有不同背景噪声的多种操作环境中测试用于语音识别中的可用性的语法

    公开(公告)号:US20090271189A1

    公开(公告)日:2009-10-29

    申请号:US12109204

    申请日:2008-04-24

    IPC分类号: G10L15/00

    CPC分类号: G10L15/01

    摘要: Methods, systems, and products for testing a grammar used in speech recognition for reliability in a plurality of operating environments having different background noise that include: receiving recorded background noise for each of the plurality of operating environments; generating a test speech utterance for recognition by a speech recognition engine using a grammar; mixing the test speech utterance with each recorded background noise, resulting in a plurality of mixed test speech utterances, each mixed test speech utterance having different background noise; performing, for each of the mixed test speech utterances, speech recognition using the grammar and the mixed test speech utterance, resulting in speech recognition results for each of the mixed test speech utterances; and evaluating, for each recorded background noise, speech recognition reliability of the grammar in dependence upon the speech recognition results for the mixed test speech utterance having that recorded background noise.

    摘要翻译: 用于在具有不同背景噪声的多个操作环境中测试用于语音识别中的语法的可靠性的方法,系统和产品,包括:为所述多个操作环境中的每一个接收记录的背景噪声; 产生语音识别引擎使用语法进行识别的测试语音语音; 将测试语音发音与每个记录的背景噪声混合,导致多个混合测试语音话语,每个混合测试语音话语具有不同的背景噪声; 对于每个混合测试语音话语,使用语法和混合测试语音话语进行语音识别,导致每个混合测试语音话语的语音识别结果; 并且对于每个记录的背景噪声,根据具有记录的背景噪声的混合测试语音话语的语音识别结果,评估语法的语音识别可靠性。

    Adjusting A Speech Engine For A Mobile Computing Device Based On Background Noise
    38.
    发明申请
    Adjusting A Speech Engine For A Mobile Computing Device Based On Background Noise 有权
    基于背景噪声调整移动计算设备的语音引擎

    公开(公告)号:US20090271188A1

    公开(公告)日:2009-10-29

    申请号:US12109151

    申请日:2008-04-24

    IPC分类号: G10L15/00

    CPC分类号: G10L21/0208 G10L15/20

    摘要: Methods, apparatus, and products are disclosed for adjusting a speech engine for a mobile computing device based on background noise, the mobile computing device operatively coupled to a microphone, that include: sampling, through the microphone, background noise for a plurality of operating environments in which the mobile computing device operates; generating, for each operating environment, a noise model in dependence upon the sampled background noise for that operating environment; and configuring the speech engine for the mobile computing device with the noise model for the operating environment in which the mobile computing device currently operates.

    摘要翻译: 公开了用于基于背景噪声调整用于移动计算设备的语音引擎的方法,装置和产品,该移动计算设备可操作地耦合到麦克风,其包括:通过麦克风对多个操作环境的背景噪声进行采样 其中移动计算设备运行; 根据所述操作环境的采样背景噪声,为每个操作环境产生噪声模型; 以及为移动计算设备当前操作的操作环境的噪声模型配置移动计算设备的语音引擎。

    Compressed list presentation for speech user interfaces
    39.
    发明授权
    Compressed list presentation for speech user interfaces 有权
    用于语音用户界面的压缩列表显示

    公开(公告)号:US07289962B2

    公开(公告)日:2007-10-30

    申请号:US09894608

    申请日:2001-06-28

    IPC分类号: G10L15/18 G10L11/00 G06F17/28

    摘要: A list presentation method. The list presentation method can include the steps of: dynamically grouping selected items in a list based on sequentially positioned symbols in the items which are common to one another; labeling each group of selected items; audibly presenting each group label through a speech user interface; and, responsive to a selection of one of the presented group labels, presenting through the speech user interface items in a group corresponding to the selected group label.

    摘要翻译: 列表演示方法。 列表呈现方法可以包括以下步骤:基于彼此相同的项目中的顺序定位的符号来动态地对列表中的选定项目进行分组; 标示每组所选项目; 通过语音用户界面可听见地呈现每个组标签; 并且响应于所呈现的组标签之一的选择,通过语音用户界面呈现与所选择的组标签相对应的组中的项目。

    Method and system for testing sections of large speech applications
    40.
    发明申请
    Method and system for testing sections of large speech applications 有权
    用于测试大型语音应用程序的方法和系统

    公开(公告)号:US20070129947A1

    公开(公告)日:2007-06-07

    申请号:US11292833

    申请日:2005-12-02

    IPC分类号: G10L15/18

    CPC分类号: G06F11/3688

    摘要: Embodiments in accordance with the invention can include a new method (500) and system (100) for testing code within a speech application. A test file (101) can be automatically generated to verify the functionality of a new section of code (172) presented within a graphical call flow builder application (156). In one arrangement, a user can specify through a wizard two points on a path identifying the code section to be tested. The wizard can generate a test file (101) and can configure a path (151) to a new subpath (152) and automatically assign predetermined values to graphical call flow prompts along the path. In this manner, the new code section is reached under the same path conditions for allowing repeatable testing. The system can include a test harness (110) configured to test a new code section from within a context of the speech application, and a test controller (120) for transitioning to the new code section. The test controller can run the test harness within the speech application to evaluate a functionality of the new code section.

    摘要翻译: 根据本发明的实施例可以包括用于测试语音应用内的代码的新方法(500)和系统(100)。 可以自动地生成测试文件(101)以验证在图形呼叫流程构建器应用(156)内呈现的新的代码段(172)的功能。 在一种安排中,用户可以通过向导指定标识要测试的代码段的路径上的两个点。 向导可以生成测试文件(101),并且可以将路径(151)配置到新的子路径(152),并自动将预定值分配给沿着路径的图形呼叫流提示。 以这种方式,在相同的路径条件下达到新的代码段以允许可重复的测试。 该系统可以包括被配置为在语音应用的上下文内测试新的代码段的测试线束(110)以及用于转换到新的代码段的测试控制器(120)。 测试控制器可以在语音应用程序中运行测试工具,以评估新代码部分的功能。