Automatic generation of a callflow statistics application for speech systems
    61.
    发明授权
    Automatic generation of a callflow statistics application for speech systems 失效
    自动生成语音系统的呼叫流统计应用程序

    公开(公告)号:US08005202B2

    公开(公告)日:2011-08-23

    申请号:US11297537

    申请日:2005-12-08

    IPC分类号: H04M3/42

    摘要: A method, system and computer program for automatically generating call flow statistics in a voice application. Embodiments of the present invention address deficiencies of the art in respect to call flow statistics generation systems and provide a novel and non-obvious method, system and computer program product for automatically generating a call flow statistics-generating application and presenting updated statistics on a call flow representation. Various statistics collection points are identified on the visual representation. Upon running of the voice application, call flow statistics are gathered and presented for each statistics collection point. Call identifiers corresponding to each call path can be selected and call paths corresponding to the selected call identifier may be highlighted and their call statistics displayed.

    摘要翻译: 一种用于在语音应用中自动生成呼叫流统计的方法,系统和计算机程序。 本发明的实施例解决了与呼叫流统计生成系统有关的本领域的缺陷,并且提供了一种新颖且非显而易见的方法,系统和计算机程序产品,用于自动生成呼叫流统计生成应用并呈现呼叫上的更新统计信息 流程表示。 在视觉表示上确定了各种统计数据收集点。 在运行语音应用程序时,将收集并显示每个统计信息收集点的呼叫流统计信息。 可以选择对应于每个呼叫路径的呼叫标识符,并且可以突出显示与所选呼叫标识符相对应的呼叫路径,并显示其呼叫统计信息。

    Client / server application task allocation based upon client resources
    62.
    发明授权
    Client / server application task allocation based upon client resources 有权
    基于客户端资源的客户/服务器应用程序任务分配

    公开(公告)号:US07548977B2

    公开(公告)日:2009-06-16

    申请号:US11056493

    申请日:2005-02-11

    IPC分类号: G06F15/173

    CPC分类号: G06F9/505 G06F9/5055

    摘要: A software method for allocating application tasks between a client and a server can include the step of detecting client-based computing resources for executing at least one application task. At least one indicator of the detected client-based computing resources can be conveyed to a remotely located application server, the application server can determine whether to allocate at least one application task to the client or to a server component based upon at least one indicator.

    摘要翻译: 用于在客户机和服务器之间分配应用任务的软件方法可以包括检测基于客户机的计算资源以执行至少一个应用任务的步骤。 检测到的基于客户端的计算资源的至少一个指示符可以被传送到位于远程的应用服务器,所述应用服务器可以基于至少一个指示符来确定是否向客户端或服务器组件分配至少一个应用任务。

    USER POSITIONABLE AUDIO ANCHORS FOR DIRECTIONAL AUDIO PLAYBACK FROM VOICE-ENABLED INTERFACES
    63.
    发明申请
    USER POSITIONABLE AUDIO ANCHORS FOR DIRECTIONAL AUDIO PLAYBACK FROM VOICE-ENABLED INTERFACES 审中-公开
    用户可通过语音播放界面进行方向音频播放的可位置音频锚杆

    公开(公告)号:US20080262847A1

    公开(公告)日:2008-10-23

    申请号:US11737437

    申请日:2007-04-19

    IPC分类号: G10L21/00 G06F3/041

    CPC分类号: G11B27/105 G10L15/26

    摘要: The present invention discloses a concept and a use of audio anchors within voice-enabled interfaces. Audio anchors can be user configurable points from which audio playback occurs. In the invention, a user can identify an interface position at which an audio anchor is to be established. The computing device can determine an anchor direction setting, with values that include forward playback and backward playback. Interface items can then be audibly enumerated from the audio anchor in a direction indicated by the anchor direction setting. For example, if a set of interface items are alphabetically ordered items and if an audio anchor is set at a first item beginning with a letter “G” and an anchor direction is set to indicate backward playback, then the interface items beginning with letters “A-F” can be audibly played in reverse alphabetical order. Additionally, a rate of audio playback can be user adjustable.

    摘要翻译: 本发明公开了在支持语音的接口内的音频锚的概念和用途。 音频锚点可以是发生音频播放的用户可配置点。 在本发明中,用户可以识别要建立音频锚的接口位置。 计算设备可以确定锚方向设置,其值包括前向播放和向后播放。 然后可以从锚定方向设置指示的方向从音频锚点可听见地列举接口项目。 例如,如果一组接口项是按字母排序的项目,并且如果音频锚点被设置在以字母“G”开始的第一项目,并且将锚定方向设置为指示向后播放,则以字母“ AF“可以以相反的字母顺序播放。 此外,音频播放速率可以是用户可调节的。

    SYSTEM AND METHODS FOR PROMPTING USER SPEECH IN MULTIMODAL DEVICES
    64.
    发明申请
    SYSTEM AND METHODS FOR PROMPTING USER SPEECH IN MULTIMODAL DEVICES 有权
    用于在多模式设备中提供用户演讲的系统和方法

    公开(公告)号:US20080162143A1

    公开(公告)日:2008-07-03

    申请号:US11616682

    申请日:2006-12-27

    IPC分类号: G10L21/00

    摘要: A method for prompting user input for a multimodal interface including the steps of providing a multimodal interface to a user, where the interface includes a visual interface having a plurality of input regions, each having at least one input field; selecting an input region and processing a multi-token speech input provided by the user, where the processed speech input includes at least one value for at least one input field of the selected input region; and storing at least one value in at least one input field.

    摘要翻译: 一种用于提示用户输入多模式接口的方法,包括向用户提供多模式接口的步骤,其中该接口包括具有多个输入区域的视觉接口,每个输入区域具有至少一个输入区域; 选择输入区域并处理由用户提供的多令牌语音输入,其中处理的语音输入包括用于所选输入区域的至少一个输入字段的至少一个值; 以及在至少一个输入字段中存储至少一个值。

    Printing to a text-to-speech output device
    65.
    发明申请
    Printing to a text-to-speech output device 有权
    打印到文本到语音输出设备

    公开(公告)号:US20060287860A1

    公开(公告)日:2006-12-21

    申请号:US11156958

    申请日:2005-06-20

    IPC分类号: G10L13/08

    CPC分类号: G10L13/00

    摘要: A method for producing speech output can include the step of selecting a TTS output device from a plurality of available output devices. The selected output device can be associated with outputting content of an application responsive to a print command. According to the method, the print command can be detected, which results in the content of the application being conveyed to the selected TTS output device. The TTS output device can be associated with at least one text-to-speech engine. Upon content conveyance to the TTS output device, at least a portion of the content can be automatically converted using the text-to-speech engine. The speech converted content can be outputted.

    摘要翻译: 用于产生语音输出的方法可以包括从多个可用输出设备中选择TTS输出设备的步骤。 选择的输出设备可以响应于打印命令与输出应用的内容相关联。 根据该方法,可以检测打印命令,这导致应用程序的内容被传送到所选择的TTS输出设备。 TTS输出设备可以与至少一个文本到语音引擎相关联。 当内容传送到TTS输出设备时,可以使用文本到语音引擎自动转换内容的至少一部分。 可以输出语音转换的内容。

    Testing a grammar used in speech recognition for reliability in a plurality of operating environments having different background noise
    66.
    发明授权
    Testing a grammar used in speech recognition for reliability in a plurality of operating environments having different background noise 有权
    在具有不同背景噪声的多个操作环境中测试用于语音识别中的语法的可靠性

    公开(公告)号:US09396721B2

    公开(公告)日:2016-07-19

    申请号:US13289233

    申请日:2011-11-04

    IPC分类号: G10L15/20 G10L15/01

    CPC分类号: G10L15/01

    摘要: Methods, systems, and products for testing a grammar used in speech recognition for reliability in a plurality of operating environments having different background noise that include: receiving recorded background noise for each of the plurality of operating environments; generating a test speech utterance for recognition by a speech recognition engine using a grammar; mixing the test speech utterance with each recorded background noise, resulting in a plurality of mixed test speech utterances, each mixed test speech utterance having different background noise; performing, for each of the mixed test speech utterances, speech recognition using the grammar and the mixed test speech utterance, resulting in speech recognition results for each of the mixed test speech utterances; and evaluating, for each recorded background noise, speech recognition reliability of the grammar in dependence upon the speech recognition results for the mixed test speech utterance having that recorded background noise.

    摘要翻译: 用于在具有不同背景噪声的多个操作环境中测试用于语音识别中的语法的可靠性的方法,系统和产品,包括:为所述多个操作环境中的每一个接收记录的背景噪声; 产生语音识别引擎使用语法进行识别的测试语音语音; 将测试语音发音与每个记录的背景噪声混合,导致多个混合测试语音话语,每个混合测试语音话语具有不同的背景噪声; 对于每个混合测试语音话语,使用语法和混合测试语音话语进行语音识别,导致每个混合测试语音话语的语音识别结果; 并且对于每个记录的背景噪声,根据具有记录的背景噪声的混合测试语音话语的语音识别结果,评估语法的语音识别可靠性。

    SYSTEMS AND METHODS FOR PROMPTING USER SPEECH IN MULTIMODAL DEVICES
    67.
    发明申请
    SYSTEMS AND METHODS FOR PROMPTING USER SPEECH IN MULTIMODAL DEVICES 审中-公开
    用于在多模式设备中提供用户演讲的系统和方法

    公开(公告)号:US20130227417A1

    公开(公告)日:2013-08-29

    申请号:US13847974

    申请日:2013-03-20

    IPC分类号: G06F3/16

    摘要: A method for prompting user input for a multimodal interface including the steps of providing a multimodal interface to a user, where the interface includes a visual interface having a plurality of input regions, each having at least one input field; selecting an input region and processing a multi-token speech input provided by the user, where the processed speech input includes at least one value for at least one input field of the selected input region; and storing at least one value in at least one input field.

    摘要翻译: 一种用于提示用户输入多模式接口的方法,包括向用户提供多模式接口的步骤,其中该接口包括具有多个输入区域的视觉接口,每个输入区域具有至少一个输入区域; 选择输入区域并处理由用户提供的多令牌语音输入,其中处理的语音输入包括用于所选输入区域的至少一个输入字段的至少一个值; 以及在至少一个输入字段中存储至少一个值。

    Dynamically extending the speech prompts of a multimodal application
    68.
    发明授权
    Dynamically extending the speech prompts of a multimodal application 有权
    动态扩展多模式应用程序的语音提示

    公开(公告)号:US08521534B2

    公开(公告)日:2013-08-27

    申请号:US13612014

    申请日:2012-09-12

    IPC分类号: G10L15/00

    摘要: A prompt generation engine operates to dynamically extend prompts of a multimodal application. The prompt generation engine receives a media file having a metadata container. The prompt generation engine operates on a multimodal device that supports a voice mode and a non-voice mode for interacting with the multimodal device. The prompt generation engine retrieves from the metadata container a speech prompt related to content stored in the media file for inclusion in the multimodal application. The prompt generation engine modifies the multimodal application to include the speech prompt.

    摘要翻译: 提示生成引擎操作以动态地扩展多模式应用程序的提示。 提示生成引擎接收具有元数据容器的媒体文件。 提示生成引擎在支持语音模式和非语音模式的多模式设备上进行操作,以与多模式设备进行交互。 提示生成引擎从元数据容器检索与存储在媒体文件中的内容有关的语音提示,以便包含在多模式应用中。 提示生成引擎修改多模式应用程序以包括语音提示。

    System and methods for prompting user speech in multimodal devices
    69.
    发明授权
    System and methods for prompting user speech in multimodal devices 有权
    在多模式设备中提示用户演讲的系统和方法

    公开(公告)号:US08417529B2

    公开(公告)日:2013-04-09

    申请号:US11616682

    申请日:2006-12-27

    IPC分类号: G10L21/00

    摘要: A method for prompting user input for a multimodal interface including the steps of providing a multimodal interface to a user, where the interface includes a visual interface having a plurality of input regions, each having at least one input field; selecting an input region and processing a multi-token speech input provided by the user, where the processed speech input includes at least one value for at least one input field of the selected input region; and storing at least one value in at least one input field.

    摘要翻译: 一种用于提示用户输入多模式接口的方法,包括向用户提供多模式接口的步骤,其中该接口包括具有多个输入区域的视觉接口,每个输入区域具有至少一个输入区域; 选择输入区域并处理由用户提供的多令牌语音输入,其中处理的语音输入包括用于所选输入区域的至少一个输入字段的至少一个值; 以及在至少一个输入字段中存储至少一个值。

    Method and arrangement for managing grammar options in a graphical callflow builder
    70.
    发明授权
    Method and arrangement for managing grammar options in a graphical callflow builder 有权
    用于管理图形调用流构建器中的语法选项的方法和布置

    公开(公告)号:US08355918B2

    公开(公告)日:2013-01-15

    申请号:US13344193

    申请日:2012-01-05

    IPC分类号: G10L15/00

    CPC分类号: G10L2015/228

    摘要: A method (10) in a speech recognition application callflow can include the steps of assigning (11) an individual option and a pre-built grammar to a same prompt, treating (15) the individual option as a valid output of the pre-built grammar if the individual option is a potential valid match to a recognition phrase (12) or an annotation (13) in the pre-built grammar, and treating (14) the individual option as an independent grammar from the pre-built grammar if the individual option fails to be a potential valid match to the recognition phrase or the annotation in the pre-built grammar.

    摘要翻译: 语音识别应用程序调用流程中的方法(10)可以包括以下步骤:将单个选项和预先构建的语法分配给相同的提示,将(15)个别选项视为预先构建的有效输出 如果个人选项是预先构建的语法中的识别短语(12)或注释(13)的潜在有效匹配,则将语法(14)作为独立语法从预先构建的语法处理(14),如果 单个选项不能成为预先构建的语法中的识别短语或注释的潜在有效匹配。