Voice user interface authoring tool
    1.
    Granted patent
    Status: In force

    Publication number: US08315874B2

    Publication date: 2012-11-20

    Application number: US11401823

    Filing date: 2006-04-11

    IPC classification: G10L21/00

    CPC classification: G10L2015/228

    Abstract: A voice user interface authoring tool is configured to use categorized example caller responses, from which callflow paths, automatic speech recognition, and natural language processing control files can be generated automatically within a single, integrated authoring user interface. A voice user interface (VUI) design component allows an author to create an application incorporating various types of action nodes, including Prompt/Response Processing (PRP) nodes. At runtime, the system uses the information from each PRP node to prompt a user to say something, and to process the user's response in order to extract its meaning. An Automatic Speech Recognition/Natural Language Processing (ASR/NLP) Control Design component allows the author to associate sample inputs with each possible meaning, and automatically generates the necessary ASR and NLP runtime control files. The VUI design component allows the author to associate the appropriate ASR and NLP control files with each PRP node, and to associate an action node with each possible meaning, as indicated by the NLP control file.
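
    The relationship between PRP nodes, categorized sample responses, and generated control files can be illustrated with a minimal sketch. Everything here is an assumption made for illustration: the `PRPNode` class, the `build_control_files` and `route` helpers, and the toy "control files" (a phrase list and a phrase-to-meaning map) are hypothetical and are not taken from the patent, which does not specify file formats.

```python
from __future__ import annotations
from dataclasses import dataclass, field

@dataclass
class PRPNode:
    """Hypothetical Prompt/Response Processing node (illustrative only)."""
    prompt: str
    samples: dict[str, list[str]]  # meaning -> example caller responses
    next_node: dict[str, str] = field(default_factory=dict)  # meaning -> action node name

def build_control_files(node: PRPNode) -> tuple[list[str], dict[str, str]]:
    """Derive a toy ASR phrase list and a toy NLP phrase-to-meaning map from the samples."""
    asr_phrases = sorted({s.lower() for ex in node.samples.values() for s in ex})
    nlp_map = {s.lower(): meaning for meaning, ex in node.samples.items() for s in ex}
    return asr_phrases, nlp_map

def route(node: PRPNode, utterance: str) -> str | None:
    """Return the action node tied to the utterance's meaning, or None if unrecognized."""
    _, nlp_map = build_control_files(node)
    meaning = nlp_map.get(utterance.strip().lower())
    return node.next_node.get(meaning) if meaning else None

node = PRPNode(
    prompt="Would you like billing or support?",
    samples={"billing": ["billing", "my bill"], "support": ["support", "tech help"]},
    next_node={"billing": "BillingMenu", "support": "SupportMenu"},
)
```

    In this sketch the author edits only the samples and the meaning-to-node links; the recognition and interpretation data are derived from them, which is the integration the abstract describes.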


    Voice user interface authoring tool
    2.
    Patent application
    Status: In force

    Publication number: US20070156406A1

    Publication date: 2007-07-05

    Application number: US11401823

    Filing date: 2006-04-11

    IPC classification: G10L15/18

    CPC classification: G10L2015/228

    Abstract: A voice user interface authoring tool is configured to use categorized example caller responses, from which callflow paths, automatic speech recognition, and natural language processing control files can be generated automatically within a single, integrated authoring user interface. A voice user interface (VUI) design component allows an author to create an application incorporating various types of action nodes, including Prompt/Response Processing (PRP) nodes. At runtime, the system uses the information from each PRP node to prompt a user to say something, and to process the user's response in order to extract its meaning. An Automatic Speech Recognition/Natural Language Processing (ASR/NLP) Control Design component allows the author to associate sample inputs with each possible meaning, and automatically generates the necessary ASR and NLP runtime control files. The VUI design component allows the author to associate the appropriate ASR and NLP control files with each PRP node, and to associate an action node with each possible meaning, as indicated by the NLP control file.


    Method and apparatus for executing tasks in voice-activated command systems
    3.
    Granted patent
    Status: In force

    Publication number: US07460999B2

    Publication date: 2008-12-02

    Application number: US10939605

    Filing date: 2004-09-13

    IPC classification: G10L21/06

    CPC classification: H04M3/42204 H04M1/271

    Abstract: A method of executing operations in a voice-activated command system includes automatically initiating execution of a default operation. A user is then prompted, after the default operation has been initiated, to determine whether the user wishes to execute a second operation instead of the default operation. If the user wishes to execute the second operation instead of the default operation, execution of the default operation is terminated and execution of the second operation is initiated. In voice-activated and other command systems, such as voice dialing systems, this method allows the command system to execute the most probable operation without delay, while still making the system easily navigable by naïve users. Systems, computer-readable media, and apparatus that implement the methods of the present invention are also disclosed.
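
    The control flow in this abstract (start the default first, ask afterwards, switch only on request) can be sketched as follows. The function name `run_with_default`, the `wants_second` callback, and the event log are hypothetical names chosen for illustration; a real system would start and cancel telephony operations rather than append strings.

```python
def run_with_default(default_op, second_op, wants_second, log):
    """Start the default operation immediately, then offer the alternative.

    `wants_second` is a hypothetical callable standing in for the spoken prompt;
    it returns True if the user asks for the second operation.
    """
    log.append(f"start {default_op}")       # no delay for the most probable choice
    if wants_second():                       # user is prompted only after the start
        log.append(f"stop {default_op}")    # terminate the default operation
        log.append(f"start {second_op}")    # and initiate the second one
        return second_op
    return default_op

events = []
chosen = run_with_default("dial office", "dial mobile", lambda: True, events)
```

    The design choice is that the common case costs no extra prompt latency, while the less common case costs one cancelled operation.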


    Automated follow-up call in a telephone interaction system
    4.
    Granted patent
    Status: Lapsed

    Publication number: US08111821B2

    Publication date: 2012-02-07

    Application number: US11077882

    Filing date: 2005-03-11

    Applicant: David G. Ollason

    Inventor: David G. Ollason

    IPC classification: H04M3/42 H04M3/00

    Abstract: A follow-up call to a user is made after completion of a first call with a voice user interface module operable on a computer. The voice user interface module inquires about information communicated in the first call.


    Method and apparatus for automatic grammar generation from data entries
    5.
    Granted patent
    Status: Lapsed

    Publication number: US07636657B2

    Publication date: 2009-12-22

    Application number: US11007880

    Filing date: 2004-12-09

    IPC classification: G06F17/21

    CPC classification: G10L15/063 G10L15/193

    Abstract: A method of generating an optimized grammar, for use in speech recognition, from a data set or large list of items, is disclosed. The method includes the steps of obtaining a tree representing items in the data set, and generating the grammar using the tree. The tree data structure is a simulated recognition search tree that represents the items in the data set and can be automatically generated from the data set.
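
    As a rough illustration of the tree-then-grammar idea, the sketch below builds a plain word-level prefix tree and emits a grammar string in which shared prefixes are factored out. This is a stand-in, not the patent's method: the abstract describes a simulated recognition search tree, whose construction is not given here. The function names, the `<end>` marker, and the `( a | b )` output notation are all assumptions.

```python
def build_tree(items):
    """Build a word-level prefix tree; '<end>' marks where a complete item ends."""
    root = {}
    for item in items:
        node = root
        for word in item.split():
            node = node.setdefault(word, {})
        node["<end>"] = {}
    return root

def to_grammar(node):
    """Emit a grammar string, sharing common prefixes between items."""
    alts = []
    for word, child in sorted(node.items()):
        if word == "<end>":
            alts.append("")                      # an item may end at this point
        else:
            rest = to_grammar(child)
            alts.append(f"{word} {rest}" if rest else word)
    if len(alts) == 1:
        return alts[0]
    return "(" + " | ".join(a if a else "<null>" for a in alts) + ")"

grammar = to_grammar(build_tree(["john smith", "john stone", "mary jones"]))
```

    Here `grammar` is `(john (smith | stone) | mary jones)`: the word `john` appears once instead of twice, which is the kind of compaction a tree-derived grammar allows over a flat list of alternatives.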


    Context retention across multiple calls in a telephone interaction system
    6.
    Granted patent
    Status: In force

    Publication number: US07623651B2

    Publication date: 2009-11-24

    Application number: US10938714

    Filing date: 2004-09-10

    IPC classification: H04M3/00 H04M5/00

    Abstract: A method of providing information to a user in a telephone interactive system includes receiving a new call. A comparison is then made between an identifier associated with the new call and stored call information pertaining to previous calls. If the identifier associated with the new call matches an identifier associated with a previous call, a subsequent action taken in the new call is based on context information stored from the previous call.
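
    A minimal sketch of the matching step, assuming the identifier is a caller number and the stored context is a small dictionary. The `ContextStore` class, its method names, and the `last_task` key are hypothetical; the abstract does not say what the identifier or the stored context consists of.

```python
class ContextStore:
    """Hypothetical store of per-caller context from earlier calls."""

    def __init__(self):
        self._by_caller = {}

    def end_call(self, caller_id, context):
        """Save context when a call finishes."""
        self._by_caller[caller_id] = context

    def start_call(self, caller_id):
        """Pick the first action of a new call from any stored context."""
        context = self._by_caller.get(caller_id)
        if context is None:
            return "main_menu"                     # no match: treat as a fresh caller
        return f"resume:{context['last_task']}"   # match: act on the earlier call

store = ContextStore()
store.end_call("+1-555-0100", {"last_task": "flight_status"})
```

    A returning caller with the same identifier is offered the earlier task, while an unknown identifier falls through to the normal entry point.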


    Method and apparatus for robustly locating user barge-ins in voice-activated command systems
    7.
    Granted patent
    Status: In force

    Publication number: US07624016B2

    Publication date: 2009-11-24

    Application number: US10897800

    Filing date: 2004-07-23

    IPC classification: G10L21/00

    CPC classification: G10L15/22

    Abstract: A method of querying a user to select from a list in a voice-activated command system is provided. The method includes generating command prompt phrases during which the user can select items on the list. The command prompt phrases include an item on the list and an index for another item on the list. In some embodiments, each command prompt phrase also includes a period of silence between the item on the list and the index for another item on the list. If a barge-in by the user is received during a particular command prompt phrase, the corresponding item on the list is selected.
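
    One way to read the phrase structure is that each prompt phrase consists of an item, a pause, and the spoken index of the next item, so a slightly late barge-in still falls inside the phrase of the item the user meant. The sketch below follows that reading. The durations, the assumption that the first index is spoken before the first phrase, and the function names are all illustrative and not taken from the patent.

```python
def build_phrases(items, item_secs=1.5, silence_secs=0.5, index_secs=0.5):
    """Return (start, end, item) windows, one per command prompt phrase.

    Each window covers: the item, a silence, and the index of the NEXT item.
    Assumes the index of the first item is spoken before the first window.
    """
    phrases = []
    start = index_secs
    for i, item in enumerate(items):
        has_next = i + 1 < len(items)
        length = item_secs + silence_secs + (index_secs if has_next else 0.0)
        phrases.append((start, start + length, item))
        start += length
    return phrases

def locate_barge_in(phrases, t):
    """Map a barge-in time (seconds from prompt start) to the selected item."""
    for begin, end, item in phrases:
        if begin <= t < end:
            return item
    return None

phrases = build_phrases(["Alice", "Bob", "Carol"])
```

    With these assumed timings, a barge-in at 2.8 s arrives while the index "2" is being spoken but still selects Alice, because that index belongs to Alice's phrase.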


    Speech recognition enhanced caller identification
    8.
    Granted patent
    Status: In force

    Publication number: US07852993B2

    Publication date: 2010-12-14

    Application number: US10638902

    Filing date: 2003-08-11

    IPC classification: H04M1/64

    Abstract: A process for collecting the identity of a telephone caller is disclosed. In one embodiment, a personalized Context Free Grammar (CFG) is created for each potential call recipient, and is configured to support identification of incoming callers utilizing voice recognition. Each CFG incorporates an indication of high-probability callers, and the probability weights in each CFG are altered accordingly. When a recipient receives a call, the relevant CFG is applied in association with a voice recognition application to enable at least a preliminary identification of the caller. In accordance with another embodiment, the caller confirms identifications. In accordance with one embodiment, standard caller-ID functionality is utilized if possible at least to assist in the caller identification process. In accordance with still another embodiment, voice recognition enhanced caller identification is utilized to provide intelligent call routing functionality.
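
    The effect of per-recipient probability weights can be shown with a toy calculation in which a flat name list stands in for the CFG. The boost factor, the normalization, the multiplication of acoustic score by weight, and the function names are assumptions for illustration only; the abstract does not state how weights are altered or combined with recognition scores.

```python
def build_weights(all_names, frequent_callers, boost=5.0):
    """Give each name a prior weight, boosted for this recipient's frequent callers."""
    raw = {name: (boost if name in frequent_callers else 1.0) for name in all_names}
    total = sum(raw.values())
    return {name: w / total for name, w in raw.items()}

def identify(weights, acoustic_scores):
    """Pick the name maximizing (acoustic score x recipient-specific weight)."""
    return max(acoustic_scores, key=lambda n: acoustic_scores[n] * weights.get(n, 0.0))

weights = build_weights(["Ann Lee", "Anne Leigh", "Bob Ray"], {"Anne Leigh"})
best = identify(weights, {"Ann Lee": 0.55, "Anne Leigh": 0.45})
```

    In this example the two similar-sounding names are nearly tied acoustically, and the recipient-specific weight tips the preliminary identification toward the caller who phones this recipient often.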


    Speech recognition application or server using iterative recognition constraints
    9.
    Granted patent
    Status: In force

    Publication number: US07809567B2

    Publication date: 2010-10-05

    Application number: US10897817

    Filing date: 2004-07-23

    IPC classification: G10L15/18

    Abstract: A speech recognition application includes a recognition module configured to receive input utterances and an application module configured to use output from a first recognition iteration to select a recognition result for a second iteration. In one embodiment, the application module eliminates a previously rejected recognition result or results from the N-Best list for recognition. In another embodiment, the application module rescores N-Best entries based upon N-Best lists or information from another iteration. In another illustrated embodiment, the application module uses a limited grammar from a current N-Best list for subsequent recognition, for example for re-recognition using a recorded input from a previous iteration.
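
    The first embodiment (dropping previously rejected results from the N-Best list) can be sketched in a few lines. The representation of the N-Best list as `(text, score)` pairs and the function name `next_hypothesis` are assumptions for illustration; rescoring and limited-grammar re-recognition are not covered by this sketch.

```python
def next_hypothesis(n_best, rejected):
    """Return the best-scoring hypothesis the user has not already rejected."""
    for text, _score in sorted(n_best, key=lambda pair: pair[1], reverse=True):
        if text not in rejected:
            return text
    return None  # every candidate was rejected; the application must re-prompt

n_best = [("Austin", 0.6), ("Boston", 0.3), ("Houston", 0.1)]
first = next_hypothesis(n_best, set())
second = next_hypothesis(n_best, {"Austin"})
```

    After the user says "no" to `Austin`, the second iteration proposes `Boston` instead of repeating the same misrecognition.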


    Method and apparatus to improve name confirmation in voice-dialing systems
    10.
    Granted patent
    Status: In force

    Publication number: US07475017B2

    Publication date: 2009-01-06

    Application number: US10900051

    Filing date: 2004-07-27

    IPC classification: G10L15/00 H04M1/64

    CPC classification: G10L15/22 H04M3/42204

    Abstract: A method of providing voice dialing assistance includes providing a first input to a speech recognition engine, with the first input corresponding to a speech sample provided by a caller attempting to reach an intended call recipient. A speech recognition output is generated in response to the first input. A potential call recipient is identified based upon the speech recognition output. A confirmation that the potential call recipient is the intended call recipient is implemented using a personal recording made by the potential call recipient.
