Scalable low resource dialog manager
    1.
    发明授权
    Scalable low resource dialog manager 有权
    可扩展的低资源对话管理器

    公开(公告)号:US06513009B1

    公开(公告)日:2003-01-28

    申请号:US09460961

    申请日:1999-12-14

    IPC分类号: G10L2100

    CPC分类号: G10L15/22

    摘要: A spoken language interface between a user and at least one application or system includes a dialog manager operatively coupled to the application or system, an audio input system, an audio output system, a speech decoding engine and a speech synthesizing engine; and at least one user interface data set operatively coupled to the dialog manager, the user interface data set representing spoken language interface elements and data recognizable by the application. The dialog manager enables connection between the input audio system and the speech decoding engine such that a spoken utterance provided by the user is provided from the input audio system to the speech decoding engine. The speech decoding engine decodes the spoken utterance to generate a decoded output which is returned to the dialog manager. The dialog manager uses the decoded output to search the user interface data set for a corresponding spoken language interface element and data which is returned to the dialog manager when found, and provides the spoken language interface element associated data to the application for processing in accordance therewith. The application, on processing that element, provides a reference to an interface element to be spoken. The dialog manager enables connection between the audio output system and the speech synthesizing engine such that the speech synthesizing engine which, accepting data from that element, generates a synthesized output that expresses that element, the audio output system audibly presenting the synthesized output to the user.

    摘要翻译: 用户与至少一个应用或系统之间的口语界面包括可操作地耦合到应用或系统的对话管理器,音频输入系统,音频输出系统,语音解码引擎和语音合成引擎; 以及可操作地耦合到对话管理器的至少一个用户界面数据集,表示语言界面元素的用户界面数据集和应用可识别的数据。 对话管理器使得输入音频系统和语音解码引擎之间能够连接,使得由用户提供的讲话话语从输入音频系统提供给语音解码引擎。 语音解码引擎对口语发音进行解码以产生被返回给对话管理器的解码输出。 对话管理器使用解码的输出来搜索用于相应的语言接口元素的用户界面数据集和在发现对话管理器时返回到对话管理器的数据,并且将口语语言接口元素相关联的数据提供给应用以进行处理 。 在处理该元素时,该应用程序提供了要使用的接口元素的引用。 该对话管理器使音频输出系统与语音合成引擎之间能够连接,从而使从该元件接收数据的语音合成引擎生成表示该元素的合成输出,音频输出系统向用户可听地呈现合成输出 。

    Methods and apparatus for contingent transfer and execution of spoken language interfaces
    2.
    发明授权
    Methods and apparatus for contingent transfer and execution of spoken language interfaces 失效
    口语接口的或有转移和执行的方法和装置

    公开(公告)号:US07024363B1

    公开(公告)日:2006-04-04

    申请号:US09460913

    申请日:1999-12-14

    IPC分类号: G10L21/00

    CPC分类号: G10L15/22

    摘要: A method for managing spoken language interface data structures and collections of user interface service engines in a spoken language dialog manager in a personal speech assistant. Interfaces, designed as part of applications, may by these methods be added to or removed from the set of such interfaces used by a dialog manager. Interface service engines, required by new applications, but not already present in the dialog manager, may be made available to the new and subsequently added applications.

    摘要翻译: 一种用于在个人语音助理中的口语对话管理器中管理口语接口数据结构和用户界面服务引擎的集合的方法。 作为应用程序的一部分设计的接口可以通过这些方法添加到或从对话管理器使用的一组这样的接口中删除。 新应用程序所需的但不在对话管理器中的接口服务引擎可以被提供给新的和随后添加的应用程序。

    Personal speech assistant supporting a dialog manager
    3.
    发明授权
    Personal speech assistant supporting a dialog manager 失效
    支持对话管理员的个人演讲助理

    公开(公告)号:US06748361B1

    公开(公告)日:2004-06-08

    申请号:US09460077

    申请日:1999-12-14

    IPC分类号: G10L1522

    CPC分类号: G10L15/28

    摘要: A Personal Speech Assistant (PSA) is a computing apparatus which provides a spoken language interface to another apparatus to which it is attached by supporting execution of a conversational dialog manager and its supporting service engines. In operation, a PSA is connected to a device which provides some service to a user. Any “appliance” is a candidate for enhancement with the PSA. Devices such as, for example, video cassette recorders (VCRs) or Personal Digital Assistants (PDAs), which offer rich, but frequently difficult interfaces, may be made more useful by the integration of a PSA according to the invention. It is a preferred feature of a dialog manager used by the PSA that the user interface properties, in terms of the vocabulary the device understands, the informative prompts it provides, and other aspects of its conversational behavior, are all easily modified to correspond to the preferences or limitations of the user.

    摘要翻译: 个人语音助理(PSA)是一种通过支持会话对话管理器及其支持服务引擎的执行来向附加的另一设备提供口语语言接口的计算设备。 在操作中,PSA连接到向用户提供一些服务的设备。 任何“家电”都是使用PSA进行增强的候选人。 通过集成根据本发明的PSA,可提供诸如例如提供丰富而且经常困难的界面的诸如录像机(VCR)或个人数字助理(PDA)的设备更有用。 PSA使用的对话管理器的优选特征是,用户接口属性(根据设备理解的词汇表),其提供的信息提示以及其会话行为的其他方面都是容易地被修改为对应于 用户的偏好或限制。

    Methods and Apparatus for Buffering Data for Use in Accordance with a Speech Recognition System
    4.
    发明申请
    Methods and Apparatus for Buffering Data for Use in Accordance with a Speech Recognition System 有权
    用于缓冲数据的方法和装置,用于语音识别系统

    公开(公告)号:US20080172228A1

    公开(公告)日:2008-07-17

    申请号:US12056001

    申请日:2008-03-26

    IPC分类号: G10L15/06

    CPC分类号: G10L15/28

    摘要: Techniques are disclosed for overcoming errors in speech recognition systems. For example, a technique for processing acoustic data in accordance with a speech recognition system comprises the following steps/operations. Acoustic data is obtained in association with the speech recognition system. The acoustic data is recorded using a combination of a first buffer area and a second buffer area, such that the recording of the acoustic data using the combination of the two buffer areas at least substantially minimizes one or more truncation errors associated with operation of the speech recognition system.

    摘要翻译: 公开了用于克服语音识别系统中的错误的技术。 例如,根据语音识别系统处理声学数据的技术包括以下步骤/操作。 与语音识别系统相关联地获得声学数据。 使用第一缓冲区域和第二缓冲区域的组合记录声学数据,使得使用两个缓冲区域的组合的声学数据的记录至少基本上最小化与语音操作相关联的一个或多个截断误差 识别系统。

    Methods and apparatus for buffering data for use in accordance with a speech recognition system
    5.
    发明授权
    Methods and apparatus for buffering data for use in accordance with a speech recognition system 有权
    用于根据语音识别系统缓冲数据的方法和装置

    公开(公告)号:US08781832B2

    公开(公告)日:2014-07-15

    申请号:US12056001

    申请日:2008-03-26

    IPC分类号: G10L15/04

    CPC分类号: G10L15/28

    摘要: Techniques are disclosed for overcoming errors in speech recognition systems. For example, a technique for processing acoustic data in accordance with a speech recognition system comprises the following steps/operations. Acoustic data is obtained in association with the speech recognition system. The acoustic data is recorded using a combination of a first buffer area and a second buffer area, such that the recording of the acoustic data using the combination of the two buffer areas at least substantially minimizes one or more truncation errors associated with operation of the speech recognition system.

    摘要翻译: 公开了用于克服语音识别系统中的错误的技术。 例如,根据语音识别系统处理声学数据的技术包括以下步骤/操作。 与语音识别系统相关联地获得声学数据。 使用第一缓冲区域和第二缓冲区域的组合记录声学数据,使得使用两个缓冲区域的组合的声学数据的记录至少基本上最小化与语音操作相关联的一个或多个截断误差 识别系统。

    Methods and apparatus for buffering data for use in accordance with a speech recognition system
    6.
    发明授权
    Methods and apparatus for buffering data for use in accordance with a speech recognition system 有权
    用于根据语音识别系统缓冲数据的方法和装置

    公开(公告)号:US07962340B2

    公开(公告)日:2011-06-14

    申请号:US11209004

    申请日:2005-08-22

    IPC分类号: G10L15/04 G10L17/00 G10L11/06

    CPC分类号: G10L15/28

    摘要: Techniques are disclosed for overcoming errors in speech recognition systems. For example, a technique for processing acoustic data in accordance with a speech recognition system comprises the following steps/operations. Acoustic data is obtained in association with the speech recognition system. The acoustic data is recorded using a combination of a first buffer area and a second buffer area, such that the recording of the acoustic data using the combination of the two buffer areas at least substantially minimizes one or more truncation errors associated with operation of the speech recognition system.

    摘要翻译: 公开了用于克服语音识别系统中的错误的技术。 例如,根据语音识别系统处理声学数据的技术包括以下步骤/操作。 与语音识别系统相关联地获得声学数据。 使用第一缓冲区域和第二缓冲区域的组合记录声学数据,使得使用两个缓冲区域的组合的声学数据的记录至少基本上最小化与语音操作相关联的一个或多个截断误差 识别系统。

    System and method for scaling video
    7.
    发明授权
    System and method for scaling video 失效
    用于缩放视频的系统和方法

    公开(公告)号:US5790714A

    公开(公告)日:1998-08-04

    申请号:US332961

    申请日:1994-11-01

    IPC分类号: H04N1/393 G06T3/40 G06K9/32

    CPC分类号: G06T3/4007

    摘要: Scaling of video is performed using area weighted averaging of input pixels to calculate coefficients to multiply with luminescence and crominence of input pixels. Such coefficients are produced for both the vertical and horizontal scaling directions of the input video stream. When scaling down or scaling up, scaling is first performed in the vertical direction to produce partially scaled pixels, which are then utilized for scaling in the horizontal direction. When scaling up, a pre-interpolation or pre-replication process is utilized to double the inputted pixel grid which doubled pixel grid is then utilized to scale down to the desired pixel grid size, which is greater than the originally inputted pixel grid size.

    摘要翻译: 使用输入像素的区域加权平均来执行视频的缩放,以计算与输入像素的发光和色度相乘的系数。 对于输入视频流的垂直和水平缩放方向都产生这样的系数。 当缩小或缩小时,首先在垂直方向上执行缩放以产生部分缩放的像素,然后将其用于水平方向上的缩放。 当放大时,使用预插入或预复制过程将输入的像素网格加倍,然后将双倍像素网格用于缩小到大于原始输入的像素网格尺寸的期望像素网格大小。