Interactive computer system recognizing spoken commands
    1.
    Granted patent
    Status: Expired

    Publication No.: US5664061A

    Publication date: 1997-09-02

    Application No.: US462735

    Filing date: 1995-06-05

    Abstract: An interactive computer system having a processor executing a target computer program, and having a speech recognizer for converting an utterance into a command signal for the target computer program. The target computer program has a series of active program states occurring over a series of time periods. At least a first active-state image is displayed for a first active state occurring during a first time period. At least one object displayed in the first active-state image is identified, and a list of one or more first active-state commands identifying functions which can be performed in the first active state of the target computer program is generated from the identified object. A first active-state vocabulary of acoustic command models for the first active state comprises the acoustic command models from a system vocabulary representing the first active-state commands. A speech recognizer measures the value of at least one feature of an utterance during each of a series of successive time intervals within the first time period to produce a series of feature signals. The measured feature signals are compared to each of the acoustic command models in the first active-state vocabulary to generate a match score for the utterance and each acoustic command model. The speech recognizer outputs a command signal corresponding to the command model from the first active-state vocabulary having the best match score.
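
The flow in this abstract (identify displayed objects, derive the active-state commands, restrict the vocabulary, pick the best-scoring model) can be illustrated with a minimal Python sketch. This is not the patent's implementation: the names `CommandModel`, `match_score`, `active_vocabulary`, and `recognize` are hypothetical, each acoustic command model is reduced to a toy list of numbers, and the match score is assumed to be a negative mean squared distance.

```python
from dataclasses import dataclass
from typing import Dict, List, Sequence


@dataclass(frozen=True)
class CommandModel:
    """Hypothetical stand-in for an acoustic command model."""
    command: str
    template: Sequence[float]  # toy feature template, an assumption


def match_score(features: Sequence[float], model: CommandModel) -> float:
    """Toy score (assumed): negative mean squared distance, higher is better."""
    n = min(len(features), len(model.template))
    if n == 0:
        return float("-inf")
    return -sum((f - t) ** 2 for f, t in zip(features, model.template)) / n


def active_vocabulary(
    displayed_objects: List[str],
    object_commands: Dict[str, List[str]],
    system_vocabulary: Dict[str, CommandModel],
) -> List[CommandModel]:
    """Collect the models for commands offered by the objects now on screen."""
    commands: List[str] = []
    for obj in displayed_objects:
        for cmd in object_commands.get(obj, []):
            if cmd in system_vocabulary and cmd not in commands:
                commands.append(cmd)
    return [system_vocabulary[c] for c in commands]


def recognize(features: Sequence[float], vocabulary: List[CommandModel]) -> str:
    """Return the command whose model has the best match score."""
    if not vocabulary:
        raise ValueError("active-state vocabulary is empty")
    return max(vocabulary, key=lambda m: match_score(features, m)).command


# Illustrative data only: three commands in the system vocabulary,
# but the displayed "file_menu" object offers just two of them.
system_vocabulary = {
    "open": CommandModel("open", [1.0, 1.0, 1.0]),
    "close": CommandModel("close", [0.0, 0.0, 0.0]),
    "print": CommandModel("print", [1.0, 1.0, 0.9]),
}
object_commands = {"file_menu": ["open", "close"]}
utterance_features = [1.0, 1.0, 0.9]

active = active_vocabulary(["file_menu"], object_commands, system_vocabulary)
result_active = recognize(utterance_features, active)
result_full = recognize(utterance_features, list(system_vocabulary.values()))
```

In this toy data the utterance is closest to "print", but "print" is not available in the current state, so restricting the comparison to the active-state vocabulary yields "open" instead. The sketch is meant to show this one effect: models for commands that cannot be performed in the current state are never candidates.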


    MVC (Model-View-Controller) based multi-modal authoring tool and development environment
    2.
    Patent application
    Status: Granted

    Publication No.: US20050273759A1

    Publication date: 2005-12-08

    Application No.: US11190572

    Filing date: 2005-07-27

    IPC classification: G06F9/44 G06F7/00

    CPC classification: G06F8/38

    Abstract: Application development tools and methods for building multi-channel, multi-device, and multi-modal applications, and in particular systems and methods for developing applications whereby a user can interact in parallel with the same information via a multiplicity of channels and user interfaces, while unified, synchronized views of the information are presented across the various channels or devices deployed by the user to interact with the information. In a preferred embodiment, application frameworks and development tools are based on an MVC (Model-View-Controller) design paradigm that is adapted to provide synchronized multi-modal interactions. Multi-channel authoring can be developed using a similar methodology.
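
As a rough illustration of the synchronized multi-modal idea, the following Python sketch keeps one shared model and re-renders every attached view whenever any channel changes it. It is an assumption-laden toy, not the application's framework: `InteractionModel`, `TextView`, `SpeechView`, and `Controller` are hypothetical names, and the "views" only build strings.

```python
from typing import Dict, List, Protocol


class View(Protocol):
    def render(self, data: Dict[str, str]) -> None: ...


class InteractionModel:
    """Single shared state; every attached view is refreshed on each change."""

    def __init__(self) -> None:
        self._data: Dict[str, str] = {}
        self._views: List[View] = []

    def attach(self, view: View) -> None:
        self._views.append(view)
        view.render(dict(self._data))  # bring the new view up to date

    def update(self, field: str, value: str) -> None:
        self._data[field] = value
        for view in self._views:
            view.render(dict(self._data))  # pass a copy to each view


class TextView:
    """Toy visual channel: renders the state as 'field=value' pairs."""

    def __init__(self) -> None:
        self.rendered = ""

    def render(self, data: Dict[str, str]) -> None:
        self.rendered = ", ".join(f"{k}={v}" for k, v in sorted(data.items()))


class SpeechView:
    """Toy voice channel: renders the state as a spoken-style prompt."""

    def __init__(self) -> None:
        self.prompt = ""

    def render(self, data: Dict[str, str]) -> None:
        self.prompt = " ".join(f"{k} is {v}." for k, v in sorted(data.items()))


class Controller:
    """Routes input from any channel into the one shared model."""

    def __init__(self, model: InteractionModel) -> None:
        self.model = model

    def handle(self, channel: str, field: str, value: str) -> None:
        # The channel name is informational here; all input updates the same model.
        self.model.update(field, value)


model = InteractionModel()
text_view, speech_view = TextView(), SpeechView()
model.attach(text_view)
model.attach(speech_view)

controller = Controller(model)
controller.handle("voice", "city", "Boston")   # spoken input
controller.handle("gui", "date", "Monday")     # typed input
```

After the two inputs, both views reflect both fields regardless of which channel supplied them, which is the synchronization property the abstract describes.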
