Method and apparatus for speech recognition using second order statistics and linear estimation of cepstral coefficients
    1.
    发明授权
    Method and apparatus for speech recognition using second order statistics and linear estimation of cepstral coefficients 失效
    使用二阶统计学和倒谱系数的线性估计的语音识别方法和装置

    公开(公告)号:US06202047B1

    公开(公告)日:2001-03-13

    申请号:US09050301

    申请日:1998-03-30

    IPC分类号: G10L1514

    摘要: A method and apparatus for speech recognition using second order statistics and linear estimation of cepstral coefficients. In one embodiment, a speech input signal is received and cepstral features are extracted. An answer is generated using the extracted cepstral features and a fixed signal independent diagonal matrix as the covariance matrix for the cepstral components of the speech input signal and, for example, a hidden Markov model. In another embodiment, a noisy speech input signal is received and a cepstral vector representing a clean speech input signal is generated based on the noisy speech input signal and an explicit linear minimum mean square error cepstral estimator.

    摘要翻译: 一种使用二阶统计学和倒谱系数线性​​估计的语音识别的方法和装置。 在一个实施例中,接收语音输入信号并提取倒谱特征。 使用提取的倒谱特征和固定信号独立对角矩阵作为用于语音输入信号的倒谱分量的协方差矩阵和例如隐马尔可夫模型来生成答案。 在另一个实施例中,接收噪声语音输入信号,并且基于噪声语音输入信号和显式线性最小均方误差倒谱估计器产生表示干净语音输入信号的倒谱矢量。

    Recognizing the numeric language in natural spoken dialogue
    4.
    发明授权
    Recognizing the numeric language in natural spoken dialogue 有权
    认识到自然语言对话中的数字语言

    公开(公告)号:US08655658B2

    公开(公告)日:2014-02-18

    申请号:US13280884

    申请日:2011-10-25

    IPC分类号: G10L15/14 G10L15/18

    CPC分类号: G10L15/142

    摘要: A system and a method are provided. A speech recognition processor receives unconstrained input speech and outputs a string of words. The speech recognition processor is based on a numeric language that represents a subset of a vocabulary. The subset includes a set of words identified as being for interpreting and understanding number strings. A numeric understanding processor contains classes of rules for converting the string of words into a sequence of digits. The speech recognition processor utilizes an acoustic model database. A validation database stores a set of valid sequences of digits. A string validation processor outputs validity information based on a comparison of a sequence of digits output by the numeric understanding processor with valid sequences of digits in the validation database.

    摘要翻译: 提供了一种系统和方法。 语音识别处理器接收无约束输入语音并输出一串字。 语音识别处理器基于代表词汇子集的数字语言。 该子集包括被识别为用于解释和理解数字串的一组单词。 数字理解处理器包含用于将字符串转换为数字序列的规则类型。 语音识别处理器利用声学模型数据库。 验证数据库存储一组有效的数字序列。 字符串验证处理器基于数字理解处理器输出的数字序列与验证数据库中的有效数字序列的比较来输出有效性信息。

    Method of generation a labeling guide for spoken dialog services
    9.
    发明授权
    Method of generation a labeling guide for spoken dialog services 有权
    生成口语对话服务标签指南的方法

    公开(公告)号:US07729902B1

    公开(公告)日:2010-06-01

    申请号:US11927738

    申请日:2007-10-30

    IPC分类号: G06F17/27 G06F17/21

    摘要: A method is disclosed for designing a labeling guide for use by a labeler in labeling data used for training a spoken language understanding (SLU) module for an application. The method comprises a labeling guide designer selecting domain-independent actions applicable to an application, selecting domain-dependent objects according to characteristics of the application, and generating a labeling guide using the selected domain-independent actions and selected domain-dependent objects. An advantage of the labeling guide generated in this manner is that the labeling guide designer can easily port the labeling guide to a new application by selecting a set of domain-independent action and then selecting the domain-dependent objects related to the new application.

    摘要翻译: 公开了一种用于设计标签指南的方法,用于标签机用于标记用于训练用于应用的口语理解(SLU)模块的数据。 该方法包括标签指导者设计者,其选择适用于应用的独立于领域的动作,根据应用的特征来选择依赖于域的对象,以及使用所选择的与域无关的动作和选择的域相关对象来生成标签指南。 以这种方式生成的标签指南的优点是,标签指南设计者可以通过选择一组独立于领域的动作,然后选择与新应用相关的域相关对象,轻松地将标签指南移植到新应用。

    SYSTEM FOR HANDLING FREQUENTLY ASKED QUESTIONS IN A NATURAL LANGUAGE DIALOG SERVICE
    10.
    发明申请
    SYSTEM FOR HANDLING FREQUENTLY ASKED QUESTIONS IN A NATURAL LANGUAGE DIALOG SERVICE 审中-公开
    在自然语言对话服务中处理常见问题的系统

    公开(公告)号:US20090070113A1

    公开(公告)日:2009-03-12

    申请号:US12266835

    申请日:2008-11-07

    IPC分类号: G10L15/18

    CPC分类号: G10L15/22 G06F3/167

    摘要: A voice-enabled help desk service is disclosed. The service comprises an automatic speech recognition module for recognizing speech from a user, a spoken language understanding module for understanding the output from the automatic speech recognition module, a dialog management module for generating a response to speech from the user, a natural voices text-to-speech synthesis module for synthesizing speech to generate the response to the user, and a frequently asked questions module. The frequently asked questions module handles frequently asked questions from the user by changing voices and providing predetermined prompts to answer the frequently asked question.

    摘要翻译: 公开了支持语音的帮助台服务。 该服务包括用于识别来自用户的语音的自动语音识别模块,用于理解来自自动语音识别模块的输出的口语语言理解模块,用于生成来自用户对语音的响应的对话管理模块,自然语音文本 - 语音合成模块,用于合成语音以产生对用户的响应,以及常见问题模块。 常见问题模块通过改变语音来处理用户的常见问题,并提供预定的提示来回答常见问题。