Noise cancellation system and method
    3.
    Granted invention patent
    Status: in force

    Publication number: US08296135B2

    Publication date: 2012-10-23

    Application number: US12270218

    Filing date: 2008-11-13

    IPC classification: G10L21/02

    CPC classification: G10L15/20 G10L21/0208

    Abstract: A noise cancellation apparatus includes a noise estimation module for receiving a noise-containing input speech, and estimating a noise therefrom to output the estimated noise; a first Wiener filter module for receiving the input speech, and applying a first Wiener filter thereto to output a first estimation of clean speech; a database for storing data of a Gaussian mixture model for modeling clean speech; and an MMSE estimation module for receiving the first estimation of clean speech and the data of the Gaussian mixture model to output a second estimation of clean speech. The apparatus further includes a final clean speech estimation module for receiving the second estimation of clean speech from the MMSE estimation module and the estimated noise from the noise estimation module, and obtaining a final Wiener filter gain therefrom to output a final estimation of clean speech by applying the final Wiener filter gain.
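
The final stage described in this abstract derives a Wiener filter gain from a clean-speech estimate and a noise estimate. The abstract does not give the gain formula, so the following is only a minimal sketch that assumes the textbook per-bin gain G = S / (S + N) on power spectra. The function names, the `floor` parameter, and the list-of-floats representation are illustrative assumptions, not taken from the patent.

```python
def wiener_gain(clean_power, noise_power, floor=1e-3):
    """Per-bin Wiener gain G = S / (S + N).

    clean_power: estimated clean-speech power per frequency bin
    noise_power: estimated noise power per frequency bin
    floor: hypothetical lower bound on the gain (an assumption, used here
           only to avoid zeroing bins completely)
    """
    gains = []
    for s, n in zip(clean_power, noise_power):
        denom = s + n
        g = s / denom if denom > 0 else 0.0
        gains.append(max(g, floor))
    return gains


def apply_gain(noisy_spectrum, gains):
    """Scale each bin of the noisy spectrum by its gain."""
    return [g * x for g, x in zip(gains, noisy_spectrum)]
```

In a pipeline shaped like the one in the abstract, `clean_power` would come from the second (MMSE) clean-speech estimate and `noise_power` from the noise estimation module.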

    Human speech recognition apparatus and method
    4.
    Granted invention patent
    Status: in force

    Publication number: US08185393B2

    Publication date: 2012-05-22

    Application number: US12334032

    Filing date: 2008-12-12

    IPC classification: G10L15/10

    CPC classification: G10L15/187 G10L2015/025

    Abstract: A speech recognition apparatus generates a feature vector series corresponding to a speech signal, and recognizes a phoneme series corresponding to the feature vector series using sounds corresponding to phonemes and a phoneme language model. In addition, the speech recognition apparatus recognizes vocabulary that corresponds to the recognized phoneme series. At this time, the phoneme language model represents connection relationships between the phonemes, and is modeled according to time-variant characteristics of the phonemes.

    Viterbi decoder and speech recognition method using same using non-linear filter for observation probabilities
    6.
    Granted invention patent
    Status: in force

    Publication number: US08332222B2

    Publication date: 2012-12-11

    Application number: US12506719

    Filing date: 2009-07-21

    IPC classification: G10L15/00

    CPC classification: G10L15/08 G10L15/142

    Abstract: A Viterbi decoder includes: an observation vector sequence generator for generating an observation vector sequence by converting an input speech to a sequence of observation vectors; a local optimal state calculator for obtaining a partial state sequence having a maximum similarity up to a current observation vector as an optimal state; an observation probability calculator for obtaining, as a current observation probability, a probability for observing the current observation vector in the optimal state; a buffer for storing therein a specific number of previous observation probabilities; a non-linear filter for calculating a filtered probability by using the previous observation probabilities stored in the buffer and the current observation probability; and a maximum likelihood calculator for calculating a partial maximum likelihood by using the filtered probability. The filtered probability may be a maximum value, a mean value or a median value of the previous observation probabilities and the current observation probability.
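
The buffer and non-linear filter in this abstract can be illustrated with a small sketch: keep a fixed number of previous observation probabilities and return the maximum, mean, or median of those values together with the current one. The class name, the `history` and `mode` parameters, and the choice to buffer the unfiltered probabilities are assumptions made for illustration; the abstract does not specify them.

```python
from collections import deque
from statistics import mean, median


class ObservationFilter:
    """Filter a stream of observation probabilities over a short window."""

    def __init__(self, history=2, mode="median"):
        # Buffer of the most recent raw (unfiltered) probabilities.
        # Storing raw rather than filtered values is an assumption.
        self.buffer = deque(maxlen=history)
        self.reduce = {"max": max, "mean": mean, "median": median}[mode]

    def __call__(self, current_prob):
        # Combine the buffered previous probabilities with the current one.
        window = list(self.buffer) + [current_prob]
        filtered = self.reduce(window)
        self.buffer.append(current_prob)
        return filtered
```

A decoder built along these lines would use the filtered value in place of the raw observation probability when it updates the partial maximum likelihood.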

    Speech recognition system for mobile terminal
    8.
    Granted invention patent
    Status: in force

    Publication number: US07856356B2

    Publication date: 2010-12-21

    Application number: US11642132

    Filing date: 2006-12-20

    IPC classification: G10L15/14

    CPC classification: G10L15/142 G10L2015/025

    Abstract: A speech recognition system for a mobile terminal includes an acoustic variation channel unit and a pronunciation variation channel unit. The acoustic variation channel unit transforms a speech signal into feature parameters and Viterbi-decodes the speech signal to produce a varied phoneme sequence by using the feature parameters and predetermined models. Further, the pronunciation variation channel unit Viterbi-decodes the varied phoneme sequence to produce a word phoneme sequence by using the varied phoneme sequence and a preset DHMM (Discrete Hidden Markov Model) based context-dependent error model.
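
Both channel units in this abstract rely on Viterbi decoding. As a generic illustration only (not the patent's context-dependent error model), here is a minimal log-domain Viterbi decoder for a discrete HMM. The function name and the dictionary-based parameter layout are assumptions, and the sketch expects at least one observation.

```python
import math


def viterbi(observations, states, start_p, trans_p, emit_p):
    """Return the most likely state path of a discrete HMM.

    start_p[s], trans_p[r][s], and emit_p[s][o] are plain probabilities;
    scores are accumulated in the log domain to avoid underflow.
    """
    def log(p):
        return math.log(p) if p > 0 else float("-inf")

    # Initialise with the first observation.
    scores = {s: log(start_p[s]) + log(emit_p[s][observations[0]])
              for s in states}
    backpointers = []

    for obs in observations[1:]:
        new_scores, pointers = {}, {}
        for s in states:
            # Best predecessor state for reaching s at this step.
            prev = max(states, key=lambda r: scores[r] + log(trans_p[r][s]))
            new_scores[s] = (scores[prev] + log(trans_p[prev][s])
                             + log(emit_p[s][obs]))
            pointers[s] = prev
        scores = new_scores
        backpointers.append(pointers)

    # Trace back from the best final state.
    path = [max(states, key=lambda s: scores[s])]
    for pointers in reversed(backpointers):
        path.append(pointers[path[-1]])
    path.reverse()
    return path
```

In a two-stage arrangement like the one described, the first pass would emit a phoneme sequence that then serves as the observation sequence for a second pass of this kind.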

    SPEECH RECOGNITION SYSTEM AND METHOD
    10.
    Invention patent application
    Status: in force

    Publication number: US20100161326A1

    Publication date: 2010-06-24

    Application number: US12506705

    Filing date: 2009-07-21

    IPC classification: G10L15/20

    Abstract: A speech recognition system includes: a speed level classifier for measuring a moving speed of a moving object by using a noise signal at an initial time of speech recognition to determine a speed level of the moving object; a first speech enhancement unit for enhancing sound quality of an input speech signal of the speech recognition by using a Wiener filter, if the speed level of the moving object is equal to or lower than a specific level; and a second speech enhancement unit for enhancing the sound quality of the input speech signal by using a Gaussian mixture model, if the speed level of the moving object is higher than the specific level. The system further includes an end point detection unit for detecting start and end points, and an elimination unit for eliminating sudden noise components based on a sudden noise Gaussian mixture model.
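
The switching rule in this abstract (Wiener filter at or below a specific speed level, Gaussian mixture model above it) can be sketched as a simple dispatcher. How the speed level is derived from the initial noise signal is not stated in the abstract, so mapping noise energy to a level through fixed thresholds is purely an assumption here, as are all names and parameters.

```python
from bisect import bisect_right


def classify_speed_level(noise_energy, level_bounds):
    """Map an initial noise-energy measurement to a speed level.

    level_bounds is a hypothetical ascending list of energy thresholds;
    the result is the number of thresholds at or below noise_energy.
    """
    return bisect_right(level_bounds, noise_energy)


def select_enhancer(speed_level, switch_level):
    """Pick the enhancement path following the rule in the abstract."""
    # At or below the switch level: Wiener filter; above it: GMM-based.
    return "wiener" if speed_level <= switch_level else "gmm"
```

For example, with hypothetical bounds `[1.0, 4.0]` and a switch level of 1, a low-energy noise sample would route to the Wiener path and a high-energy one to the GMM path.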
