Speech detection with noise suppression based on principal components analysis
    1.
    Granted invention patent (status: lapsed)

    Publication No.: US06230122B1

    Publication date: 2001-05-08

    Application No.: US09176178

    Filing date: 1998-10-21

    IPC classification: G10L21/02

    CPC classification: G10L21/0208 G10L21/0232

    Abstract: A method for effectively suppressing background noise in a speech detection system comprises a filter bank for separating source speech data into discrete frequency sub-bands to generate filtered channel energy, and a noise suppressor for weighting the frequency sub-bands to improve the signal-to-noise ratio of the resultant noise-suppressed channel energy. The noise suppressor preferably includes a subspace module for using a Karhunen-Loeve transformation to create a subspace based on the background noise, a projection module for generating projected channel energy by projecting the filtered channel energy onto the created subspace, and a weighting module for applying calculated weighting values to the projected channel energy to generate the noise-suppressed channel energy.
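The subspace, projection, and weighting steps named in the abstract can be illustrated with a minimal NumPy sketch. This is not the patented implementation: it assumes the filter-bank channel energies are already available as a (frames, channels) array, and because the abstract does not say how the weighting values are calculated, the inverse noise standard deviation used here is only an illustrative choice. The function names `klt_basis` and `suppress` are hypothetical.

```python
import numpy as np


def klt_basis(noise_energy):
    """Build a Karhunen-Loeve basis from noise-only channel energy.

    noise_energy: array of shape (frames, channels).
    Returns (mean, eigenvalues, eigenvectors); eigenvectors are columns.
    """
    mean = noise_energy.mean(axis=0)
    cov = np.cov(noise_energy, rowvar=False)   # (channels, channels)
    eigvals, eigvecs = np.linalg.eigh(cov)     # symmetric input, ascending eigenvalues
    return mean, eigvals, eigvecs


def suppress(channel_energy, mean, eigvals, eigvecs, floor=1e-8):
    """Project channel energy onto the noise subspace and weight each axis.

    Assumption (not from the abstract): each projected component is scaled
    by 1 / sqrt(noise variance along that axis), so directions dominated
    by background noise contribute less.
    """
    projected = (channel_energy - mean) @ eigvecs
    weights = 1.0 / np.sqrt(np.maximum(eigvals, floor))
    return projected * weights


# Usage with synthetic data: 8 channels with different noise levels.
rng = np.random.default_rng(0)
noise = rng.normal(size=(500, 8)) * np.linspace(0.5, 4.0, 8)
mean, eigvals, eigvecs = klt_basis(noise)
weighted_noise = suppress(noise, mean, eigvals, eigvecs)
```

With this particular weighting, the background noise has unit variance along every projected axis, so a fixed energy threshold behaves the same way in each component. Other weighting rules would fit the abstract equally well.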

    System and method for speech recognition using an enhanced phone set
    2.
    Granted invention patent (status: lapsed)

    Publication No.: US07139708B1

    Publication date: 2006-11-21

    Application No.: US09369031

    Filing date: 1999-08-04

    IPC classification: G10L15/06

    Abstract: A system and method for speech recognition using an enhanced phone set comprises speech data, an enhanced phone set, and a transcription generated by a transcription process. The transcription process selects appropriate phones from the enhanced phone set to represent acoustic-phonetic content of the speech data. The enhanced phone set includes base-phones and composite-phones. A phone dataset includes the speech data and the transcription. The present invention also comprises a transformer that applies transformation rules to the phone dataset to produce a transformed phone dataset. The transformed phone dataset may be utilized in training a speech recognizer, such as a Hidden Markov Model. Various types of transformation rules may be applied to the phone dataset of the present invention to find an optimum transformed phone dataset for training a particular speech recognizer.
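The idea of applying transformation rules to a phone dataset can be sketched as follows. Everything named here is hypothetical: the composite-phone labels, the `Utterance` record, and the two rules (splitting composite-phones into base-phones, and merging adjacent base-phones back into composite-phones) are illustrative assumptions, since the abstract does not list the actual phones or rules.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Utterance:
    audio_id: str          # stands in for the speech data
    transcription: tuple   # sequence of phone labels


# Hypothetical composite-phones mapped to their base-phone sequences.
COMPOSITES = {"t_s": ("t", "s"), "a_i": ("a", "i")}


def split_composites(phones):
    """Rule: replace every composite-phone with its base-phones."""
    out = []
    for phone in phones:
        out.extend(COMPOSITES.get(phone, (phone,)))
    return tuple(out)


def merge_pairs(phones):
    """Rule: replace adjacent base-phone pairs with a composite-phone."""
    inverse = {bases: comp for comp, bases in COMPOSITES.items()}
    out, i = [], 0
    while i < len(phones):
        pair = tuple(phones[i:i + 2])
        if pair in inverse:
            out.append(inverse[pair])
            i += 2
        else:
            out.append(phones[i])
            i += 1
    return tuple(out)


def transform(dataset, rule):
    """Apply one transformation rule to every transcription in the dataset."""
    return [Utterance(u.audio_id, rule(u.transcription)) for u in dataset]


dataset = [Utterance("utt1", ("k", "a_i", "t_s"))]
split_dataset = transform(dataset, split_composites)
merged_dataset = transform(split_dataset, merge_pairs)
```

Each transformed dataset could then be used to train a separate recognizer, and the one giving the best recognition accuracy kept, which matches the search for an optimum transformed phone dataset that the abstract describes.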
