System for voice verification of telephone transactions
    1.
    发明授权
    System for voice verification of telephone transactions 失效
    电话交易语音验证系统

    公开(公告)号:US5937381A

    公开(公告)日:1999-08-10

    申请号:US632723

    申请日:1996-04-10

    IPC分类号: G10L17/00 G10L5/06

    CPC分类号: G10L17/24

    摘要: A system and a method is disclosed for verifying a voice of a user conducting a telephone transaction. The system and method includes a mechanism for prompting the user to speak in a limited vocabulary. A feature extractor converts the limited vocabulary into a plurality of speech frames. A pre-processor is coupled to the feature extractor for processing the plurality of speech frames to produce a plurality of processed frames. The processing includes frame selection, which eliminates each of the plurality of speech frames having an absence of words. A Viterbi decoder is also coupled to said feature extractor for assigning a frame label to each of the plurality of speech frames to produce a plurality of frame labels. The processed frames and frame labels are then combined to produce a voice model, which includes each of the plurality of frame labels that correspond to the number of plurality of processed frames. A mechanism is also provided for comparing the voice model with the claimant's voice model, derived during a previous enrollment session. The voice model also is compared with an alternate voice model set, derived during previous enrollment sessions. The identity claimed is accepted if the voice model matches the claimant's voice model better than the alternative voice model set.

    摘要翻译: 公开了一种用于验证进行电话交易的用户的语音的系统和方法。 该系统和方法包括用于提示用户以有限的词汇表达的机制。 特征提取器将有限词汇转换成多个语音帧。 预处理器耦合到特征提取器,用于处理多个语音帧以产生多个经处理的帧。 该处理包括帧选择,其消除了不存在字的多个语音帧中的每一个。 维特比解码器还耦合到所述特征提取器,用于将帧标签分配给多个语音帧中的每一个以产生多个帧标签。 然后,处理的帧和帧标签被组合以产生语音模型,其包括与多个处理帧的数量相对应的多个帧标签中的每一个。 还提供了一种机制,用于将语音模型与在先前注册会话期间派生的索赔人的语音模型进行比较。 语音模型也与以前的注册会话中派生的替代语音模型集进行比较。 如果语音模型比替代语音模型集更好地与索赔人的语音模型匹配,则所接受的身份被接受。

    Method for speech processing involving whole-utterance modeling
    2.
    发明授权
    Method for speech processing involving whole-utterance modeling 失效
    涉及全话语建模的语音处理方法

    公开(公告)号:US06961703B1

    公开(公告)日:2005-11-01

    申请号:US09660635

    申请日:2000-09-13

    IPC分类号: G10L15/06 G10L17/00 G10L19/02

    摘要: A speech verification process involves comparison of enrollment and test speech data and an improved method of comparing the data is disclosed, wherein segmented frames of speech are analyzed jointly, rather than independently. The enrollment and test speech are both subjected to a feature extraction process to derive fixed-length feature vectors, and the feature vectors are compared, using a linear discriminant analysis and having no dependence upon the order of the words spoken or the speaking rate. The discriminant analysis is made possible, despite a relatively high dimensionality of the feature vectors, by a mathematical procedure provided for finding an eigenvector to simultaneously diagonalize the between-speaker and between-channel covariances of the enrollment and test data.

    摘要翻译: 语音验证过程包括比较注册和测试语音数据,并且公开了一种比较数据的改进方法,其中分割的语音帧被共同分析而不是独立地分析。 注册和测试语音都进行特征提取处理以得出固定长度特征向量,并且使用线性判别分析来比较特征向量,并且不依赖于口语的顺序或说话率。 尽管特征向量的维度相对较高,但通过提供用于寻找特征向量以同时使注册和测试数据之间的扬声器之间和频道间协方差同时对角化的数学过程,判别分析成为可能。

    Noise suppression and channel equalization preprocessor for speech and speaker recognizers: method and apparatus
    3.
    发明授权
    Noise suppression and channel equalization preprocessor for speech and speaker recognizers: method and apparatus 失效
    用于语音和扬声器识别器的噪声抑制和信道均衡预处理器:方法和装置

    公开(公告)号:US06266633B1

    公开(公告)日:2001-07-24

    申请号:US09218565

    申请日:1998-12-22

    IPC分类号: G10L2102

    CPC分类号: G10L21/0208

    摘要: A method for performing noise suppression and channel equalization of a noisy voice signal comprising the steps of sampling the noisy voice signal at a predetermined sampling rate fs; segmenting the sampled voice signal into a plurality of frames having a predetermined number of samples per frame, over a predetermined temporal window; generating an N-point spectral sample representation of each of the sample signal frames; determining the magnitude of each of the N-point spectral samples and generating a histogram of the energy associated with each of the N-point spectral samples at a particular frequency; detecting a peak amplitude of the histogram which corresponds to a noise threshold Nf associated with the particular frequency; determining a channel frequency response Cf associated with the particular frequency by determining a geometric mean over all the spectral samples having magnitude exceeding the noise threshold Nf; subtracting from each of the magnitudes of the N point spectral samples the noise threshold Nf to provide a noise suppressed sample sequence; applying blind deconvolution to the noise suppressed samples; transforming the deconvolved noise suppressed sampled sequence to a temporal representation; shifting the temporal sample sequence in time by a predetermined amount; and adding the time shifted temporal samples over a period corresponding to the predetermined temporal window to provide a suppressed noise voice signal.

    摘要翻译: 一种用于对噪声声音信号执行噪声抑制和信道均衡的方法,包括以预定采样率fs对噪声语音信号进行采样的步骤; 在预定的时间窗口上将采样的语音信号分割成具有每帧预定数量的采样的多个帧; 产生每个采样信号帧的N点频谱采样表示; 确定每个N点频谱样本的大小,并且产生与特定频率处的每个N点频谱样本相关联的能量的直方图; 检测对应于与特定频率相关联的噪声阈值Nf的直方图的峰值幅度; 通过确定具有超过噪声阈值Nf的幅度的所有频谱样本的几何平均来确定与特定频率相关联的信道频率响应Cf; 从N点频谱样本的每个幅度中减去噪声阈值Nf以提供噪声抑制采样序列; 对噪声抑制样本应用盲解卷积; 将解卷积噪声抑制采样序列变换为时间表示; 将时间采样序列在时间上移动预定量; 以及在与预定时间窗口相对应的时间段内添加时移采样,以提供抑制噪声语音信号。