Speech processing using conditional observable maximum likelihood continuity mapping
    2.
    发明授权
    Speech processing using conditional observable maximum likelihood continuity mapping 失效
    使用条件可观测最大似然连续性映射的语音处理

    公开(公告)号:US06678658B1

    公开(公告)日:2004-01-13

    申请号:US09612026

    申请日:2000-07-07

    申请人: John Hogden David Nix

    发明人: John Hogden David Nix

    IPC分类号: G10L1506

    CPC分类号: G10L15/063 G10L15/14

    摘要: A computer implemented method enables the recognition of speech and speech characteristics. Parameters are initialized of first probability density functions that map between the symbols in the vocabulary of one or more sequences of speech codes that represent speech sounds and a continuity map. Parameters are also initialized of second probability density functions that map between the elements in the vocabulary of one or more desired sequences of speech transcription symbols and the continuity map. The parameters of the probability density functions are then trained to maximize the probabilities of the desired sequences of speech-transcription symbols. A new sequence of speech codes is then input to the continuity map having the trained first and second probability function parameters. A smooth path is identified on the continuity map that has the maximum probability for the new sequence of speech codes. The probability of each speech transcription symbol for each input speech code can then be output.

    摘要翻译: 计算机实现的方法能够识别语音和语音特征。 初始化第一概率密度函数的参数,其映射在表示语音的一个或多个语音代码序列的词汇表中的符号和连续性映射之间。 第二概率密度函数的参数也被初始化,该功能在一个或多个期望的语音转录符号序列的词汇表中的元素和连续性映射之间进行映射。 然后训练概率密度函数的参数以使语音转录符号的期望序列的概率最大化。 然后,将新的语音码序列输入具有训练的第一和第二概率函数参数的连续性映射。 在连续性图上识别出具有新的语音码序列的最大概率的平滑路径。 然后可以输出每个输入语音代码的每个语音转录符号的概率。