Discriminative training of language models for text and speech classification
    1.
    发明授权
    Discriminative training of language models for text and speech classification 有权
    文本和语言分类语言模型的歧视性训练

    公开(公告)号:US08306818B2

    公开(公告)日:2012-11-06

    申请号:US12103035

    申请日:2008-04-15

    IPC分类号: G10L15/00 G06F17/27

    摘要: Methods are disclosed for estimating language models such that the conditional likelihood of a class given a word string, which is very well correlated with classification accuracy, is maximized. The methods comprise tuning statistical language model parameters jointly for all classes such that a classifier discriminates between the correct class and the incorrect ones for a given training sentence or utterance. Specific embodiments of the present invention pertain to implementation of the rational function growth transform in the context of a discriminative training technique for n-gram classifiers.

    摘要翻译: 公开了用于估计语言模型的方法,使得给定字串的类的条件似然性与分类准确度非常良好地相关联。 这些方法包括对所有类共同调整统计语言模型参数,使得分类器在给定训练句或话语中区分正确类和不正确类之间的差异。 本发明的具体实施例涉及在n-gram分类器的鉴别训练技术的上下文中实现有理函数增长变换。

    Discriminative training of language models for text and speech classification
    2.
    发明授权
    Discriminative training of language models for text and speech classification 有权
    文本和语言分类语言模型的歧视性训练

    公开(公告)号:US07379867B2

    公开(公告)日:2008-05-27

    申请号:US10453349

    申请日:2003-06-03

    IPC分类号: G06F17/27 G10L15/00

    摘要: Methods are disclosed for estimating language models such that the conditional likelihood of a class given a word string, which is very well correlated with classification accuracy, is maximized. The methods comprise tuning statistical language model parameters jointly for all classes such that a classifier discriminates between the correct class and the incorrect ones for a given training sentence or utterance. Specific embodiments of the present invention pertain to implementation of the rational function growth transform in the context of a discriminative training technique for n-gram classifiers.

    摘要翻译: 公开了用于估计语言模型的方法,使得给定字串的类的条件似然性与分类准确度非常良好地相关联。 这些方法包括对所有类共同调整统计语言模型参数,使得分类器在给定训练句或话语中区分正确类和不正确类之间的差异。 本发明的具体实施例涉及在n-gram分类器的鉴别训练技术的上下文中实现有理函数增长变换。

    DISCRIMINATIVE TRAINING OF LANGUAGE MODELS FOR TEXT AND SPEECH CLASSIFICATION
    4.
    发明申请
    DISCRIMINATIVE TRAINING OF LANGUAGE MODELS FOR TEXT AND SPEECH CLASSIFICATION 有权
    用于文本和语音分类的语言模式的歧视性培训

    公开(公告)号:US20080215311A1

    公开(公告)日:2008-09-04

    申请号:US12103035

    申请日:2008-04-15

    IPC分类号: G06F17/27

    摘要: Methods are disclosed for estimating language models such that the conditional likelihood of a class given a word string, which is very well correlated with classification accuracy, is maximized. The methods comprise tuning statistical language model parameters jointly for all classes such that a classifier discriminates between the correct class and the incorrect ones for a given training sentence or utterance. Specific embodiments of the present invention pertain to implementation of the rational function growth transform in the context of a discriminative training technique for n-gram classifiers.

    摘要翻译: 公开了用于估计语言模型的方法,使得给定字串的类的条件似然性与分类准确度非常良好地相关联。 这些方法包括对所有类共同调整统计语言模型参数,使得分类器在给定训练句或话语中区分正确类和不正确类之间的差异。 本发明的具体实施例涉及在n-gram分类器的鉴别训练技术的上下文中实现有理函数增长变换。

    Method and apparatus for predicting word error rates from text
    10.
    发明授权
    Method and apparatus for predicting word error rates from text 有权
    用于从文本中预测字错误率的方法和装置

    公开(公告)号:US07117153B2

    公开(公告)日:2006-10-03

    申请号:US10365850

    申请日:2003-02-13

    IPC分类号: G10L15/06 G10L15/14

    CPC分类号: G10L15/197 G10L15/183

    摘要: A method of modeling a speech recognition system includes decoding a speech signal produced from a training text to produce a sequence of predicted speech units. The training text comprises a sequence of actual speech units that is used with the sequence of predicted speech units to form a confusion model. In further embodiments, the confusion model is used to decode a text to identify an error rate that would be expected if the speech recognition system decoded speech based on the text.

    摘要翻译: 对语音识别系统进行建模的方法包括对从训练文本产生的语音信号进行解码以产生预测语音单元的序列。 训练文本包括与预测语音单元的序列一起使用以形成混淆模型的实际语音单元的序列。 在另外的实施例中,混淆模型用于对文本进行解码以识别如果语音识别系统基于文本解码的语音将会预期的错误率。