Speech recognition rejection method using generalized additive models
    1.
    发明授权
    Speech recognition rejection method using generalized additive models 失效
    使用广义加法模型的语音识别拒绝方法

    公开(公告)号:US6006182A

    公开(公告)日:1999-12-21

    申请号:US934892

    申请日:1997-09-22

    IPC分类号: G10L15/14 G10L5/06 G10L9/00

    CPC分类号: G10L15/142

    摘要: Systems and methods consistent with the present invention determine whether to accept one of a plurality of intermediate recognition results output by a speech recognition system as a final recognition result. The system first combines a plurality of speech rejection features into a feature function in which weights are assigned to each rejection feature in accordance with a recognition accuracy of each rejection feature. Feature values are then calculated for each of the rejection features using the plurality of intermediate recognition results. The system next computes the feature function according to the calculated feature values to determine a rejection decision value. Finally, one of the plurality of intermediate recognition results is accepted as the final recognition result according to the rejection decision value.

    摘要翻译: 与本发明一致的系统和方法确定是否接受由语音识别系统输出的多个中间识别结果中的一个作为最终识别结果。 该系统首先将多个语音抑制特征组合成特征功能,其中根据每个拒绝特征的识别精度将权重分配给每个拒绝特征。 然后使用多个中间识别结果为每个拒绝特征计算特征值。 系统接下来根据计算的特征值计算特征函数以确定拒绝判定值。 最后,多个中间识别结果之一被接受为根据拒绝判定值的最终识别结果。

    Search optimization system and method for continuous speech recognition
    2.
    发明授权
    Search optimization system and method for continuous speech recognition 失效
    搜索优化系统和连续语音识别方法

    公开(公告)号:US06397179B2

    公开(公告)日:2002-05-28

    申请号:US09185529

    申请日:1998-11-04

    IPC分类号: G10L1508

    CPC分类号: G10L15/1815 G10L2015/085

    摘要: A system and method for continuous speech recognition (CSR) is optimized to reduce processing time for connected word grammars bounded by semantically null words. The savings, which reduce processing time both during the forward and the backward passes of the search, as well as during rescoring, are achieved by performing only the minimal amount of computation required to produce an exact N-best list of semantically meaningful words (N-best list of salient words). This departs from the standard Spoken Language System modeling which any notion of meaning is handled by the Natural Language Understanding (NLU) component. By expanding the task of the recognizer component from a simple acoustic match to allow semantic information to be fed to the recognizer, significant processing time savings are achieved, and make it possible to run an increased number of speech recognition channels in parallel for improved performance, which may enhance users perception of value and quality of service.

    摘要翻译: 用于连续语音识别(CSR)的系统和方法被优化以减少由语义空字界定的连接词语法的处理时间。 在搜索的正向和反向遍历期间以及在解密期间减少处理时间的节省通过仅执行产生语义有意义的单词的精确N最佳列表所需的最小量的计算来实现(N 最显着的词汇表)。 这偏离了标准语言系统建模,任何意义的概念都由自然语言理解(NLU)组成。 通过从简单的声学匹配扩展识别器组件的任务以允许将语义信息馈送到识别器,实现显着的处理时间节省,并且使得可以并行运行更多数量的语音识别信道以提高性能, 这可能会增强用户的价值观和服务质量。