METHOD AND APPARATUS FOR LOCATING SPEECH KEYWORD AND SPEECH RECOGNITION SYSTEM
    1.
    Invention application
    METHOD AND APPARATUS FOR LOCATING SPEECH KEYWORD AND SPEECH RECOGNITION SYSTEM (status: in force)

    Publication number: US20100094626A1

    Publication date: 2010-04-15

    Application number: US12443063

    Filing date: 2007-09-27

    IPC classification: G10L15/02 G10L15/06 G10L15/14

    Abstract: The invention provides a method and apparatus for locating a keyword in speech, and a speech recognition system. The method includes the steps of: extracting feature parameters from the frames constituting the recognition target speech to form a feature parameter vector sequence that represents that speech; normalizing the feature parameter vector sequence using a codebook containing a plurality of codebook vectors to obtain a feature trace of the recognition target speech in a vector space; and specifying the position of a keyword by matching prestored keyword template traces against the feature trace. Because the keyword template trace and the feature space trace of the target speech are drawn in accordance with an identical codebook, no resampling is needed when performing linear movement matching of speech wave frames that have similar phonological feature structures. This makes it possible to improve the speed of location and recognition while preserving recognition precision.
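
    The normalization step can be illustrated with a minimal sketch. It assumes, as one plausible reading that the abstract itself does not spell out, that each frame's feature vector is mapped to its nearest codebook vector and that consecutive frames mapped to the same codebook vector are merged into one trace point. The function names, the Euclidean distance, and the toy two-dimensional data are all hypothetical.

```python
import math

def nearest_code(vec, codebook):
    """Return the index of the codebook vector closest to vec (Euclidean distance)."""
    return min(range(len(codebook)), key=lambda i: math.dist(vec, codebook[i]))

def feature_trace(frames, codebook):
    """Map each frame to its nearest codebook index and merge consecutive repeats.

    Assumption: merging runs of identical indices is what turns a
    frame-rate sequence into a duration-independent trace.
    """
    trace = []
    for vec in frames:
        idx = nearest_code(vec, codebook)
        if not trace or trace[-1] != idx:
            trace.append(idx)
    return trace

# Toy data: three codebook vectors and five frames of 2-D features.
codebook = [(0.0, 0.0), (1.0, 1.0), (2.0, 0.0)]
frames = [(0.1, 0.0), (0.0, 0.2), (0.9, 1.1), (1.0, 0.8), (2.1, 0.1)]
trace = feature_trace(frames, codebook)
print(trace)
```

    In this sketch, five frames collapse to three trace points, so the trace length depends on how the speech moves through the vector space rather than on how long each sound is held.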

    Method and apparatus for locating speech keyword and speech recognition system
    2.
    Granted invention patent
    Method and apparatus for locating speech keyword and speech recognition system (status: in force)

    Publication number: US08255215B2

    Publication date: 2012-08-28

    Application number: US12443063

    Filing date: 2007-09-27

    IPC classification: G10L15/26 G10L19/14 G10L15/04

    Abstract: The invention provides a method and apparatus for locating a keyword in speech, and a speech recognition system. The method includes the steps of: extracting feature parameters from the frames constituting the recognition target speech to form a feature parameter vector sequence that represents that speech; normalizing the feature parameter vector sequence using a codebook containing a plurality of codebook vectors to obtain a feature trace of the recognition target speech in a vector space; and specifying the position of a keyword by matching prestored keyword template traces against the feature trace. Because the keyword template trace and the feature space trace of the recognition target speech are drawn in accordance with an identical codebook, no resampling is needed when performing linear movement matching of speech wave frames that have similar phonological feature structures. This makes it possible to improve the speed of location and recognition while preserving recognition precision.
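
    The matching step can be sketched as a sliding comparison. This is only an illustration under stated assumptions, not the patented procedure: traces are taken to be lists of codebook indices, the cost is the mean Euclidean distance between the corresponding codebook vectors, and the keyword is placed at the offset with the lowest cost. The function name `locate_keyword` and the toy data are hypothetical.

```python
import math

def locate_keyword(trace, template, codebook):
    """Slide the template along the trace; return (best_offset, best_cost).

    Both sequences index the same codebook, so their points can be
    compared directly without resampling either one.
    """
    best_offset, best_cost = None, math.inf
    for offset in range(len(trace) - len(template) + 1):
        # Mean distance between aligned codebook vectors at this offset.
        cost = sum(
            math.dist(codebook[trace[offset + k]], codebook[code])
            for k, code in enumerate(template)
        ) / len(template)
        if cost < best_cost:
            best_offset, best_cost = offset, cost
    return best_offset, best_cost

# Toy data: a shared codebook, a speech trace, and one keyword template.
codebook = [(0.0, 0.0), (1.0, 1.0), (2.0, 0.0), (3.0, 1.0)]
speech_trace = [0, 1, 2, 3, 1, 0]
keyword_template = [2, 3, 1]
offset, cost = locate_keyword(speech_trace, keyword_template, codebook)
print(offset, cost)
```

    A real system would presumably compare the best cost against a threshold before reporting a keyword; that decision rule is not described in the abstract and is left out here.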
