Spoken Language Identification System and Methods for Training and Operating Same
    80.
    发明申请
    Spoken Language Identification System and Methods for Training and Operating Same 有权
    口语识别系统和培训与操作方法相同

    公开(公告)号:US20070299666A1

    公开(公告)日:2007-12-27

    申请号:US11575479

    申请日:2005-09-19

    IPC分类号: G10L15/00

    CPC分类号: G10L15/005 G10L2015/025

    摘要: A method for training a spoken language identification system to identify an unknown language as one of a plurality of known candidate languages includes the process of creating a sound inventory comprising a plurality of sound tokens, the collective plurality of sound tokens provided from a subset of the known candidate languages. The method further includes providing a plurality of training samples, each training sample composed within one of the known candidate languages. Further included is the process of generating one or more training vectors from each training database, wherein each training vector is defined as a function of said plurality of sound tokens provided from said subset of the known candidate languages. The method further includes associating each training vector with the candidate language of the corresponding training sample.

    摘要翻译: 用于训练口语识别系统以识别作为多种已知候选语言之一的未知语言的方法包括创建包括多个声音令牌的声音库存的过程,所述多个声音令牌的集合从 已知的候选语言。 该方法还包括提供多个训练样本,每个训练样本在已知候选语言之一内组成。 进一步包括从每个训练数据库生成一个或多个训练向量的过程,其中每个训练向量被定义为从所述已知候选语言的所述子集提供的所述多个声音令牌的函数。 该方法还包括将每个训练向量与相应训练样本的候选语言相关联。