发明授权
US06343267B1 Dimensionality reduction for speaker normalization and speaker and environment adaptation using eigenvoice techniques 有权
使用本征语音技术的扬声器归一化和扬声器和环境适应的尺寸减小

  • 专利标题: Dimensionality reduction for speaker normalization and speaker and environment adaptation using eigenvoice techniques
  • 专利标题(中): 使用本征语音技术的扬声器归一化和扬声器和环境适应的尺寸减小
  • 申请号: US09148753
    申请日: 1998-09-04
  • 公开(公告)号: US06343267B1
    公开(公告)日: 2002-01-29
  • 发明人: Roland KuhnPatrick NguyenJean-Claude Junqua
  • 申请人: Roland KuhnPatrick NguyenJean-Claude Junqua
  • 主分类号: G10L1908
  • IPC分类号: G10L1908
Dimensionality reduction for speaker normalization and speaker and environment adaptation using eigenvoice techniques
摘要:
A set of speaker dependent models or adapted models is trained upon a comparatively large number of training speakers, one model per speaker, and model parameters are extracted in a predefined order to construct a set of supervectors, one per speaker. Dimensionality reduction is then performed on the set of supervectors to generate a set of eigenvectors that define an eigenvoice space. If desired, the number of vectors may be reduced to achieve data compression. Thereafter, a new speaker provides adaptation data from which a supervector is constructed by constraining this supervector to be in the eigenvoice space based on a maximum likelihood estimation. The resulting coefficients in the eigenspace of this new speaker may then be used to construct a new set of model parameters from which an adapted model is constructed for that speaker. The adapted model may then be further adapted via MAP, MLLR, MLED or the like. The eigenvoice technique may be applied to MLLR transformation matrices or the like; Bayesian estimation performed in eigenspace uses prior knowledge about speaker space density to refine the estimate about the location of a new speaker in eigenspace.
信息查询
0/0