Granted invention patent
US5280562A Speech coding apparatus with single-dimension acoustic prototypes for a speech recognizer
Expired
- Patent title: Speech coding apparatus with single-dimension acoustic prototypes for a speech recognizer
- Patent title (Chinese): 具有用于语音识别器的单维声学原型的语音编码装置
- Application number: US770495
- Filing date: 1991-10-03
- Publication number: US5280562A
- Publication date: 1994-01-18
- Inventors: Lalit R. Bahl, Jerome R. Bellegarda, Edward A. Epstein, John M. Lucassen, David Nahamoo, Michael A. Picheny
- Applicants: Lalit R. Bahl, Jerome R. Bellegarda, Edward A. Epstein, John M. Lucassen, David Nahamoo, Michael A. Picheny
- Applicant address: Armonk, NY
- Assignee: International Business Machines Corporation
- Current assignee: International Business Machines Corporation
- Current assignee address: Armonk, NY
- Main classification: G10L19/00
- IPC classification: G10L19/00; G10L15/02; G10L19/02; H03M7/30; G10L9/02
Abstract:
In speech recognition and speech coding, the values of at least two features of an utterance are measured during a series of time intervals to produce a series of feature vector signals. A plurality of single-dimension prototype vector signals having only one parameter value are stored. At least two single-dimension prototype vector signals have parameter values representing first feature values, and at least two other single-dimension prototype vector signals have parameter values representing second feature values. A plurality of compound-dimension prototype vector signals have unique identification values and comprise one first-dimension and one second-dimension prototype vector signal. At least two compound-dimension prototype vector signals comprise the same first-dimension prototype vector signal. The feature values of each feature vector signal are compared to the parameter values of the compound-dimension prototype vector signals to obtain prototype match scores. The identification values of the compound-dimension prototype vector signals having the best prototype match scores for the feature vector signals are output as a sequence of coded representations of an utterance to be recognized. A match score, comprising an estimate of the closeness of a match between a speech unit and the sequence of coded representations of the utterance, is generated for each of a plurality of speech units. At least one speech subunit, of one or more best candidate speech units having the best match scores, is displayed.
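The coding step described in the abstract can be illustrated with a minimal sketch. It is not the patented implementation: the prototype values, the names `FIRST_DIM`, `SECOND_DIM`, `COMPOUND`, and `encode`, and the use of summed squared distance as the match score are all assumptions chosen for illustration. The sketch shows only how compound prototypes can share single-dimension prototypes, so that each per-dimension comparison is computed once per frame and reused.

```python
# Hypothetical illustration; all values and the distance measure are assumptions.

# Single-dimension prototypes: each holds exactly one parameter value.
FIRST_DIM = [0.0, 1.0]        # candidate values for the first feature
SECOND_DIM = [0.0, 2.0, 4.0]  # candidate values for the second feature

# Compound prototypes: unique ID -> (index into FIRST_DIM, index into SECOND_DIM).
# IDs 0 and 1 share the same first-dimension prototype, as do IDs 2 and 3.
COMPOUND = {0: (0, 0), 1: (0, 1), 2: (1, 1), 3: (1, 2)}


def encode(feature_vectors):
    """Return the ID of the best-matching compound prototype for each frame."""
    labels = []
    for f1, f2 in feature_vectors:
        # Each single-dimension distance is computed once per frame and
        # reused by every compound prototype that references it.
        d1 = [(f1 - p) ** 2 for p in FIRST_DIM]
        d2 = [(f2 - p) ** 2 for p in SECOND_DIM]
        # Lower summed distance is treated as a better match (assumption).
        best_id = min(COMPOUND, key=lambda k: d1[COMPOUND[k][0]] + d2[COMPOUND[k][1]])
        labels.append(best_id)
    return labels


frames = [(0.1, 0.2), (0.2, 1.9), (0.9, 2.1), (1.1, 3.8)]
print(encode(frames))
```

With two first-dimension and three second-dimension prototypes, the sketch evaluates five scalar distances per frame regardless of how many compound prototypes are defined. The later stage of scoring speech units against the resulting label sequence is not covered here.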
Published/granted documents
- US4154571A Premix gas burner assembly, publication/grant date: 1979-05-15