Speech coding apparatus with single-dimension acoustic prototypes for a speech recognizer

Granted invention patent

US5280562A Speech coding apparatus with single-dimension acoustic prototypes for a speech recognizer (expired)

Patent title: Speech coding apparatus with single-dimension acoustic prototypes for a speech recognizer
Application number: US770495

Filing date: 1991-10-03
Publication number: US5280562A

Publication date: 1994-01-18
Inventors: Lalit R. Bahl, Jerome R. Bellegarda, Edward A. Epstein, John M. Lucassen, David Nahamoo, Michael A. Picheny
Applicants: Lalit R. Bahl, Jerome R. Bellegarda, Edward A. Epstein, John M. Lucassen, David Nahamoo, Michael A. Picheny
Applicant address: Armonk, NY
Assignee: International Business Machines Corporation
Current assignee: International Business Machines Corporation
Current assignee address: Armonk, NY
Main classification: G10L19/00
IPC classifications: G10L19/00; G10L15/02; G10L19/02; H03M7/30; G10L9/02

Abstract:

In speech recognition and speech coding, the values of at least two features of an utterance are measured during a series of time intervals to produce a series of feature vector signals. A plurality of single-dimension prototype vector signals, each having only one parameter value, are stored. At least two single-dimension prototype vector signals have parameter values representing first feature values, and at least two other single-dimension prototype vector signals have parameter values representing second feature values. A plurality of compound-dimension prototype vector signals have unique identification values, and each comprises one first-dimension and one second-dimension prototype vector signal. At least two compound-dimension prototype vector signals comprise the same first-dimension prototype vector signal. The feature values of each feature vector signal are compared to the parameter values of the compound-dimension prototype vector signals to obtain prototype match scores. The identification values of the compound-dimension prototype vector signals having the best prototype match scores for the feature vector signals are output as a sequence of coded representations of an utterance to be recognized. A match score, comprising an estimate of the closeness of a match between a speech unit and the sequence of coded representations of the utterance, is generated for each of a plurality of speech units. At least one speech subunit, of one or more best candidate speech units having the best match scores, is displayed.
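
The coding step described in the abstract can be illustrated with a minimal sketch. It is not taken from the patent: the prototype values, the identifiers "P0" to "P3", the function name encode, and the use of summed squared distance as the prototype match score (lower is better) are all assumptions made for illustration. The sketch shows two compound-dimension prototypes sharing one first-dimension prototype, so each single-dimension distance is computed once per feature vector and reused.

```python
# Illustrative sketch only; all values and names below are hypothetical.
# Assumption: the prototype match score is a summed squared distance.

# Single-dimension prototypes: one parameter value each.
FIRST_DIM = [0.0, 1.0]        # prototypes for the first feature
SECOND_DIM = [0.0, 2.0, 4.0]  # prototypes for the second feature

# Compound-dimension prototypes: unique ID -> (first-dim index, second-dim index).
# "P0" and "P1" share first-dimension prototype 0; "P2" and "P3" share prototype 1.
COMPOUND = {
    "P0": (0, 0),
    "P1": (0, 2),
    "P2": (1, 1),
    "P3": (1, 2),
}


def encode(feature_vectors):
    """Return the ID of the best-matching compound prototype for each vector."""
    labels = []
    for f1, f2 in feature_vectors:
        # Each single-dimension distance is computed once, then reused by
        # every compound prototype that references that prototype.
        d1 = [(f1 - p) ** 2 for p in FIRST_DIM]
        d2 = [(f2 - p) ** 2 for p in SECOND_DIM]
        best = min(COMPOUND, key=lambda k: d1[COMPOUND[k][0]] + d2[COMPOUND[k][1]])
        labels.append(best)
    return labels


# One (feature 1, feature 2) pair per time interval.
utterance = [(0.1, 3.9), (0.9, 2.2), (0.2, 0.1)]
print(encode(utterance))  # ['P1', 'P2', 'P0']
```

Under these assumptions, scoring four compound prototypes needs only five single-dimension distance computations per feature vector (two plus three) rather than eight, which is the kind of saving that sharing single-dimension prototypes makes possible.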

Published/granted documents

US4154571A Premix gas burner assembly, publication/grant date: 1979-05-15

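IPC classification:

G	Physics
G10	Musical instruments; acoustics
G10L	Speech analysis or synthesis; speech recognition; speech or voice processing; speech or audio coding or decoding
G10L19/00	Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; coding or decoding of speech or audio signals, using source-filter models or psychoacoustic analysis (in musical instruments, see G10H)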