Patent search: ap:("Lalit R. Bahl" OR "Jerome R. Bellegarda" OR "Peter V. De Souza" OR "Ponani S. Gopalakrishnan" OR "Arthur J. Nadas" OR "David Nahamoo" OR "Michael A. Picheny") AND inv:"Jerome R. Bellegarda" (page 1)

1.

Granted invention patent
Speech coding apparatus having speaker dependent prototypes generated from nonuser reference data (Expired)

Publication No.: US5278942A

Publication date: 1994-01-11

Application No.: US802678

Filing date: 1991-12-05

Applicants: Lalit R. Bahl, Jerome R. Bellegarda, Peter V. De Souza, Ponani S. Gopalakrishnan, Arthur J. Nadas, David Nahamoo, Michael A. Picheny

Inventors: Lalit R. Bahl, Jerome R. Bellegarda, Peter V. De Souza, Ponani S. Gopalakrishnan, Arthur J. Nadas, David Nahamoo, Michael A. Picheny

IPC classification: G10L19/00, G10L15/02, G10L15/06, G10L15/10, G10L9/02

CPC classification: G10L15/063, G10L15/02

Abstract: A speech coding apparatus and method for use in a speech recognition apparatus and method. The value of at least one feature of an utterance is measured during each of a series of successive time intervals to produce a series of feature vector signals representing the feature values. A plurality of prototype vector signals, each having at least one parameter value and a unique identification value are stored. The closeness of the feature vector signal is compared to the parameter values of the prototype vector signals to obtain prototype match scores for the feature value signal and each prototype vector signal. The identification value of the prototype vector signal having the best prototype match score is output as a coded representation signal of the feature vector signal. Speaker-dependent prototype vector signals are generated from both synthesized training vector signals and measured training vector signals. The synthesized training vector signals are transformed reference feature vector signals representing the values of features of one or more utterances of one or more speakers in a reference set of speakers. The measured training feature vector signals represent the values of features of one or more utterances of a new speaker/user not in the reference set.
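
The coding step described in this abstract (compare each feature vector with every stored prototype and output the identifier of the best match) can be illustrated with a minimal sketch. Euclidean distance is assumed here as the match score; the abstract does not name a specific measure. The function name `encode`, the prototype identifiers, and the sample values are hypothetical.

```python
import math

def encode(feature_vectors, prototypes):
    """Return, for each feature vector, the ID of the closest prototype.

    prototypes maps an identification value to a parameter vector.
    Closeness is Euclidean distance (an assumption for illustration).
    """
    labels = []
    for vec in feature_vectors:
        best_id = min(prototypes, key=lambda pid: math.dist(vec, prototypes[pid]))
        labels.append(best_id)
    return labels

# Hypothetical prototypes and measured frames (two features per frame).
prototypes = {"P1": (0.0, 0.0), "P2": (1.0, 1.0), "P3": (4.0, 0.0)}
frames = [(0.1, -0.2), (0.9, 1.2), (3.5, 0.4)]

labels = encode(frames, prototypes)
print(labels)  # ['P1', 'P2', 'P3']
```

The output is one label per time interval, which is the coded representation a downstream recognizer would consume.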

2.

Granted invention patent
Method and apparatus for modeling words with multi-arc Markov models (Expired)

Publication No.: US5129001A

Publication date: 1992-07-07

Application No.: US514075

Filing date: 1990-04-25

Applicants: Lalit R. Bahl, Jerome R. Bellegarda, Peter V. De Souza, Ponani S. Gopalakrishnan, David Nahamoo, Michael A. Picheny

Inventors: Lalit R. Bahl, Jerome R. Bellegarda, Peter V. De Souza, Ponani S. Gopalakrishnan, David Nahamoo, Michael A. Picheny

IPC classification: G06F7/00, G06F17/18, G10L15/06, G10L15/10, G10L15/14

CPC classification: G10L15/144

Abstract: Modeling a word is done by concatenating a series of elemental models to form a word model. At least one elemental model in the series is a composite elemental model formed by combining the starting states of at least first and second primitive elemental models. Each primitive elemental model represents a speech component. The primitive elemental models are combined by a weighted combination of their parameters in proportion to the values of the weighting factors. To tailor the word model to closely represent variations in the pronunciation of the word, the word is uttered a plurality of times by a plurality of different speakers. Constructing word models from composite elemental models, and constructing composite elemental models from primitive elemental models enables word models to represent many variations in the pronunciation of a word. Providing a relatively small set of primitive elemental models for a relatively large vocabulary of words enables models to be trained to the voice of a new speaker by having the new speaker utter only a small subset of the words in the vocabulary.
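
The weighted combination of primitive model parameters mentioned in this abstract can be sketched as follows. This is only an illustration under assumptions: each primitive model is reduced to a single list of output probabilities, and the weights are normalized by their sum. The function name `combine_models` and the sample values are hypothetical and not taken from the patent.

```python
def combine_models(primitives, weights):
    """Combine parameter vectors of primitive models by a weighted average.

    primitives: list of equal-length parameter lists (here, output probabilities).
    weights: one non-negative weighting factor per primitive model.
    """
    if len(primitives) != len(weights):
        raise ValueError("need one weight per primitive model")
    total = sum(weights)
    if total <= 0:
        raise ValueError("weights must sum to a positive value")
    size = len(primitives[0])
    return [
        sum(w * p[i] for w, p in zip(weights, primitives)) / total
        for i in range(size)
    ]

# Two hypothetical primitive models, each with two output probabilities.
primitive_a = [0.8, 0.2]
primitive_b = [0.4, 0.6]
composite = combine_models([primitive_a, primitive_b], [0.75, 0.25])
print(composite)  # approximately [0.7, 0.3]
```

Because the weights are normalized, the composite remains a valid probability distribution whenever each primitive is one.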

3.

Granted invention patent
Speech coding apparatus with single-dimension acoustic prototypes for a speech recognizer (Expired)

Publication No.: US5280562A

Publication date: 1994-01-18

Application No.: US770495

Filing date: 1991-10-03

Applicants: Lalit R. Bahl, Jerome R. Bellegarda, Edward A. Epstein, John M. Lucassen, David Nahamoo, Michael A. Picheny

Inventors: Lalit R. Bahl, Jerome R. Bellegarda, Edward A. Epstein, John M. Lucassen, David Nahamoo, Michael A. Picheny

IPC classification: G10L19/00, G10L15/02, G10L19/02, H03M7/30, G10L9/02

CPC classification: G10L19/038, H03M7/3082

Abstract: In speech recognition and speech coding, the values of at least two features of an utterance are measured during a series of time intervals to produce a series of feature vector signals. A plurality of single-dimension prototype vector signals having only one parameter value are stored. At least two single-dimension prototype vector signals having parameter values representing first feature values, and at least two other single-dimension prototype vector signals have parameter values representing second feature values. A plurality of compound-dimension prototype vector signals have unique identification values and comprise one first-dimension and one second-dimension prototype vector signal. At least two compound-dimension prototype vector signals comprise the same first-dimension prototype vector signal. The feature values of each feature vector signal are compared to the parameter values of the compound-dimension prototype vector signals to obtain prototype match scores. The identification values of the compound-dimension prototype vector signals having the best prototype match scores for the feature vectors signals are output as a sequence of coded representations of an utterance to be recognized. A match score, comprising an estimate of the closeness of a match between a speech unit and the sequence of coded representations of the utterance, is generated for each of a plurality of speech units. At least one speech subunit, of one or more best candidate speech units having the best match scores, is displayed.
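
One way to picture the compound prototypes described here is as tuples of indices into per-dimension tables of single-dimension prototypes, so that several compound prototypes can share the same first-dimension entry. The sketch below assumes a squared-difference score per dimension, summed across dimensions; the abstract does not specify the scoring function. All names and values are hypothetical.

```python
def encode_frame(frame, dim_prototypes, compounds):
    """Return the ID of the compound prototype that best matches one frame.

    dim_prototypes[d] lists the single-dimension prototype values for feature d.
    compounds maps an ID to one prototype index per dimension.
    """
    # Per-dimension distances are computed once and reused by every compound
    # prototype that refers to the same single-dimension prototype.
    dist = [
        [(x - p) ** 2 for p in protos]
        for x, protos in zip(frame, dim_prototypes)
    ]

    def score(indices):
        return sum(dist[d][i] for d, i in enumerate(indices))

    return min(compounds, key=lambda cid: score(compounds[cid]))

# Two single-dimension prototypes per feature (hypothetical values).
dim_prototypes = [[0.0, 1.0], [0.0, 2.0]]
# C1 and C2 share the same first-dimension prototype (index 0).
compounds = {"C1": (0, 0), "C2": (0, 1), "C3": (1, 1)}

best = encode_frame((0.1, 1.9), dim_prototypes, compounds)
print(best)  # C2
```

Sharing single-dimension prototypes means the distance table grows with the number of one-dimensional entries rather than with the number of compound prototypes.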

4.

Granted invention patent
Fast algorithm for deriving acoustic prototypes for automatic speech recognition (Expired)

Publication No.: US5276766A

Publication date: 1994-01-04

Application No.: US730714

Filing date: 1991-07-16

Applicants: Lalit R. Bahl, Jerome R. Bellegarda, Peter V. DeSouza, David Nahamoo, Michael A. Picheny

Inventors: Lalit R. Bahl, Jerome R. Bellegarda, Peter V. DeSouza, David Nahamoo, Michael A. Picheny

IPC classification: G10L19/00, G10L15/02, G10L15/06, G10L9/04

CPC classification: G10L15/063

Abstract: An apparatus for generating a set of acoustic prototype signals for encoding speech includes a memory for storing a training script model comprising a series of word-segment models. Each word-segment model comprises a series of elementary models. An acoustic measure is provided for measuring the value of at least one feature of an utterance of the training script during each of a series of time intervals to produce a series of feature vector signals representing the feature values of the utterance. An acoustic matcher is provided for estimating at least one path through the training script model which would produce the entire series of measured feature vector signals. From the estimated path, the elementary model in the training script model which would produce each feature vector signal is estimated. The apparatus further comprises a cluster processor for clustering the feature vector signals into a plurality of clusters. Each feature vector signal in a cluster corresponds to a single elementary model in a single location in a single word-segment model. Each cluster signal has a cluster value equal to an average of the feature values of all feature vectors in the signal. Finally, the apparatus includes a memory for storing a plurality of prototype vector signals. Each prototype vector signal corresponds to an elementary model, has an identifier, and comprises at least two partition values. The partition values are equal to combinations of the cluster values of one or more cluster signals corresponding to the elementary model.
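
The clustering step in this abstract groups feature vectors by the elementary model, location, and word-segment model they were aligned to, then averages each group. A minimal sketch follows, assuming the alignment has already been produced by some acoustic matcher; the key layout `(word_segment, location, elementary_model)` and the sample data are hypothetical.

```python
from collections import defaultdict

def cluster_means(aligned_frames):
    """Average the feature vectors that share one alignment key.

    aligned_frames: iterable of (key, feature_vector) pairs, where key is a
    hypothetical (word_segment, location, elementary_model) tuple.
    Returns a dict mapping each key to its mean feature vector.
    """
    groups = defaultdict(list)
    for key, vec in aligned_frames:
        groups[key].append(vec)
    return {
        key: tuple(sum(column) / len(vecs) for column in zip(*vecs))
        for key, vecs in groups.items()
    }

# Hypothetical alignment output: two frames for one key, one frame for another.
aligned = [
    (("the", 0, "E1"), (1.0, 2.0)),
    (("the", 0, "E1"), (3.0, 4.0)),
    (("cat", 1, "E1"), (5.0, 6.0)),
]
means = cluster_means(aligned)
print(means[("the", 0, "E1")])  # (2.0, 3.0)
```

Both keys above refer to the same elementary model `E1`, so their cluster values are the ones that would be combined into that model's prototype partition values.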

5.

Granted invention patent
Automatic handwriting recognition using both static and dynamic parameters (Expired)

Publication No.: US5550931A

Publication date: 1996-08-27

Application No.: US450557

Filing date: 1995-05-25

Applicants: Jerome R. Bellegarda, David Nahamoo, Krishna S. Nathan

Inventors: Jerome R. Bellegarda, David Nahamoo, Krishna S. Nathan

IPC classification: G06K9/46, G06K9/03, G06K9/22, G06K9/62, G06K9/68, G06K9/00

CPC classification: G06K9/6293, G06K9/00416, G06K9/00429

Abstract: Methods and apparatus are disclosed for recognizing handwritten characters in response to an input signal from a handwriting transducer. A feature extraction and reduction procedure is disclosed that relies on static or shape information, wherein the temporal order in which points are captured by an electronic tablet may be disregarded. A method of the invention generates and processes the tablet data with three independent sets of feature vectors which encode the shape information of the input character information. These feature vectors include horizontal (x-axis) and vertical (y-axis) slices of a bit-mapped image of the input character data, and an additional feature vector to encode an absolute y-axis displacement from a baseline of the bit-mapped image. It is shown that the recognition errors that result from the spatial or static processing are quite different from those resulting from temporal or dynamic processing. Furthermore, it is shown that these differences complement one another. As a result, a combination of these two sources of feature vector information provides a substantial reduction in an overall recognition error rate. Methods to combine probability scores from dynamic and the static character models are also disclosed.
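
The abstract states that probability scores from the static and dynamic character models are combined, but it does not say how. One common possibility, shown here purely as an assumption and not as the patented method, is a weighted sum of log-probabilities. The function name `combine_scores`, the weight `alpha`, and the sample probabilities are hypothetical.

```python
import math

def combine_scores(static_probs, dynamic_probs, alpha=0.5):
    """Pick the character with the best weighted log-probability.

    static_probs / dynamic_probs map candidate characters to probabilities > 0.
    alpha weights the static model; (1 - alpha) weights the dynamic model.
    Returns (best_character, combined_log_scores).
    """
    combined = {}
    for ch in static_probs.keys() & dynamic_probs.keys():
        combined[ch] = (
            alpha * math.log(static_probs[ch])
            + (1.0 - alpha) * math.log(dynamic_probs[ch])
        )
    best = max(combined, key=combined.get)
    return best, combined

# Hypothetical scores: the static model prefers "a", the dynamic model prefers "o".
static_probs = {"a": 0.6, "o": 0.4}
dynamic_probs = {"a": 0.3, "o": 0.7}

best, scores = combine_scores(static_probs, dynamic_probs)
print(best)  # o
```

With equal weights, the candidate whose product of the two probabilities is largest wins, so a confident dynamic score can override a mildly confident static one.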

6.

Granted invention patent
Automatic handwriting recognition using both static and dynamic parameters (Expired)

Publication No.: US5544264A

Publication date: 1996-08-06

Application No.: US451001

Filing date: 1995-05-25

Applicants: Jerome R. Bellegarda, David Nahamoo, Krishna S. Nathan

Inventors: Jerome R. Bellegarda, David Nahamoo, Krishna S. Nathan

IPC classification: G06K9/46, G06K9/03, G06K9/22, G06K9/62, G06K9/68, G06K9/00

CPC classification: G06K9/6293, G06K9/00416, G06K9/00429

Abstract: Methods and apparatus are disclosed for recognizing handwritten characters in response to an input signal from a handwriting transducer. A feature extraction and reduction procedure is disclosed that relies on static or shape information, wherein the temporal order in which points are captured by an electronic tablet may be disregarded. A method of the invention generates and processes the tablet data with three independent sets of feature vectors which encode the shape information of the input character information. These feature vectors include horizontal (x-axis) and vertical (y-axis) slices of a bit-mapped image of the input character data, and an additional feature vector to encode an absolute y-axis displacement from a baseline of the bit-mapped image. It is shown that the recognition errors that result from the spatial or static processing are quite different from those resulting from temporal or dynamic processing. Furthermore, it is shown that these differences complement one another. As a result, a combination of these two sources of feature vector information provides a substantial reduction in an overall recognition error rate. Methods to combine probability scores from dynamic and the static character models are also disclosed.

7.

Granted invention patent
Automatic handwriting recognition using both static and dynamic parameters (Expired)

Publication No.: US5491758A

Publication date: 1996-02-13

Application No.: US009515

Filing date: 1993-01-27

Applicants: Jerome R. Bellegarda, David Nahamoo, Krishna S. Nathan

Inventors: Jerome R. Bellegarda, David Nahamoo, Krishna S. Nathan

IPC classification: G06K9/46, G06K9/03, G06K9/22, G06K9/62, G06K9/68, G06K9/00

CPC classification: G06K9/6293, G06K9/00416, G06K9/00429

Abstract: Methods and apparatus are disclosed for recognizing handwritten characters in response to an input signal from a handwriting transducer. A feature extraction and reduction procedure is disclosed that relies on static or shape information, wherein the temporal order in which points are captured by an electronic tablet may be disregarded. A method of the invention generates and processes the tablet data with three independent sets of feature vectors which encode the shape information of the input character information. These feature vectors include horizontal (x-axis) and vertical (y-axis) slices of a bit-mapped image of the input character data, and an additional feature vector to encode an absolute y-axis displacement from a baseline of the bit-mapped image. It is shown that the recognition errors that result from the spatial or static processing are quite different from those resulting from temporal or dynamic processing. Furthermore, it is shown that these differences complement one another. As a result, a combination of these two sources of feature vector information provides a substantial reduction in an overall recognition error rate. Methods to combine probability scores from dynamic and the static character models are also disclosed.

8.

Granted invention patent
Automatic handwriting recognition using both static and dynamic parameters (Expired)

Publication No.: US5544261A

Publication date: 1996-08-06

Application No.: US450556

Filing date: 1995-05-25

Applicants: Jerome R. Bellegarda, David Nahamoo, Krishna S. Nathan

Inventors: Jerome R. Bellegarda, David Nahamoo, Krishna S. Nathan

IPC classification: G06K9/46, G06K9/03, G06K9/22, G06K9/62, G06K9/68, G06K9/00

CPC classification: G06K9/6293, G06K9/00416, G06K9/00429

Abstract: Methods and apparatus are disclosed for recognizing handwritten characters in response to an input signal from a handwriting transducer. A feature extraction and reduction procedure is disclosed that relies on static or shape information, wherein the temporal order in which points are captured by an electronic tablet may be disregarded. A method of the invention generates and processes the tablet data with three independent sets of feature vectors which encode the shape information of the input character information. These feature vectors include horizontal (x-axis) and vertical (y-axis) slices of a bit-mapped image of the input character data, and an additional feature vector to encode an absolute y-axis displacement from a baseline of the bit-mapped image. It is shown that the recognition errors that result from the spatial or static processing are quite different from those resulting from temporal or dynamic processing. Furthermore, it is shown that these differences complement one another. As a result, a combination of these two sources of feature vector information provides a substantial reduction in an overall recognition error rate. Methods to combine probability scores from dynamic and the static character models are also disclosed.

9.

Granted invention patent
Automatic handwriting recognition using both static and dynamic parameters (Expired)

Publication No.: US5539839A

Publication date: 1996-07-23

Application No.: US450558

Filing date: 1995-05-25

Applicants: Jerome R. Bellegarda, David Nahamoo, Krishna S. Nathan

Inventors: Jerome R. Bellegarda, David Nahamoo, Krishna S. Nathan

IPC classification: G06K9/46, G06K9/03, G06K9/22, G06K9/62, G06K9/68, G06K9/00

CPC classification: G06K9/6293, G06K9/00416, G06K9/00429

Abstract: Methods and apparatus are disclosed for recognizing handwritten characters in response to an input signal from a handwriting transducer. A feature extraction and reduction procedure is disclosed that relies on static or shape information, wherein the temporal order in which points are captured by an electronic tablet may be disregarded. A method of the invention generates and processes the tablet data with three independent sets of feature vectors which encode the shape information of the input character information. These feature vectors include horizontal (x-axis) and vertical (y-axis) slices of a bit-mapped image of the input character data, and an additional feature vector to encode an absolute y-axis displacement from a baseline of the bit-mapped image. It is shown that the recognition errors that result from the spatial or static processing are quite different from those resulting from temporal or dynamic processing. Furthermore, it is shown that these differences complement one another. As a result, a combination of these two sources of feature vector information provides a substantial reduction in an overall recognition error rate. Methods to combine probability scores from dynamic and the static character models are also disclosed.

10.

Granted invention patent
Continuous parameter hidden Markov model approach to automatic handwriting recognition (Expired)

Publication No.: US5636291A

Publication date: 1997-06-03

Application No.: US467615

Filing date: 1995-06-06

Applicants: Eveline J. Bellegarda, Jerome R. Bellegarda, David Nahamoo, Krishna S. Nathan

Inventors: Eveline J. Bellegarda, Jerome R. Bellegarda, David Nahamoo, Krishna S. Nathan

IPC classification: G06K9/62, G06K9/68, G06K9/70, G06K9/00, G06F15/00

CPC classification: G06K9/6297

Abstract: A computer-based system and method for recognizing handwriting. The present invention includes a pre-processor, a front end, and a modeling component. The present invention operates as follows. First, the present invention identifies the lexemes for all characters of interest. Second, the present invention performs a training phase in order to generate a hidden Markov model for each of the lexemes. Third, the present invention performs a decoding phase to recognize handwritten text. Hidden Markov models for lexemes are produced during the training phase. The present invention performs the decoding phase as follows. The present invention receives test characters to be decoded (that is, to be recognized). The present invention generates sequences of feature vectors for the test characters by mapping in chirographic space. For each of the test characters, the present invention computes probabilities that the test character can be generated by the hidden Markov models. The present invention decodes the test character as the recognized character associated with the hidden Markov model having the greatest probability.
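
The decoding phase described here (score a test character against every lexeme model and pick the most probable one) can be sketched with the standard forward algorithm. For brevity this sketch uses discrete-output hidden Markov models, whereas the patent concerns continuous-parameter models; the two toy models, their parameters, and the function names are hypothetical.

```python
def forward_probability(obs, start, trans, emit):
    """Probability that a discrete HMM generates the observation sequence.

    obs: non-empty list of observation symbol indices.
    start[s]: initial probability of state s.
    trans[p][s]: transition probability from state p to state s.
    emit[s][o]: probability that state s emits symbol o.
    """
    n = len(start)
    alpha = [start[s] * emit[s][obs[0]] for s in range(n)]
    for o in obs[1:]:
        alpha = [
            sum(alpha[p] * trans[p][s] for p in range(n)) * emit[s][o]
            for s in range(n)
        ]
    return sum(alpha)

def decode(obs, models):
    """Return the name of the model with the greatest forward probability."""
    return max(models, key=lambda name: forward_probability(obs, *models[name]))

# Two hypothetical left-to-right lexeme models over two observation symbols.
left_to_right = [[0.5, 0.5], [0.0, 1.0]]
models = {
    "a": ([1.0, 0.0], left_to_right, [[0.9, 0.1], [0.2, 0.8]]),
    "b": ([1.0, 0.0], left_to_right, [[0.1, 0.9], [0.8, 0.2]]),
}

observations = [0, 1, 1]
print(decode(observations, models))  # a
```

A production decoder would typically work in log space to avoid underflow on long sequences; the plain products here keep the sketch short.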

