专利检索 ap:("Scott Axelrod" OR "Sreeram Balakrishnan" OR "Stanley Chen" OR "Yuging Gao" OR "Ramesh Gopinath" OR "Hong-Kwang Kuo" OR "Benoit Maison" OR "David Nahamoo" OR "Michael Picheny" OR "George Saon" OR "Geoffrey Zweig") AND inv:"Yuging Gao" 第 1 页

1.

发明申请
Speech recognition utilizing multitude of speech features 失效
标题翻译：语音识别利用多种语音特征

公开(公告)号：US20050119885A1

公开(公告)日：2005-06-02

申请号：US10724536

申请日：2003-11-28

申请人： Scott Axelrod , Sreeram Balakrishnan , Stanley Chen , Yuging Gao , Ramesh Gopinath , Hong-Kwang Kuo , Benoit Maison , David Nahamoo , Michael Picheny , George Saon , Geoffrey Zweig

发明人： Scott Axelrod , Sreeram Balakrishnan , Stanley Chen , Yuging Gao , Ramesh Gopinath , Hong-Kwang Kuo , Benoit Maison , David Nahamoo , Michael Picheny , George Saon , Geoffrey Zweig

IPC分类号： G10L15/10 , G10L15/00 , G10L15/02 , G10L15/06 , G10L15/14

CPC分类号： G10L15/063 , G10L15/02 , G10L15/14 , G10L2015/085

摘要： In a speech recognition system, the combination of a log-linear model with a multitude of speech features is provided to recognize unknown speech utterances. The speech recognition system models the posterior probability of linguistic units relevant to speech recognition using a log-linear model. The posterior model captures the probability of the linguistic unit given the observed speech features and the parameters of the posterior model. The posterior model may be determined using the probability of the word sequence hypotheses given a multitude of speech features. Log-linear models are used with features derived from sparse or incomplete data. The speech features that are utilized may include asynchronous, overlapping, and statistically non-independent speech features. Not all features used in training need to appear in testing/recognition.

摘要翻译： 在语音识别系统中，提供了具有多个语音特征的对数线性模型的组合来识别未知语音语音。语音识别系统使用对数线性模型对与语音识别相关的语言单位的后验概率进行建模。后验模型捕获了语言单位给出观察到的语音特征和后验模型参数的概率。可以使用给定多个语音特征的单词序列假设的概率来确定后验模型。对数线性模型与来自稀疏或不完整数据的特征一起使用。所使用的语音特征可以包括异步，重叠和统计上非独立的语音特征。培训中使用的并非所有功能都需要出现在测试/识别中。

2.

发明申请
SPEECH RECOGNITION UTILIZING MULTITUDE OF SPEECH FEATURES 审中-公开
标题翻译：语音识别利用多种语音特征

公开(公告)号：US20080312921A1

公开(公告)日：2008-12-18

申请号：US12195123

申请日：2008-08-20

申请人： Scott E. Axelrod , Sreeram Viswanath Balakrishnan , Stanley F. Chen , Yuging Gao , Rameah A. Gopinath , Hong-Kwang Kuo , Benoit Maison , David Nahamoo , Michael Alan Picheny , George A. Saon , Geoffrey G. Zweig

发明人： Scott E. Axelrod , Sreeram Viswanath Balakrishnan , Stanley F. Chen , Yuging Gao , Rameah A. Gopinath , Hong-Kwang Kuo , Benoit Maison , David Nahamoo , Michael Alan Picheny , George A. Saon , Geoffrey G. Zweig

IPC分类号： G10L15/00 , G10L15/04

CPC分类号： G10L15/063 , G10L15/02 , G10L15/14 , G10L2015/085

摘要： In a speech recognition system, the combination of a log-linear model with a multitude of speech features is provided to recognize unknown speech utterances. The speech recognition system models the posterior probability of linguistic units relevant to speech recognition using a log-linear model. The posterior model captures the probability of the linguistic unit given the observed speech features and the parameters of the posterior model. The posterior model may be determined using the probability of the word sequence hypotheses given a multitude of speech features. Log-linear models are used with features derived from sparse or incomplete data. The speech features that are utilized may include asynchronous, overlapping, and statistically non-independent speech features. Not all features used in training need to appear in testing/recognition.

摘要翻译： 在语音识别系统中，提供了具有多个语音特征的对数线性模型的组合来识别未知语音语音。语音识别系统使用对数线性模型对与语音识别相关的语言单位的后验概率进行建模。后验模型捕获了语言单位给出观察到的语音特征和后验模型参数的概率。可以使用给定多个语音特征的单词序列假设的概率来确定后验模型。对数线性模型与来自稀疏或不完整数据的特征一起使用。所使用的语音特征可以包括异步，重叠和统计上非独立的语音特征。培训中使用的并非所有功能都需要出现在测试/识别中。

3.

发明授权
Speech recognition utilizing multitude of speech features 失效
标题翻译：语音识别利用多种语音特征

公开(公告)号：US07464031B2

公开(公告)日：2008-12-09

申请号：US10724536

申请日：2003-11-28

申请人： Scott E. Axelrod , Sreeram Viswanath Balakrishnan , Stanley F. Chen , Yuging Gao , Ramesh A. Gopinath , Hong-Kwang Kuo , Benoit Maison , David Nahamoo , Michael Alan Picheny , George A. Saon , Geoffrey G. Zweig

发明人： Scott E. Axelrod , Sreeram Viswanath Balakrishnan , Stanley F. Chen , Yuging Gao , Ramesh A. Gopinath , Hong-Kwang Kuo , Benoit Maison , David Nahamoo , Michael Alan Picheny , George A. Saon , Geoffrey G. Zweig

IPC分类号： G10L15/00 , G10L15/20

CPC分类号： G10L15/063 , G10L15/02 , G10L15/14 , G10L2015/085

摘要： In a speech recognition system, the combination of a log-linear model with a multitude of speech features is provided to recognize unknown speech utterances. The speech recognition system models the posterior probability of linguistic units relevant to speech recognition using a log-linear model. The posterior model captures the probability of the linguistic unit given the observed speech features and the parameters of the posterior model. The posterior model may be determined using the probability of the word sequence hypotheses given a multitude of speech features. Log-linear models are used with features derived from sparse or incomplete data. The speech features that are utilized may include asynchronous, overlapping, and statistically non-independent speech features. Not all features used in training need to appear in testing/recognition.

摘要翻译： 在语音识别系统中，提供了具有多个语音特征的对数线性模型的组合来识别未知语音语音。语音识别系统使用对数线性模型对与语音识别相关的语言单位的后验概率进行建模。后验模型捕获了语言单位给出观察到的语音特征和后验模型参数的概率。可以使用给定多个语音特征的单词序列假设的概率来确定后验模型。对数线性模型与来自稀疏或不完整数据的特征一起使用。所使用的语音特征可以包括异步，重叠和统计上非独立的语音特征。培训中使用的并非所有功能都需要出现在测试/识别中。

4.

发明授权
Systems and methods for fast and memory efficient machine translation using statistical integrated phase lattice 有权
标题翻译：使用统计综合相位格的快速和高效的机器翻译的系统和方法

公开(公告)号：US08229731B2

公开(公告)日：2012-07-24

申请号：US12824806

申请日：2010-06-28

申请人： Stanley Chen , Yuging Gao , Bowen Zhou

发明人： Stanley Chen , Yuging Gao , Bowen Zhou

IPC分类号： G06F17/28

CPC分类号： G06F17/2818

摘要： A phrase-based translation system and method includes a statistically integrated phrase lattice (SIPL) (H) which represents an entire translational model. An input (I) is translated by determining a best path through an entire lattice (S) by performing an efficient composition operation between the input and the SIPL. The efficient composition operation is performed by a multiple level search where each operand in the efficient composition operation represents a different search level.

摘要翻译： 基于短语的翻译系统和方法包括代表整个翻译模型的统计学上综合的词组（SIPL）（H）。通过在输入和SIPL之间执行有效的组合操作来确定通过整个网格（S）的最佳路径来转换输入（I）。通过多级搜索来执行高效合成操作，其中高效合成操作中的每个操作数表示不同的搜索级别。

5.

发明申请
SYSTEM AND METHOD FOR APPLYING BRIDGING MODELS FOR ROBUST AND EFFICIENT SPEECH TO SPEECH TRANSLATION 有权
标题翻译：将语音模型应用于语音翻译的系统与方法

公开(公告)号：US20090299724A1

公开(公告)日：2009-12-03

申请号：US12128199

申请日：2008-05-28

申请人： Yonggang Deng , Yuging Gao , Bing Xiang

发明人： Yonggang Deng , Yuging Gao , Bing Xiang

IPC分类号： G06F17/28

CPC分类号： G06F17/2809

摘要： A system and method for speech translation includes a bridge module connected between a first component and a second component. The bridge module includes a transformation model configured to receive an original hypothesis output from a first component. The transformation model has one or more transformation features configured to transform the original hypothesis into a new hypothesis that is more easily translated by the second component.

摘要翻译： 用于语音翻译的系统和方法包括连接在第一组件和第二组件之间的桥模块。桥模块包括被配置为从第一组件接收原始假设输出的变换模型。转换模型具有一个或多个变换特征，其被配置为将原始假设转换为更容易被第二组件翻译的新假设。

6.

发明授权
Feature vector-based apparatus and method for robust pattern recognition 有权
标题翻译：基于特征向量的鲁棒模式识别装置和方法

公开(公告)号：US07054810B2

公开(公告)日：2006-05-30

申请号：US09968051

申请日：2001-10-01

申请人： Yuging Gao , Michael A. Picheny , Bhuvana Ramabhadran

发明人： Yuging Gao , Michael A. Picheny , Bhuvana Ramabhadran

IPC分类号： G10L15/00

CPC分类号： G10L15/02

摘要： N sets of feature vectors are generated from a set of observation vectors which are indicative of a pattern which it is desired to recognize. At least one of the sets of feature vectors is different than at least one other of the sets of feature vectors, and is preselected for purposes of containing at least some complimentary information with regard to the at least one other set of feature vectors. The N sets of feature vectors are combined in a manner to obtain an optimized set of feature vectors which best represents the pattern. The combination is performed via one of a weighted likelihood combination scheme and a rank-based state-selection scheme; preferably, it is done in accordance with an equation set forth herein. In one aspect, a weighted likelihood combination can be employed, while in another aspect, rank-based state selection can be employed. An apparatus suitable for performing the method is described, and implementation in a computer program product is also contemplated. The invention is applicable to any type of pattern recognition problem where robustness is important, such as, for example, recognition of speech, handwriting or optical characters under challenging conditions.

摘要翻译： 从指示希望识别的图案的一组观察向量生成N组特征向量。所述特征向量集合中的至少一个不同于所述特征向量集合中的至少另一个，并且为了至少包含关于所述至少一个其他特征向量集合的补充信息的目的而被预先选择。 N组特征向量以一种方式组合以获得最佳表示图案的特征向量的优化集合。组合通过加权似然组合方案和基于秩的状态选择方案之一执行; 优选地，根据本文所阐述的等式进行。在一个方面，可以采用加权似然组合，而在另一方面，可以采用基于秩的状态选择。描述了适用于执行该方法的装置，并且还考虑了计算机程序产品中的实现。本发明可应用于任何类型的鲁棒性重要的模式识别问题，例如在挑战性条件下的语音识别，手写或光学特征。

搜索结果

国家/区域

专利有效性

申请日

公布(公告)日

申请人

申请人所在国/区域

发明人

IPC

IPC部

IPC大类

IPC小类

IPC大组

IPC小组

外观分类