Patent search: ap:("Geoffrey G. Zweig" OR "Eric J. Stollnitz" OR "Richard Szeliski" OR "Sudipta Sinha" OR "Johannes Kopf") AND inv:"Geoffrey G. Zweig" (page 2)

11.

Granted patent
Methods and apparatus for correlating biometric attributes and biometric attribute production features (status: active)

Publication No.: US06411933B1

Publication date: 2002-06-25

Application No.: US09444684

Filing date: 1999-11-22

Applicant: Stephane Herman Maes; Geoffrey G. Zweig

Inventor: Stephane Herman Maes; Geoffrey G. Zweig

IPC classification: G10L2100

CPC classification: G10L17/26, A61B5/117, A61B5/4803, G06K9/00892, G07C9/00158

Abstract: A method of validating production of a biometric attribute allegedly associated with a user comprises the following steps. A first signal is generated representing data associated with the biometric attribute allegedly received in association with the user. A second signal is also generated representing data associated with at least one feature detected in association with the production of the biometric attribute allegedly received from the user. Then, the first signal and the second signal are compared to determine a correlation level between the biometric attribute and the production feature, wherein the validation of the production of the biometric attribute depends on the correlation level. Accordingly, the invention serves to provide substantial assurance that the biometric attribute offered by the user has been physically generated by the user.

12.

Patent application
AUTOMATIC SPEECH RECOGNITION BASED UPON INFORMATION RETRIEVAL METHODS (status: pending, published)

Publication No.: US20110224982A1

Publication date: 2011-09-15

Application No.: US12722556

Filing date: 2010-03-12

Applicant: Alejandro Acero; James Garnet Droppo, III; Xiaoqiang Xiao; Geoffrey G. Zweig

Inventor: Alejandro Acero; James Garnet Droppo, III; Xiaoqiang Xiao; Geoffrey G. Zweig

IPC classification: G10L15/02

CPC classification: G10L15/08, G10L2015/025

Abstract: Described is a technology in which information retrieval (IR) techniques are used in a speech recognition (ASR) system. Acoustic units (e.g., phones, syllables, multi-phone units, words and/or phrases) are decoded, and features found from those acoustic units. The features are then used with IR techniques (e.g., TF-IDF based retrieval) to obtain a target output (a word or words). Also described is the use of IR techniques to provide a full large vocabulary continuous speech (LVCSR) recognizer.

13.

Granted patent
Method for clustering closely resembling data objects (status: lapsed)

Publication No.: US6119124A

Publication date: 2000-09-12

Application No.: US48653

Filing date: 1998-03-26

Applicant: Andrei Z. Broder; Steven C. Glassman; Charles G. Nelson; Mark S. Manasse; Geoffrey G. Zweig

Inventor: Andrei Z. Broder; Steven C. Glassman; Charles G. Nelson; Mark S. Manasse; Geoffrey G. Zweig

IPC classification: G06F17/30

CPC classification: G06F17/3071, Y10S707/99932, Y10S707/99933, Y10S707/99935, Y10S707/99944

Abstract: A computer-implemented method determines the resemblance of data objects such as Web pages. Each data object is partitioned into a sequence of tokens. The tokens are grouped into overlapping sets of the tokens to form shingles. Each shingle is represented by a unique identification element encoded as a fingerprint. A minimum element from each of the images of the set of fingerprints associated with a document under each of a plurality of pseudo random permutations of the set of all fingerprints are selected to generate a sketch of each data object. The sketches characterize the resemblance of the data objects. The sketches can be further partitioned into a plurality of groups. Each group is fingerprinted to form a feature. Data objects that share more than a certain numbers of features are estimated to be nearly identical.
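
The steps in this abstract (shingling, fingerprinting, taking a minimum under each pseudo-random permutation, then grouping the sketch into features) can be illustrated with a minimal Python sketch. It is not the patented implementation: seeded BLAKE2b hashes stand in for the pseudo-random permutations, whitespace splitting stands in for tokenization, and the names shingles, fingerprint, sketch, resemblance, and features, along with the parameters w, num_perm, and group_size, are all hypothetical.

```python
import hashlib


def shingles(text, w=3):
    """Group tokens into overlapping windows of w tokens (assumed tokenizer: whitespace)."""
    tokens = text.lower().split()
    return {" ".join(tokens[i:i + w]) for i in range(max(len(tokens) - w + 1, 1))}


def fingerprint(item, seed):
    """64-bit fingerprint of a string; the seed selects one stand-in 'permutation'."""
    data = f"{seed}:{item}".encode("utf-8")
    return int.from_bytes(hashlib.blake2b(data, digest_size=8).digest(), "big")


def sketch(text, num_perm=64, w=3):
    """Keep the minimum fingerprint of the shingle set under each seeded hash."""
    sh = shingles(text, w)
    return [min(fingerprint(s, seed) for s in sh) for seed in range(num_perm)]


def resemblance(sk_a, sk_b):
    """Fraction of matching sketch positions, an estimate of shingle-set overlap."""
    return sum(x == y for x, y in zip(sk_a, sk_b)) / len(sk_a)


def features(sk, group_size=8):
    """Partition the sketch into groups and fingerprint each group to form a feature."""
    return {
        fingerprint(",".join(map(str, sk[i:i + group_size])), seed=i)
        for i in range(0, len(sk), group_size)
    }


if __name__ == "__main__":
    a = sketch("the quick brown fox jumps over the lazy dog near the river bank")
    b = sketch("the quick brown fox jumps over the lazy dog near the river bend")
    print("estimated resemblance:", resemblance(a, b))
    print("shared features:", len(features(a) & features(b)))
```

Two objects would then be treated as near-duplicates when the number of shared features exceeds a chosen threshold; the threshold itself is a tuning choice that the abstract does not specify.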

14.

Granted patent
Searching a database of listings (status: active)

Publication No.: US09218412B2

Publication date: 2015-12-22

Application No.: US11746847

Filing date: 2007-05-10

Applicant: Ye-Yi Wang; Dong Yu; Yun-Cheng Ju; Alejandro Acero; Geoffrey G. Zweig

Inventor: Ye-Yi Wang; Dong Yu; Yun-Cheng Ju; Alejandro Acero; Geoffrey G. Zweig

IPC classification: G06F7/00, G06F17/30, G06F3/06, G10L15/187, G10L15/197

CPC classification: G06F17/30663, G06F3/0641, G06F17/3069, G10L15/187, G10L15/197

Abstract: A database having listings rather than long documents is searched using a term frequency-inverse document frequency (Tf/Idf) algorithm.
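
As a rough illustration of Tf/Idf search over short listings, the following Python sketch ranks listings by cosine similarity between Tf/Idf vectors. The abstract does not say which weighting or similarity the patent uses, so the smoothed IDF formula, the cosine scoring, the sample listings, and the function names build_index, cosine, and search are all assumptions made for this example.

```python
import math
from collections import Counter


def build_index(listings):
    """Return (idf, vectors): IDF per term and one Tf/Idf vector per listing."""
    docs = [Counter(text.lower().split()) for text in listings]
    n = len(docs)
    df = Counter(term for doc in docs for term in doc)  # document frequency
    # Assumed smoothed IDF; other variants would work as well.
    idf = {t: math.log((1 + n) / (1 + c)) + 1.0 for t, c in df.items()}
    vectors = [{t: tf * idf[t] for t, tf in doc.items()} for doc in docs]
    return idf, vectors


def cosine(u, v):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(w * v.get(t, 0.0) for t, w in u.items())
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


def search(query, listings, idf, vectors):
    """Return the best-matching listing and its score; unknown query terms are ignored."""
    counts = Counter(query.lower().split())
    q_vec = {t: tf * idf[t] for t, tf in counts.items() if t in idf}
    scores = [cosine(q_vec, v) for v in vectors]
    best = max(range(len(listings)), key=scores.__getitem__)
    return listings[best], scores[best]


if __name__ == "__main__":
    listings = ["Joe's Pizza Seattle", "Seattle Public Library", "Pizza Hut Bellevue"]
    idf, vectors = build_index(listings)
    print(search("joe's pizza", listings, idf, vectors))
```

Because listings are only a few words long, term frequency is almost always 1, so in this sketch the ranking is driven mainly by IDF and by listing length through the cosine normalization.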

15.

Patent application
Automated Data Cleanup (status: active)

Publication No.: US20100076752A1

Publication date: 2010-03-25

Application No.: US12561521

Filing date: 2009-09-17

Applicant: Geoffrey G. Zweig; Yun-Cheng Ju

Inventor: Geoffrey G. Zweig; Yun-Cheng Ju

IPC classification: G06F17/21, G10L15/26

CPC classification: G10L15/063, G06F17/2735, G10L15/187

Abstract: The described implementations relate to automated data cleanup. One system includes a language model generated from language model seed text and a dictionary of possible data substitutions. This system also includes a transducer configured to cleanse a corpus utilizing the language model and the dictionary.

16.

Granted patent
Speech recognition utilizing multitude of speech features (status: lapsed)

Publication No.: US07464031B2

Publication date: 2008-12-09

Application No.: US10724536

Filing date: 2003-11-28

Applicant: Scott E. Axelrod; Sreeram Viswanath Balakrishnan; Stanley F. Chen; Yuging Gao; Ramesh A. Gopinath; Hong-Kwang Kuo; Benoit Maison; David Nahamoo; Michael Alan Picheny; George A. Saon; Geoffrey G. Zweig

Inventor: Scott E. Axelrod; Sreeram Viswanath Balakrishnan; Stanley F. Chen; Yuging Gao; Ramesh A. Gopinath; Hong-Kwang Kuo; Benoit Maison; David Nahamoo; Michael Alan Picheny; George A. Saon; Geoffrey G. Zweig

IPC classification: G10L15/00, G10L15/20

CPC classification: G10L15/063, G10L15/02, G10L15/14, G10L2015/085

Abstract: In a speech recognition system, the combination of a log-linear model with a multitude of speech features is provided to recognize unknown speech utterances. The speech recognition system models the posterior probability of linguistic units relevant to speech recognition using a log-linear model. The posterior model captures the probability of the linguistic unit given the observed speech features and the parameters of the posterior model. The posterior model may be determined using the probability of the word sequence hypotheses given a multitude of speech features. Log-linear models are used with features derived from sparse or incomplete data. The speech features that are utilized may include asynchronous, overlapping, and statistically non-independent speech features. Not all features used in training need to appear in testing/recognition.
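
A generic log-linear posterior has the form P(u | x) = exp(sum_i lambda_i * f_i(u, x)) / Z(x), where Z(x) normalizes over all candidate units u. The Python sketch below shows that form with binary features; it is a textbook illustration under stated assumptions, not the model claimed in the patent. The function name log_linear_posterior, the weight table, and the feature names are hypothetical.

```python
import math


def log_linear_posterior(weights, active_features):
    """Posterior over linguistic units given the binary features observed.

    weights: {unit: {feature_name: lambda}} (hypothetical trained parameters).
    active_features: iterable of feature names that fired for this observation.
    Features without a trained weight contribute 0 to a unit's score.
    """
    active = list(active_features)
    scores = {
        unit: sum(w.get(f, 0.0) for f in active)
        for unit, w in weights.items()
    }
    # Subtract the maximum score before exponentiating for numerical stability.
    top = max(scores.values())
    exps = {unit: math.exp(s - top) for unit, s in scores.items()}
    z = sum(exps.values())
    return {unit: e / z for unit, e in exps.items()}


if __name__ == "__main__":
    weights = {
        "yes": {"high_pitch": 1.2, "short_duration": 0.4},
        "no": {"nasal_onset": 1.5, "short_duration": 0.3},
    }
    # Only a subset of the trained features is observed at recognition time.
    print(log_linear_posterior(weights, ["short_duration", "nasal_onset"]))
```

Because each unit's score is a plain sum over whichever features fired, overlapping or statistically dependent features can be combined freely, and a feature that never fires at recognition time simply drops out of the sum.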

17.

Granted patent
Lattice-based unsupervised maximum likelihood linear regression for speaker adaptation (status: active)

Publication No.: US07216077B1

Publication date: 2007-05-08

Application No.: US09670251

Filing date: 2000-09-26

Applicant: Mukund Padmanabhan; George A. Saon; Geoffrey G. Zweig

Inventor: Mukund Padmanabhan; George A. Saon; Geoffrey G. Zweig

IPC classification: G10L15/06, G10L15/14

CPC classification: G10L15/065

Abstract: Methods and arrangements using lattice-based information for unsupervised speaker adaptation. By performing adaptation against a word lattice, correct models are more likely to be used in estimating a transform. Further, a particular type of lattice proposed herein enables the use of a natural confidence measure given by the posterior occupancy probability of a state, that is, the statistics of a particular state will be updated with the current frame only if the a posteriori probability of the state at that particular time is greater than a predetermined threshold.
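
The confidence gate described in this abstract (update a state's statistics with a frame only when the state's posterior occupancy at that time exceeds a threshold) can be sketched in a few lines of Python. This is a simplified illustration only: frames are scalars rather than feature vectors, the posteriors are given rather than computed from a lattice, weighting the accumulated statistics by the posterior is an assumption, and the name accumulate_stats and the default threshold of 0.9 are hypothetical.

```python
def accumulate_stats(frames, posteriors, threshold=0.9):
    """Accumulate per-state statistics from confidently aligned frames only.

    frames: list of scalar observations, one per time step.
    posteriors: list of {state: occupancy probability}, one dict per time step.
    Returns {state: {"count": ..., "sum": ...}} with posterior-weighted totals.
    """
    stats = {}
    for x, occupancy in zip(frames, posteriors):
        for state, gamma in occupancy.items():
            if gamma > threshold:  # confidence gate on posterior occupancy
                entry = stats.setdefault(state, {"count": 0.0, "sum": 0.0})
                entry["count"] += gamma
                entry["sum"] += gamma * x
    return stats


if __name__ == "__main__":
    frames = [1.0, 2.0, 3.0]
    posteriors = [
        {"a": 0.95, "b": 0.05},  # confident: frame counts toward state "a"
        {"a": 0.50, "b": 0.50},  # ambiguous: frame is skipped for both states
        {"a": 0.01, "b": 0.99},  # confident: frame counts toward state "b"
    ]
    print(accumulate_stats(frames, posteriors))
```

In a full adaptation system these gated statistics would feed the estimation of the linear transform; that estimation step is outside the scope of this sketch.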
