Patent search: ap:("Tirso M. Alonso" OR "Ilana Bromberg" OR "Dilek Z. Hakkani-Tur" OR "Barbara B. Hollister" OR "Mazin G. Rahim" OR "Giuseppe Riccardi" OR "Lawrence Lyon Rose" OR "Daniel Leon Stern" OR "Gokhan Tur" OR "James M. Wilson") AND inv:"Mazin G. Rahim" (page 7)

61.

Granted patent
Method and apparatus for automatically building conversational systems (in force)

Publication No.: US08462917B2

Publication date: 2013-06-11

Application No.: US13465659

Filing date: 2012-05-07

Applicants: Srinivas Bangalore, Mazin G. Rahim, Junlan Feng

Inventors: Srinivas Bangalore, Mazin G. Rahim, Junlan Feng

IPC classification: H04M1/64, H04M3/493, G06Q10/06, G06Q30/02, G06Q30/06, H04L12/64, H04M7/00

CPC classification: G06F17/243, G06Q10/06311, G06Q30/0201, G06Q30/0635, H04L12/6418, H04M3/4938, H04M7/0009, H04M7/0036, H04M2201/40, H04M2201/60, H04M2203/355

Abstract: A system and method provides a natural language interface to world-wide web content. Either in advance or dynamically, webpage content is parsed using a parsing algorithm. A person using a telephone interface can provide speech information, which is converted to text and used to automatically fill in input fields on a webpage form. The form is then submitted to a database search and a response is generated. Information contained on the responsive webpage is extracted and converted to speech via a text-to-speech engine and communicated to the person.

62.

Granted patent
System and method of providing a spoken dialog interface to a website (in force)

Publication No.: US08442834B2

Publication date: 2013-05-14

Application No.: US13587554

Filing date: 2012-08-16

Applicants: Srinivas Bangalore, Junlan Feng, Mazin G. Rahim

Inventors: Srinivas Bangalore, Junlan Feng, Mazin G. Rahim

IPC classification: G10L15/22, G10L15/18

CPC classification: G10L13/08, G06F17/27, G10L15/063, G10L15/183, G10L15/22, H04M3/4936

Abstract: Disclosed is a method for training a spoken dialog service component from website data. Spoken dialog service components typically include an automatic speech recognition module, a language understanding module, a dialog management module, a language generation module and a text-to-speech module. The method includes selecting anchor texts within a website based on a term density, weighting those anchor texts based on a percent of salient words to total words, and incorporating the weighted anchor texts into a live spoken dialog interface, the weights determining a level of incorporation into the live spoken dialog interface.
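The weighting step in this abstract (percent of salient words to total words) can be illustrated with a minimal sketch. The function name, the whitespace tokenizer, and the `SALIENT` vocabulary are assumptions made for illustration; the abstract does not say how salient words are identified, and the term-density selection step is not modeled here.

```python
def salient_weight(anchor_text, salient_words):
    """Return the fraction of words in anchor_text that appear in salient_words."""
    words = anchor_text.lower().split()  # assumed tokenizer: lowercase, split on whitespace
    if not words:
        return 0.0
    hits = sum(1 for w in words if w in salient_words)
    return hits / len(words)


# Hypothetical salient vocabulary and anchor texts, for illustration only.
SALIENT = {"billing", "account", "roaming"}
anchors = ["View your billing account", "Click here"]

# Map each anchor text to its weight; a higher weight would mean
# stronger incorporation into the dialog interface.
weights = {text: salient_weight(text, SALIENT) for text in anchors}
```

Under these assumptions, an anchor text made up mostly of generic words receives a low weight and would contribute little to the spoken dialog interface.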

63.

Granted patent
System and method of spoken language understanding in human computer dialogs (in force)

Publication No.: US08386262B2

Publication date: 2013-02-26

Application No.: US13481031

Filing date: 2012-05-25

Applicants: Srinivas Bangalore, Narendra K. Gupta, Mazin G. Rahim

Inventors: Srinivas Bangalore, Narendra K. Gupta, Mazin G. Rahim

IPC classification: G10L21/00, G06F17/20, G06F17/27

CPC classification: G10L15/1815, G06F17/2785, G10L13/043, G10L15/02, G10L15/1822, G10L15/265

Abstract: A system and method are disclosed that improve automatic speech recognition in a spoken dialog system. The method comprises partitioning speech recognizer output into self-contained clauses, identifying a dialog act in each of the self-contained clauses, qualifying dialog acts by identifying a current domain object and/or a current domain action, and determining whether further qualification is possible for the current domain object and/or current domain action. If further qualification is possible, then the method comprises identifying another domain action and/or another domain object associated with the current domain object and/or current domain action, reassigning the another domain action and/or another domain object as the current domain action and/or current domain object and then recursively qualifying the new current domain action and/or current object. This process continues until nothing is left to qualify.

64.

Patent application
Systems and Methods for Monitoring Speech Data Labelers (in force)

Publication No.: US20100217597A1

Publication date: 2010-08-26

Application No.: US12772589

Filing date: 2010-05-03

Applicants: Lee Begeja, Richard Vandervoort Cox, Harris Drucker, David Crawford Gibbon, Allen Louis Gorin, Patrick Guy Haffner, Steven H. Lewis, Zhu Liu, Mazin G. Rahim, Bernard S. Renger, Behzad Shahraray

Inventors: Lee Begeja, Richard Vandervoort Cox, Harris Drucker, David Crawford Gibbon, Allen Louis Gorin, Patrick Guy Haffner, Steven H. Lewis, Zhu Liu, Mazin G. Rahim, Bernard S. Renger, Behzad Shahraray

IPC classification: G10L15/00

CPC classification: G10L13/08, G10L15/18

Abstract: Systems and methods for using an annotation guide to label utterances and speech data with a call type are disclosed. A method embodiment monitors labelers of speech data by presenting via a processor a test utterance to a labeler, receiving input from the labeler that selects a particular call type from a list of call types and determining via the processor if the labeler labeled the test utterance correctly. Based on the determining step, the method performs at least one of the following: revising the annotation guide, retraining the labeler or altering the test utterance.

65.

Granted patent
Timing of speech recognition over lossy transmission systems (in force)

Publication No.: US07496503B1

Publication date: 2009-02-24

Application No.: US11611983

Filing date: 2006-12-18

Applicants: Richard Vandervoort Cox, Stephen Michael Marcus, Mazin G. Rahim, Nambirajan Seshadri, Robert Douglas Sharp

Inventors: Richard Vandervoort Cox, Stephen Michael Marcus, Mazin G. Rahim, Nambirajan Seshadri, Robert Douglas Sharp

IPC classification: G10L19/00

CPC classification: G10L15/02, G10L15/20

Abstract: Recognizing a stream of speech received as speech vectors over a lossy communications link includes constructing for a speech recognizer a series of speech vectors from packets received over a lossy packetized transmission link, wherein some of the packets associated with each speech vector are lost or corrupted during transmission. Each constructed speech vector is multi-dimensional and includes associated features. After waiting for a predetermined time, speech vectors are generated and potentially corrupted features within the speech vector are indicated to the speech recognizer when present. Speech recognition is attempted at the speech recognizer on the speech vectors when corrupted features are present. This recognition may be based only on certain or valid features within each speech vector. Retransmission of a missing or corrupted packet is requested when corrupted values are indicated by the indicating step and when the attempted recognition step fails.

66.

Granted patent
Spoken language understanding that incorporates prior knowledge into boosting (in force)

Publication No.: US07328146B1

Publication date: 2008-02-05

Application No.: US11484120

Filing date: 2006-07-11

Applicants: Hiyan Alshawi, Giuseppe DiFabrizzio, Narendra K. Gupta, Mazin G. Rahim, Robert E. Schapire, Yoram Singer

Inventors: Hiyan Alshawi, Giuseppe DiFabrizzio, Narendra K. Gupta, Mazin G. Rahim, Robert E. Schapire, Yoram Singer

IPC classification: G06F17/20

CPC classification: G06F17/28

Abstract: A system for understanding entries, such as speech, develops a classifier by employing prior knowledge with which a given corpus of training entries is enlarged threefold. A rule is created for each of the labels employed in the classifier, and the created rules are applied to the given corpus to create a corpus of attachments by appending a weight of ηp(x), or 1−ηp(x), to labels of entries that meet, or fail to meet, respectively, conditions of the labels' rules, and to also create a corpus of non-attachments by appending a weight of 1−ηp(x), or ηp(x), to labels of entries that meet, or fail to meet conditions of the labels' rules.

67.

Granted patent
Speech recognition over lossy networks with rejection threshold (in force)

Publication No.: US07171359B1

Publication date: 2007-01-30

Application No.: US10902304

Filing date: 2004-07-29

Applicants: Richard Vandervoort Cox, Stephen Michael Marcus, Mazin G. Rahim, Nambirajan Seshadri, Robert Douglas Sharp

Inventors: Richard Vandervoort Cox, Stephen Michael Marcus, Mazin G. Rahim, Nambirajan Seshadri, Robert Douglas Sharp

IPC classification: G10L15/06

CPC classification: G10L15/02, G10L15/20

Abstract: Recognizing a stream of speech received as speech vectors over a lossy communications link includes constructing for a speech recognizer a series of speech vectors from packets received over a lossy packetized transmission link, wherein some of the packets associated with each speech vector are lost or corrupted during transmission. Each constructed speech vector is multi-dimensional and includes associated features. Potentially corrupted features within the speech vector are indicated to the speech recognizer when present. Speech recognition is attempted at the speech recognizer on the speech vectors when corrupted features are present. This recognition may be based only on certain or valid features within each speech vector. Retransmission of a missing or corrupted packet is requested when corrupted values are indicated by the indicating step and when the attempted recognition step fails.
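The control flow in this abstract (recognize using only the valid features, and request retransmission only when features were flagged as corrupted and recognition fails) can be sketched as follows. This is an illustration under stated assumptions, not the patented method: the nearest-template matcher stands in for a real speech recognizer, and `masked_distance`, `recognize_or_retransmit`, and `reject_threshold` are hypothetical names.

```python
import math


def masked_distance(vector, template, valid):
    """Mean squared difference over features flagged valid; None if none are valid."""
    pairs = [(v, t) for v, t, ok in zip(vector, template, valid) if ok]
    if not pairs:
        return None
    return sum((v - t) ** 2 for v, t in pairs) / len(pairs)


def recognize_or_retransmit(vector, valid, templates, reject_threshold):
    """Return (label, retransmit_requested).

    Recognition is attempted on the valid features only. If the best match
    is rejected, retransmission is requested only when at least one feature
    was flagged as corrupted.
    """
    best_label, best_dist = None, math.inf
    for label, template in templates.items():
        d = masked_distance(vector, template, valid)
        if d is not None and d < best_dist:
            best_label, best_dist = label, d
    if best_label is not None and best_dist <= reject_threshold:
        return best_label, False
    corrupted = not all(valid)
    return None, corrupted


# Hypothetical templates and a vector whose second feature was lost in transit.
templates = {"yes": [1.0, 2.0, 3.0], "no": [5.0, 5.0, 5.0]}
result = recognize_or_retransmit([1.0, 99.0, 3.0], [True, False, True], templates, 0.5)
```

In this sketch the corrupted second feature is ignored, so the vector still matches the "yes" template and no retransmission is requested.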

68.

Granted patent
Method and system for performing speech recognition (lapsed)

Publication No.: US5806022A

Publication date: 1998-09-08

Application No.: US575378

Filing date: 1995-12-20

Applicants: Mazin G. Rahim, Jay Gordon Wilpon

Inventors: Mazin G. Rahim, Jay Gordon Wilpon

IPC classification: G10L15/20, G10L11/00, G10L15/02, G10L15/08, G10L15/10, G10L21/02, G10L3/02, G10L9/00, H04M1/76

CPC classification: G10L15/02, G10L15/08, G10L21/02, G10L25/12, G10L25/18

Abstract: Speech recognition processing is compensated for improving robustness of speech recognition in the presence of enhanced speech signals. The compensation overcomes the adverse effects that speech signal enhancement may have on speech recognition performance, where speech signal enhancement causes acoustical mismatches between recognition models trained using unenhanced speech signals and feature data extracted from enhanced speech signals. Compensation is provided at the front end of an automatic speech recognition system by combining linear predictive coding and mel-based cepstral parameter analysis for computing cepstral features of transmitted speech signals used for speech recognition processing by selectively weighting mel-filter banks when processing frequency domain representations of the enhanced speech signals.

69.

Granted patent
Discriminative utterance verification for connected digits recognition (lapsed)

Publication No.: US5737489A

Publication date: 1998-04-07

Application No.: US528902

Filing date: 1995-09-15

Applicants: Wu Chou, Biing-Hwang Juang, Chin-Hui Lee, Mazin G. Rahim

Inventors: Wu Chou, Biing-Hwang Juang, Chin-Hui Lee, Mazin G. Rahim

IPC classification: G09B19/06, G10L15/06, G10L15/10, G10L15/14, G10L15/22, G10L15/28, G10L5/06

CPC classification: G10L15/144, G10L15/063, G10L15/10

Abstract: In a speech recognition system, a recognition processor receives an unknown utterance signal as input. The recognition processor in response to the unknown utterance signal input accesses a recognition database and scores the utterance signal against recognition models in the recognition database to classify the unknown utterance and to generate a hypothesis speech signal. A verification processor receives the hypothesis speech signal as input to be verified. The verification processor accesses a verification database to test the hypothesis speech signal against verification models reflecting a preselected type of training stored in the verification database. Based on the verification test, the verification processor generates a confidence measure signal. The confidence measure signal can be compared against a verification threshold to determine the accuracy of the recognition decision made by the recognition processor.
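The final comparison in this abstract (a confidence measure checked against a verification threshold) might look like the sketch below. The abstract does not define the confidence measure; a log-likelihood difference between the hypothesis score and a verification-model score is assumed here purely for illustration, and the function and parameter names are hypothetical.

```python
def verify(hypothesis_log_likelihood, verification_log_likelihood, threshold):
    """Return (confidence, accepted) for a recognition hypothesis.

    Assumed confidence measure: the log-likelihood of the hypothesis minus
    the log-likelihood under the verification (alternative) model.
    """
    confidence = hypothesis_log_likelihood - verification_log_likelihood
    return confidence, confidence >= threshold


# Hypothetical scores: the hypothesis fits clearly better than the alternative.
confidence, accepted = verify(-10.0, -14.0, threshold=2.0)
```

Raising the threshold in this sketch rejects more hypotheses, trading false acceptances for false rejections.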
