专利检索 ap:("Yariv Ephraim" OR "Mazin G. Rahim") AND inv:"Mazin G. Rahim" 第 1 页

1.

发明授权
Method and apparatus for speech recognition using second order statistics and linear estimation of cepstral coefficients 失效
标题翻译：使用二阶统计学和倒谱系数的线性估计的语音识别方法和装置

公开(公告)号：US06202047B1

公开(公告)日：2001-03-13

申请号：US09050301

申请日：1998-03-30

申请人： Yariv Ephraim , Mazin G. Rahim

发明人： Yariv Ephraim , Mazin G. Rahim

IPC分类号： G10L1514

CPC分类号： G10L15/02 , G10L15/142 , G10L25/24

摘要： A method and apparatus for speech recognition using second order statistics and linear estimation of cepstral coefficients. In one embodiment, a speech input signal is received and cepstral features are extracted. An answer is generated using the extracted cepstral features and a fixed signal independent diagonal matrix as the covariance matrix for the cepstral components of the speech input signal and, for example, a hidden Markov model. In another embodiment, a noisy speech input signal is received and a cepstral vector representing a clean speech input signal is generated based on the noisy speech input signal and an explicit linear minimum mean square error cepstral estimator.

摘要翻译： 一种使用二阶统计学和倒谱系数线性估计的语音识别的方法和装置。在一个实施例中，接收语音输入信号并提取倒谱特征。使用提取的倒谱特征和固定信号独立对角矩阵作为用于语音输入信号的倒谱分量的协方差矩阵和例如隐马尔可夫模型来生成答案。在另一个实施例中，接收噪声语音输入信号，并且基于噪声语音输入信号和显式线性最小均方误差倒谱估计器产生表示干净语音输入信号的倒谱矢量。

2.

发明授权
System and method of extracting clauses for spoken language understanding 有权

公开(公告)号：US08849648B1

公开(公告)日：2014-09-30

申请号：US10329138

申请日：2002-12-24

申请人： Srinivas Bangalore , Narendra K. Gupta , Mazin G. Rahim

发明人： Srinivas Bangalore , Narendra K. Gupta , Mazin G. Rahim

IPC分类号： G06F17/27

CPC分类号： G06F17/2705 , G06F17/218 , G06F17/27 , G06F17/2775 , G10L15/05 , G10L15/26 , G10L2015/025 , G10L2015/081 , G10L2015/088

摘要： A clausifier and method of extracting clauses for spoken language understanding are disclosed. The method relates to generating a set of clauses from speech utterance text and comprises inserting at least one boundary tag in speech utterance text related to sentence boundaries, inserting at least one edit tag indicating a portion of the speech utterance text to remove, and inserting at least one conjunction tag within the speech utterance text. The result is a set of clauses that may be identified within the speech utterance text according to the inserted at least one boundary tag, at least one edit tag and at least one conjunction tag. The disclosed clausifier comprises a sentence boundary classifier, an edit detector classifier, and a conjunction detector classifier. The clausifier may comprise a single classifier or a plurality of classifiers to perform the steps of identifying sentence boundaries, editing text, and identifying conjunctions within the text.

3.

发明授权
System and method of providing an automated data-collection in spoken dialog systems 有权

公开(公告)号：US08694324B2

公开(公告)日：2014-04-08

申请号：US13476150

申请日：2012-05-21

申请人： Giuseppe Di Fabbrizio , Dilek Z. Hakkani-Tur , Mazin G. Rahim , Bernard S. Renger , Gokhan Tur

发明人： Giuseppe Di Fabbrizio , Dilek Z. Hakkani-Tur , Mazin G. Rahim , Bernard S. Renger , Gokhan Tur

IPC分类号： G10L21/00 , G10L19/00 , G06F17/27

CPC分类号： G10L15/063 , G10L15/183 , G10L15/22

摘要： The invention relates to a system and method for gathering data for use in a spoken dialog system. An aspect of the invention is generally referred to as an automated hidden human that performs data collection automatically at the beginning of a conversation with a user in a spoken dialog system. The method comprises presenting an initial prompt to a user, recognizing a received user utterance using an automatic speech recognition engine and classifying the recognized user utterance using a spoken language understanding module. If the recognized user utterance is not understood or classifiable to a predetermined acceptance threshold, then the method re-prompts the user. If the recognized user utterance is not classifiable to a predetermined rejection threshold, then the method transfers the user to a human as this may imply a task-specific utterance. The received and classified user utterance is then used for training the spoken dialog system.

4.

发明授权
Recognizing the numeric language in natural spoken dialogue 有权
标题翻译：认识到自然语言对话中的数字语言

公开(公告)号：US08655658B2

公开(公告)日：2014-02-18

申请号：US13280884

申请日：2011-10-25

申请人： Mazin G. Rahim , Giuseppe Riccardi , Jeremy Huntley Wright , Bruce Melvin Buntschuh , Allen Louis Gorin

发明人： Mazin G. Rahim , Giuseppe Riccardi , Jeremy Huntley Wright , Bruce Melvin Buntschuh , Allen Louis Gorin

IPC分类号： G10L15/14 , G10L15/18

CPC分类号： G10L15/142

摘要： A system and a method are provided. A speech recognition processor receives unconstrained input speech and outputs a string of words. The speech recognition processor is based on a numeric language that represents a subset of a vocabulary. The subset includes a set of words identified as being for interpreting and understanding number strings. A numeric understanding processor contains classes of rules for converting the string of words into a sequence of digits. The speech recognition processor utilizes an acoustic model database. A validation database stores a set of valid sequences of digits. A string validation processor outputs validity information based on a comparison of a sequence of digits output by the numeric understanding processor with valid sequences of digits in the validation database.

摘要翻译： 提供了一种系统和方法。语音识别处理器接收无约束输入语音并输出一串字。语音识别处理器基于代表词汇子集的数字语言。该子集包括被识别为用于解释和理解数字串的一组单词。数字理解处理器包含用于将字符串转换为数字序列的规则类型。语音识别处理器利用声学模型数据库。验证数据库存储一组有效的数字序列。字符串验证处理器基于数字理解处理器输出的数字序列与验证数据库中的有效数字序列的比较来输出有效性信息。

5.

发明授权
Method and apparatus for automatically building conversational systems 有权
标题翻译：自动构建对话系统的方法和装置

公开(公告)号：US08175230B2

公开(公告)日：2012-05-08

申请号：US12644393

申请日：2009-12-22

申请人： Srinivas Bangalore , Mazin G. Rahim , Junlan Feng

发明人： Srinivas Bangalore , Mazin G. Rahim , Junlan Feng

IPC分类号： H04M1/64 , G10L15/22 , G06F15/16

CPC分类号： G06F17/243 , G06Q10/06311 , G06Q30/0201 , G06Q30/0635 , H04L12/6418 , H04M3/4938 , H04M7/0009 , H04M7/0036 , H04M2201/40 , H04M2201/60 , H04M2203/355

摘要： A system and method provides a natural language interface to world-wide web content. Either in advance or dynamically, webpage content is parsed using a parsing algorithm. A person using a telephone interface can provide speech information, which is converted to text and used to automatically fill in input fields on a webpage form. The form is then submitted to a database search and a response is generated. Information contained on the responsive webpage is extracted and converted to speech via a text-to-speech engine and communicated to the person.

摘要翻译： 系统和方法为世界各地的Web内容提供了一种自然语言界面。提前或动态地，使用解析算法解析网页内容。使用电话接口的人可以提供语音信息，其被转换成文本并用于自动填写网页表单上的输入字段。然后将表单提交到数据库搜索，并生成响应。包含在响应网页上的信息被提取并经由文本到语音引擎转换成语音，并传达给该人。

6.

发明授权
Systems and methods for monitoring speech data labelers 有权
标题翻译：用于监控语音数据标签器的系统和方法

公开(公告)号：US08170880B2

公开(公告)日：2012-05-01

申请号：US12772589

申请日：2010-05-03

申请人： Lee Begeja , Richard Vandervoort Cox , Harris Drucker , David Crawford Gibbon , Allen Louis Gorin , Patrick Guy Haffner , Steven H. Lewis , Zhu Liu , Mazin G. Rahim , Bernard S. Renger , Behzad Shahraray

发明人： Lee Begeja , Richard Vandervoort Cox , Harris Drucker , David Crawford Gibbon , Allen Louis Gorin , Patrick Guy Haffner , Steven H. Lewis , Zhu Liu , Mazin G. Rahim , Bernard S. Renger , Behzad Shahraray

IPC分类号： G10L15/22 , G10L21/06

CPC分类号： G10L13/08 , G10L15/18

摘要： Systems and methods herein use an annotation guide to label utterances and speech data with a call type. A system practicing the method embodiment monitors labelers of speech data by presenting via a processor a test utterance to a labeler, receiving input from the labeler that selects a particular call type from a list of call types and determining via the processor if the labeler labeled the test utterance correctly. Based on the determining step, the system revises the annotation guide, retrains the labeler, and/or alters the test utterance.

摘要翻译： 本文中的系统和方法使用注释指南来标记具有呼叫类型的话语和语音数据。实施方法实施例的系统通过经由处理器向标签器呈现测试话语来监视语音数据的标签器，从标签器接收从呼叫类型列表中选择特定呼叫类型的输入，并且经由处理器确定标签为测试发音正确。基于确定步骤，系统修改注释指南，重新训练标签器，和/或改变测试话语。

7.

发明申请
SYSTEM AND METHOD FOR PROVIDING A NATURAL LANGUAGE INTERFACE TO A DATABASE 有权
标题翻译：向数据库提供自然语言界面的系统和方法

公开(公告)号：US20110179006A1

公开(公告)日：2011-07-21

申请号：US13074419

申请日：2011-03-29

申请人： Richard Vandervoort Cox , Hossein Eslambolchi , Behzad Nadji , Mazin G. Rahim

发明人： Richard Vandervoort Cox , Hossein Eslambolchi , Behzad Nadji , Mazin G. Rahim

IPC分类号： G06F17/30 , G06F15/18 , G06F17/27 , G10L15/18 , G10L13/00

CPC分类号： G06F17/30864 , G06F17/30657 , G06F17/30663 , G06F17/30684 , G06N99/005 , G10L15/22 , G10L21/06

摘要： A system and method for providing a natural language interface to a database or the Internet. The method provides a response from a database to a natural language query. The method comprises receiving a user query, extracting key data from the user query, submitting the extracted key data to a data base search engine to retrieve a top n pages from the data base, processing of the top n pages through a natural language dialog engine and providing a response based on processing the top n pages.

摘要翻译： 一种用于向数据库或因特网提供自然语言界面的系统和方法。该方法提供从数据库到自然语言查询的响应。该方法包括接收用户查询，从用户查询中提取密钥数据，将所提取的密钥数据提交给数据库搜索引擎以从数据库中检索最前面的页面，通过自然语言对话引擎处理前n个页面并提供基于处理前n页的响应。

8.

发明授权
Reducing time for annotating speech data to develop a dialog application 有权
标题翻译：减少注释语音数据开发对话应用程序的时间

公开(公告)号：US07860713B2

公开(公告)日：2010-12-28

申请号：US12165755

申请日：2008-07-01

申请人： Tirso M. Alonso , Ilana Bromberg , Dilek Z. Hakkani-Tur , Barbara B. Hollister , Mazin G. Rahim , Giuseppe Riccardi , Lawrence Lyon Rose , Daniel Leon Stern , Gokhan Tur , James M. Wilson

发明人： Tirso M. Alonso , Ilana Bromberg , Dilek Z. Hakkani-Tur , Barbara B. Hollister , Mazin G. Rahim , Giuseppe Riccardi , Lawrence Lyon Rose , Daniel Leon Stern , Gokhan Tur , James M. Wilson

IPC分类号： G10L15/14 , G10L15/22

CPC分类号： G10L15/19 , G10L15/063 , G10L15/183 , H04M3/4936

摘要： Systems and methods for annotating speech data. The present invention reduces the time required to annotate speech data by selecting utterances for annotation that will be of greatest benefit. A selection module uses speech models, including speech recognition models and spoken language understanding models, to identify utterances that should be annotated based on criteria such as confidence scores generated by the models. These utterances are placed in an annotation list along with a type of annotation to be performed for the utterances and an order in which the annotation should proceed. The utterances in the annotation list can be annotated for speech recognition purposes, spoken language understanding purposes, labeling purposes, etc. The selection module can also select utterances for annotation based on previously annotated speech data and deficiencies in the various models.

摘要翻译： 用于注释语音数据的系统和方法。本发明通过选择最有益的用于注释的话语来减少注释语音数据所需的时间。选择模块使用包括语音识别模型和语言理解模型在内的语音模型来基于诸如由模型产生的置信度得分的标准来识别应当注释的话语。这些话语被放置在注释列表中，以及要为语句执行的注释类型以及注释应该继续执行的顺序。注释列表中的话语可以被注释用于语音识别目的，语言理解目的，标签目的等。选择模块还可以基于先前注释的语音数据和各种模型中的缺陷来选择用于注释的话语。

9.

发明授权
Method of generation a labeling guide for spoken dialog services 有权
标题翻译：生成口语对话服务标签指南的方法

公开(公告)号：US07729902B1

公开(公告)日：2010-06-01

申请号：US11927738

申请日：2007-10-30

申请人： Narendra K. Gupta , Barbara B. Hollister , Mazin G. Rahim , Giuseppe Riccardi

发明人： Narendra K. Gupta , Barbara B. Hollister , Mazin G. Rahim , Giuseppe Riccardi

IPC分类号： G06F17/27 , G06F17/21

CPC分类号： G06F17/279 , G10L15/18 , G10L15/183

摘要： A method is disclosed for designing a labeling guide for use by a labeler in labeling data used for training a spoken language understanding (SLU) module for an application. The method comprises a labeling guide designer selecting domain-independent actions applicable to an application, selecting domain-dependent objects according to characteristics of the application, and generating a labeling guide using the selected domain-independent actions and selected domain-dependent objects. An advantage of the labeling guide generated in this manner is that the labeling guide designer can easily port the labeling guide to a new application by selecting a set of domain-independent action and then selecting the domain-dependent objects related to the new application.

摘要翻译： 公开了一种用于设计标签指南的方法，用于标签机用于标记用于训练用于应用的口语理解（SLU）模块的数据。该方法包括标签指导者设计者，其选择适用于应用的独立于领域的动作，根据应用的特征来选择依赖于域的对象，以及使用所选择的与域无关的动作和选择的域相关对象来生成标签指南。以这种方式生成的标签指南的优点是，标签指南设计者可以通过选择一组独立于领域的动作，然后选择与新应用相关的域相关对象，轻松地将标签指南移植到新应用。

10.

发明申请
SYSTEM FOR HANDLING FREQUENTLY ASKED QUESTIONS IN A NATURAL LANGUAGE DIALOG SERVICE 审中-公开
标题翻译：在自然语言对话服务中处理常见问题的系统

公开(公告)号：US20090070113A1

公开(公告)日：2009-03-12

申请号：US12266835

申请日：2008-11-07

申请人： Narendra K. Gupta , Mazin G. Rahim , Giuseppe Riccardi

发明人： Narendra K. Gupta , Mazin G. Rahim , Giuseppe Riccardi

IPC分类号： G10L15/18

CPC分类号： G10L15/22 , G06F3/167

摘要： A voice-enabled help desk service is disclosed. The service comprises an automatic speech recognition module for recognizing speech from a user, a spoken language understanding module for understanding the output from the automatic speech recognition module, a dialog management module for generating a response to speech from the user, a natural voices text-to-speech synthesis module for synthesizing speech to generate the response to the user, and a frequently asked questions module. The frequently asked questions module handles frequently asked questions from the user by changing voices and providing predetermined prompts to answer the frequently asked question.

摘要翻译： 公开了支持语音的帮助台服务。该服务包括用于识别来自用户的语音的自动语音识别模块，用于理解来自自动语音识别模块的输出的口语语言理解模块，用于生成来自用户对语音的响应的对话管理模块，自然语音文本 - 语音合成模块，用于合成语音以产生对用户的响应，以及常见问题模块。常见问题模块通过改变语音来处理用户的常见问题，并提供预定的提示来回答常见问题。

搜索结果

国家/区域

专利有效性

申请日

公布(公告)日

申请人

申请人所在国/区域

发明人

IPC

IPC部

IPC大类

IPC小类

IPC大组

IPC小组

外观分类