专利检索 ap:("Guillaume Proulx" OR "Youssef Billawala" OR "Elaine Drom" OR "Farzad Ehsani" OR "Yookyung Kim" OR "Demitrios Master") AND inv:"Farzad Ehsani" 第 1 页

1.

发明申请
Methods for speech-to-speech translation 审中-公开
标题翻译：语言到语音翻译的方法

公开(公告)号：US20080133245A1

公开(公告)日：2008-06-05

申请号：US11633859

申请日：2006-12-04

申请人： Guillaume Proulx , Youssef Billawala , Elaine Drom , Farzad Ehsani , Yookyung Kim , Demitrios Master

发明人： Guillaume Proulx , Youssef Billawala , Elaine Drom , Farzad Ehsani , Yookyung Kim , Demitrios Master

IPC分类号： G10L21/00 , G06F17/28 , G10L11/00

CPC分类号： G06F17/2872 , G06F17/2818 , G10L13/00 , G10L15/26

摘要： The present invention disclose modular speech-to-speech translation systems and methods that provide adaptable platforms to enable verbal communication between speakers of different languages within the context of specific domains. The components of the preferred embodiments of the present invention includes: (1) speech recognition; (2) machine translation; (3) N-best merging module; (4) verification; and (5) text-to-speech. Characteristics of the speech recognition module here are that the modules are structured to provide N-best selections and multi-stream processing, where multiple speech recognition engines may be active at any one time. The N-best lists from the one or more speech recognition engines may be handled either separately or collectively to improve both recognition and translation results. A merge module is responsible for integrating the N-best outputs of the translation engines along with confidence/translation scores to create a ranked list or recognition-translation pairs.

摘要翻译： 本发明公开了提供适应性平台的模块化语音到语音翻译系统和方法，以使得能够在特定域的上下文内的不同语言的说话者之间进行口头通信。本发明的优选实施例的组件包括：（1）语音识别; （2）机器翻译; （3）最佳合并模块; （4）验证; （5）文字转语音。这里的语音识别模块的特征在于，模块被构造成提供N个最佳选择和多流处理，其中多个语音识别引擎可以在任何一个时间处于活动状态。来自一个或多个语音识别引擎的N最佳列表可以单独处理或集体处理以改善识别和翻译结果。合并模块负责整合翻译引擎的N最佳输出以及置信/翻译分数，以创建排名列表或识别 - 转换对。

2.

发明授权
Robust information extraction from utterances 有权
标题翻译：从言语中提取鲁棒的信息

公开(公告)号：US08583416B2

公开(公告)日：2013-11-12

申请号：US11965711

申请日：2007-12-27

申请人： Jun Huang , Yookyung Kim , Youssef Billawala , Farzad Ehsani , Demitrios Master

发明人： Jun Huang , Yookyung Kim , Youssef Billawala , Farzad Ehsani , Demitrios Master

IPC分类号： G06F17/28 , G10L15/00 , G10L21/00

CPC分类号： G10L15/1822 , G10L15/1815

摘要： The performance of traditional speech recognition systems (as applied to information extraction or translation) decreases significantly with, larger domain size, scarce training data as well as under noisy environmental conditions. This invention mitigates these problems through the introduction of a novel predictive feature extraction method which combines linguistic and statistical information for representation of information embedded in a noisy source language. The predictive features are combined with text classifiers to map the noisy text to one of the semantically or functionally similar groups. The features used by the classifier can be syntactic, semantic, and statistical.

摘要翻译： 传统语音识别系统（应用于信息提取或翻译）的性能随着更大的域大小，稀缺的训练数据以及噪声环境条件而显着降低。本发明通过引入一种新颖的预测特征提取方法来缓解这些问题，该方法结合语言和统计信息来表示以噪声源语言嵌入的信息。预测特征与文本分类器组合，将嘈杂的文本映射到语义或功能相似的组之一。分类器使用的特征可以是语法，语义和统计。

3.

发明申请
Robust Information Extraction from Utterances 有权
标题翻译：强大的信息提取

公开(公告)号：US20090171662A1

公开(公告)日：2009-07-02

申请号：US11965711

申请日：2007-12-27

申请人： Jun Huang , Yookyung Kim , Youssef Billawala , Farzad Ehsani , Demitrios Master

发明人： Jun Huang , Yookyung Kim , Youssef Billawala , Farzad Ehsani , Demitrios Master

IPC分类号： G10L15/00

CPC分类号： G10L15/1822 , G10L15/1815

摘要： The performance of traditional speech recognition systems (as applied to information extraction or translation) decreases significantly with, larger domain size, scarce training data as well as under noisy environmental conditions. This invention mitigates these problems through the introduction of a novel predictive feature extraction method which combines linguistic and statistical information for representation of information embedded in a noisy source language. The predictive features are combined with text classifiers to map the noisy text to one of the semantically or functionally similar groups. The features used by the classifier can be syntactic, semantic, and statistical.

摘要翻译： 传统语音识别系统（应用于信息提取或翻译）的性能随着更大的域大小，稀缺的训练数据以及噪声环境条件而显着降低。本发明通过引入一种新颖的预测特征提取方法来缓解这些问题，该方法结合语言和统计信息来表示以噪声源语言嵌入的信息。预测特征与文本分类器组合，将嘈杂的文本映射到语义或功能相似的组之一。分类器使用的特征可以是语法，语义和统计。

4.

发明授权
Mobile speech-to-speech interpretation system 有权
标题翻译：移动语音到语音解释系统

公开(公告)号：US08478578B2

公开(公告)日：2013-07-02

申请号：US12351793

申请日：2009-01-09

申请人： Farzad Ehsani , Demitrios Master , Elaine Drom Zuber

发明人： Farzad Ehsani , Demitrios Master , Elaine Drom Zuber

IPC分类号： G06F17/28

CPC分类号： G06F17/289 , G06F17/28 , G06F17/2818 , G06F17/2854 , G10L13/00 , G10L15/005 , G10L15/02 , G10L15/26 , G10L15/30

摘要： Interpretation from a first language to a second language via one or more communication devices is performed through a communication network (e.g. phone network or the internet) using a server for performing recognition and interpretation tasks, comprising the steps of: receiving an input speech utterance in a first language on a first mobile communication device; conditioning said input speech utterance; first transmitting said conditioned input speech utterance to a server; recognizing said first transmitted speech utterance to generate one or more recognition results; interpreting said recognition results to generate one or more interpretation results in an interlingua; mapping the interlingua to a second language in a first selected format; second transmitting said interpretation results in the first selected format to a second mobile communication device; and presenting said interpretation results in a second selected format on said second communication device.

摘要翻译： 通过一个或多个通信设备从第一语言到第二语言的解释通过使用用于执行识别和解释任务的服务器的通信网络（例如电话网络或因特网）来执行，包括以下步骤：接收输入语音话语第一移动通信设备上的第一语言; 说出输入的言语说话; 首先将所述条件化输入语音话语发送到服务器; 识别所述第一发送语音话语以产生一个或多个识别结果; 解释所述识别结果以在国际语言中产生一个或多个解释结果; 以第一选择的格式将国际语言映射到第二语言; 将第一选择格式的所述解释结果发送给第二移动通信设备; 以及在所述第二通信设备上呈现第二选择格式的所述解释结果。

5.

发明申请
Speech-to-speech translation system with user-modifiable paraphrasing grammars 审中-公开
标题翻译：具有用户可修改的释义语法的语音到语音翻译系统

公开(公告)号：US20070016401A1

公开(公告)日：2007-01-18

申请号：US11203621

申请日：2005-08-12

申请人： Farzad Ehsani , Demitrios Master , Guillaume Proulx

发明人： Farzad Ehsani , Demitrios Master , Guillaume Proulx

IPC分类号： G06F17/27

CPC分类号： G06F17/2872 , G10L15/005

摘要： The present invention discloses a speech-to-speech translation device which allows one or more users to input a spoken utterance in one language, translates the utterance into one or more second languages, and outputs the translation in speech form. Additionally, the device allows for translation both directions, recognizing inputs in the one or more second languages and translating them back into the first language. The device recognizes and translates utterances in a limited domain as in a phrase book translation system, so the translation accuracy is essentially 100%. By limiting the domain the system increases the accuracy of the speech recognition component and thus the accuracy of the overall system. However unlike other phrase book systems, the device also allows wide variations and paraphrasing in the input, so that the user is much more likely to find the desired phrase from the stored list of phrases. The device paraphrases the input to a basic canonical form and performs the translation on that canonical form, ignoring the non-essential variations in the surface form of the input. The device can provide visual and/or auditory feedback to confirm the recognized input and makes the system usable for non-bilingual users with absolute confidence.

摘要翻译： 本发明公开了一种语音到语音翻译装置，其允许一个或多个用户以一种语言输入口语话语，将话语转换为一种或多种第二语言，并以语音形式输出翻译。此外，该设备允许翻译两个方向，识别一个或多个第二语言中的输入并将其翻译成第一语言。该设备在短语书籍翻译系统中识别和翻译有限域中的话语，因此翻译准确性基本上为100％。通过限制域，系统提高了语音识别组件的准确性，从而提高了整个系统的准确性。然而，与其他短语书系统不同，该设备还允许输入中的宽泛的变化和释义，使得用户更可能从存储的短语列表中找到所需的短语。该设备将输入释义为基本规范形式，并以该规范形式执行翻译，忽略了输入表面形式中的非必要变体。该设备可以提供视觉和/或听觉反馈，以确认已识别的输入，并使系统可以绝对自信地用于非双语用户。

6.

发明授权
Methods for using manual phrase alignment data to generate translation models for statistical machine translation 有权
标题翻译：使用手动短语对齐数据生成用于统计机器翻译的翻译模型的方法

公开(公告)号：US08229728B2

公开(公告)日：2012-07-24

申请号：US11969518

申请日：2008-01-04

申请人： Jun Huang , Yookyung Kim , Demitrios Master , Farzad Ehsani

发明人： Jun Huang , Yookyung Kim , Demitrios Master , Farzad Ehsani

IPC分类号： G06F17/20 , G06F17/21 , G06F17/28

CPC分类号： G06F17/2818 , G06F17/2827

摘要： The present invention adopts the fundamental architecture of a statistical machine translation system which utilizes statistical models learned from the training data and does not require expert knowledge for rule-based machine translation systems. Out of the training parallel data, a certain amount of sentence pairs are selected for manual alignment. These sentences are aligned at the phrase level instead of at the word level. Depending on the size of the training data, the optimal amount for manual alignment may vary. The alignment is done using an alignment tool with a graphical user interface which is convenient and intuitive to the users. Manually aligned data are then utilized to improve the automatic word alignment component. Model combination methods are also introduced to improve the accuracy and the coverage of statistical models for the task of statistical machine translation.

摘要翻译： 本发明采用统计机器翻译系统的基础架构，该系统利用从训练数据中获得的统计模型，不需要基于规则的机器翻译系统的专业知识。在训练并行数据中，选择一定量的句子对进行手动对齐。这些句子在短语级别而不是单词级别对齐。根据训练数据的大小，手动校准的最佳量可能会有所不同。使用具有用户方便和直观的图形用户界面的对准工具进行对准。然后使用手动对齐的数据来改进自动字对齐组件。还引入了模型组合方法，以提高统计机器翻译任务的统计模型的准确性和覆盖率。

7.

发明申请
Methods for Using Manual Phrase Alignment Data to Generate Translation Models for Statistical Machine Translation 有权
标题翻译：使用手动短语对齐数据生成统计机器翻译的翻译模型的方法

公开(公告)号：US20090177460A1

公开(公告)日：2009-07-09

申请号：US11969518

申请日：2008-01-04

申请人： Jun Huang , Yookyung Kim , Demitrios Master , Farzad Ehsani

发明人： Jun Huang , Yookyung Kim , Demitrios Master , Farzad Ehsani

IPC分类号： G06F17/28

CPC分类号： G06F17/2818 , G06F17/2827

摘要： The present invention adopts the fundamental architecture of a statistical machine translation system which utilizes statistical models learned from the training data and does not require expert knowledge for rule-based machine translation systems. Out of the training parallel data, a certain amount of sentence pairs are selected for manual alignment. These sentences are aligned at the phrase level instead of at the word level. Depending on the size of the training data, the optimal amount for manual alignment may vary. The alignment is done using an alignment tool with a graphical user interface which is convenient and intuitive to the users. Manually aligned data are then utilized to improve the automatic word alignment component. Model combination methods are also introduced to improve the accuracy and the coverage of statistical models for the task of statistical machine translation.

摘要翻译： 本发明采用统计机器翻译系统的基础架构，该系统利用从训练数据中获得的统计模型，不需要基于规则的机器翻译系统的专业知识。在训练并行数据中，选择一定量的句子对进行手动对齐。这些句子在短语级别而不是单词级别对齐。根据训练数据的大小，手动校准的最佳量可能会有所不同。使用具有用户方便和直观的图形用户界面的对准工具进行对准。然后使用手动对齐的数据来改进自动字对齐组件。还提出了模型组合方法，以提高统计机器翻译任务的统计模型的准确性和覆盖率。

8.

发明申请
Mobile Speech-to-Speech Interpretation System 有权
标题翻译：移动语音到语音解释系统

公开(公告)号：US20090177461A1

公开(公告)日：2009-07-09

申请号：US12351793

申请日：2009-01-09

申请人： Farzad Ehsani , Demitrios Master , Elaine Zuber

发明人： Farzad Ehsani , Demitrios Master , Elaine Zuber

IPC分类号： G06F17/28

CPC分类号： G06F17/289 , G06F17/28 , G06F17/2818 , G06F17/2854 , G10L13/00 , G10L15/005 , G10L15/02 , G10L15/26 , G10L15/30

摘要： Interpretation from a first language to a second language via one or more communication devices is performed through a communication network (e.g. phone network or the internet) using a server for performing recognition and interpretation tasks, comprising the steps of: receiving an input speech utterance in a first language on a first mobile communication device; conditioning said input speech utterance; first transmitting said conditioned input speech utterance to a server; recognizing said first transmitted speech utterance to generate one or more recognition results; interpreting said recognition results to generate one or more interpretation results in an interlingua; mapping the interlingua to a second language in a first selected format; second transmitting said interpretation results in the first selected format to a second mobile communication device; and presenting said interpretation results in a second selected format on said second communication device.

摘要翻译： 通过一个或多个通信设备从第一语言到第二语言的解释通过使用用于执行识别和解释任务的服务器的通信网络（例如电话网络或因特网）来执行，包括以下步骤：接收输入语音话语第一移动通信设备上的第一语言; 说出输入的言语说话; 首先将所述条件化输入语音话语发送到服务器; 识别所述第一发送语音话语以产生一个或多个识别结果; 解释所述识别结果以在国际语言中产生一个或多个解释结果; 以第一选择的格式将国际语言映射到第二语言; 将第一选择格式的所述解释结果发送给第二移动通信设备; 以及在所述第二通信设备上呈现第二选择格式的所述解释结果。

9.

发明授权
Phrase-based dialogue modeling with particular application to creating a recognition grammar for a voice-controlled user interface 有权

公开(公告)号：US08442812B2

公开(公告)日：2013-05-14

申请号：US10818219

申请日：2004-04-05

申请人： Farzad Ehsani , Eva M. Knodt , Demitrios L. Master

发明人： Farzad Ehsani , Eva M. Knodt , Demitrios L. Master

IPC分类号： G06F17/27 , G10L15/00

CPC分类号： G06F17/28 , G06F17/2775 , G06F17/2795 , G10L15/183 , G10L15/193 , G10L15/197 , G10L15/22

摘要： The invention enables creation of grammar networks that can regulate, control, and define the content and scope of human-machine interaction in natural language voice user interfaces (NLVUI). The invention enables phrase-based modeling of generic structures of verbal interaction to be used for the purpose of automating part of the design of such grammar networks. Most particularly, the invention enables such grammar networks to be used in providing a voice-controlled user interface to human readable text data that is also machine-readable (such as a Web page, a word processing document, a PDF document, or a spreadsheet).

10.

发明授权
Methods for creating a phrase thesaurus 有权
标题翻译：创建短语词库的方法

公开(公告)号：US08374871B2

公开(公告)日：2013-02-12

申请号：US10096194

申请日：2002-03-11

申请人： Farzad Ehsani , Eva M. Knodt

发明人： Farzad Ehsani , Eva M. Knodt

IPC分类号： G10L15/00 , G10L21/00

CPC分类号： G06F17/271 , G06F17/2715 , G06F17/2765 , G06F17/2775 , G06F17/279 , G06F17/2795 , G10L15/005 , G10L15/183 , G10L15/193 , G10L15/197 , G10L15/22

摘要： The invention enables creation of grammar networks that can regulate, control, and define the content and scope of human-machine interaction in natural language voice user interfaces (NLVUI). More specifically, the invention concerns a phrase-based modeling of generic structures of verbal interaction and use of these models for the purpose of automating part of the design of such grammar networks.

摘要翻译： 本发明能够创建可以调节，控制和定义自然语言语音用户界面（NLVUI）中人机交互的内容和范围的语法网络。更具体地，本发明涉及用于自动化这种语法网络的设计的一部分的目的语言交互的通用结构和这些模型的使用的基于短语的建模。

搜索结果

国家/区域

专利有效性

申请日

公布(公告)日

申请人

申请人所在国/区域

发明人

IPC

IPC部

IPC大类

IPC小类

IPC大组

IPC小组

外观分类