专利检索 ap:("Andrew Aaron" OR "Ellen Eide" OR "Wael Hamza" OR "Michael Picheny" OR "Charles Rutherfoord" OR "Zhi Shuang" OR "Maria Smith") AND inv:"Andrew Aaron" 第 1 页

1.

发明申请
Method, apparatus and computer program providing a multi-speaker database for concatenative text-to-speech synthesis 有权
标题翻译：方法，装置和计算机程序提供用于并行文本到语音合成的多扬声器数据库

公开(公告)号：US20060229876A1

公开(公告)日：2006-10-12

申请号：US11101223

申请日：2005-04-07

申请人： Andrew Aaron , Ellen Eide , Wael Hamza , Michael Picheny , Charles Rutherfoord , Zhi Shuang , Maria Smith

发明人： Andrew Aaron , Ellen Eide , Wael Hamza , Michael Picheny , Charles Rutherfoord , Zhi Shuang , Maria Smith

IPC分类号： G10L13/00

CPC分类号： G10L13/07 , G10L2021/0135

摘要： A method, apparatus and a computer program product to generate an audible speech word that corresponds to text. The method includes providing a text word and, in response to the text word, processing pre-recorded speech segments that are derived from a plurality of speakers to selectively concatenate together speech segments based on at least one cost function to form audio data for generating an audible speech word that corresponds to the text word. A data structure is also provided for use in a concatenative text-to-speech system that includes a plurality of speech segments derived from a plurality of speakers, where each speech segment includes an associated attribute vector each of which is comprised of at least one attribute vector element that identifies the speaker from which the speech segment was derived.

摘要翻译： 一种用于生成对应于文本的可听话语词的方法，装置和计算机程序产品。该方法包括提供文本字，并且响应于文本字，处理从多个扬声器导出的预先记录的语音片段，以便基于至少一个成本函数选择性地将语音片段并置在一起，以形成用于生成对应于文本字的声音语音字。还提供了一种数据结构，用于包括从多个扬声器导出的多个语音段的级联文本到语音系统，其中每个语音段包括相关联的属性向量，每个语音段包括至少一个属性标识从中导出语音段的扬声器的向量元素。

2.

发明申请
Generating paralinguistic phenomena via markup 有权
标题翻译：通过标记产生分析现象

公开(公告)号：US20050273338A1

公开(公告)日：2005-12-08

申请号：US10861055

申请日：2004-06-04

申请人： Andrew Aaron , Raimo Bakis , Ellen Eide , Wael Hamza

发明人： Andrew Aaron , Raimo Bakis , Ellen Eide , Wael Hamza

IPC分类号： G10L13/06

CPC分类号： G10L13/08

摘要： Examples of paralinguistic events (e.g., breaths, coughs, sighs, etc.) are recorded. A text-to-speech (“TTS”) engine may insert the examples into a stream of synthetic speech using, for example, markup. The synthetic speech may include a combination of normal text and paralinguistic text.

摘要翻译： 记录截肢事件（例如呼吸，咳嗽，叹息等）的例子。文本到语音（“TTS”）引擎可以使用例如标记将示例插入到合成语音流中。合成语音可以包括正常文本和paralinguistic文本的组合。

3.

发明申请
Systems and methods for expressive text-to-speech 审中-公开
标题翻译：表达文字到言语的系统和方法

公开(公告)号：US20050096909A1

公开(公告)日：2005-05-05

申请号：US10695979

申请日：2003-10-29

申请人： Raimo Bakis , Andrew Aaron , Ellen Eide , Thiruvilwamalai Raman

发明人： Raimo Bakis , Andrew Aaron , Ellen Eide , Thiruvilwamalai Raman

IPC分类号： G10L13/00 , G10L13/02 , G10L13/08

CPC分类号： G10L13/033 , G10L13/027 , G10L13/04

摘要： Systems and methods are provided for expressive text-to-speech which include identifying text to convert to speech, selecting a speech style sheet from a set of available speech style sheets, the speech style sheet defining desired speech characteristics, marking the text to associate the text with the selected speech style sheet, and converting the text to speech having the desired speech characteristics by applying a low level markup associated with the speech style sheet.

摘要翻译： 提供了用于表达性文本到语音的系统和方法，其包括识别要转换为语音的文本，从一组可用语音样式表中选择语音样式表，定义期望的语音特征的语音样式表，标记文本以关联具有所选择的语音样式表的文本，并且通过应用与语音样式表相关联的低级标记将文本转换为具有期望语音特征的语音。

4.

发明申请
System and method for improving text-to-speech software intelligibility through the detection of uncommon words and phrases 审中-公开
标题翻译：通过检测不寻常的单词和短语来提高文本到语音软件的清晰度的系统和方法

公开(公告)号：US20050234724A1

公开(公告)日：2005-10-20

申请号：US10825578

申请日：2004-04-15

申请人： Andrew Aaron , Ellen Eide

发明人： Andrew Aaron , Ellen Eide

IPC分类号： G10L13/08 , G10L21/02

CPC分类号： G10L13/10 , G10L21/0264

摘要： Disclosed is a system and method for improving the intelligibility of speech output by a speech synthesizer by determining if uncommon words exist in the text, and if it is determined that an uncommon word exists in the text, pausing the output of the synthesized speech of the uncommon word to offset the uncommon word from its surrounding speech.

摘要翻译： 公开了一种用于通过确定文本中是否存在不常见的单词来提高语音合成器的语音输出的可懂度的系统和方法，并且如果确定文本中存在不常见的单词，则暂停该文本的合成语音的输出不寻常的话来弥补周围言论中的不寻常的话。

5.

发明授权
System and method for a device sound interface manager 有权
标题翻译：用于设备声音接口管理器的系统和方法

公开(公告)号：US08243954B2

公开(公告)日：2012-08-14

申请号：US12019153

申请日：2008-01-24

申请人： Andrew Aaron , Dimitri Kanevsky , Edward E. Kelley , Bhuvana Ramabhadran

发明人： Andrew Aaron , Dimitri Kanevsky , Edward E. Kelley , Bhuvana Ramabhadran

IPC分类号： H03G3/20 , A61F11/06 , H04B1/00 , H04B15/00 , G10K11/16

CPC分类号： H03G3/32

摘要： A system for regulating the volume and frequency content of audio producing devices, the system includes: one or more noise making objects (NMO) configured with individual sound control devices in electrical communication with a noise management server and one or more audio producing devices configured with individual sound control devices; wherein the sound control devices have electronic logic processing, storage, and communication capabilities; wherein the noise management server utilize the sound control devices to: determine whether the NMO are producing noise in the audible range of one or more audio producing devices; determine a noise characteristic of the one or more NMO; command the one or more NMO to send the noise characteristic to the one or more audio producing devices; and wherein the volume and frequency content of audio produced by the one or more audio producing devices is adjusted in response to the received noise characteristic.

摘要翻译： 一种用于调节音频产生装置的音量和频率内容的系统，所述系统包括：一个或多个噪声制作对象（NMO），其配置有与噪声管理服务器电通信的各个声音控制装置和配置有个人声控装置; 声音控制装置具有电子逻辑处理，存储和通信能力; 其中所述噪声管理服务器利用所述声音控制装置来：确定所述NMO是否在一个或多个音频产生装置的可听范围内产生噪声; 确定一个或多个NMO的噪声特性; 命令所述一个或多个NMO将所述噪声特性发送到所述一个或多个音频产生设备; 并且其中响应于所接收的噪声特性来调整由所述一个或多个音频产生装置产生的音频的音量和频率内容。

6.

发明授权
Operation of a noise cancellation device 有权
标题翻译：噪声消除装置的操作

公开(公告)号：US09054666B2

公开(公告)日：2015-06-09

申请号：US13448428

申请日：2012-04-17

申请人： Andrew Aaron , Dimitri Kanevsky , Edward E. Kelley , Bhuvana Ramabhadran

发明人： Andrew Aaron , Dimitri Kanevsky , Edward E. Kelley , Bhuvana Ramabhadran

IPC分类号： H04R3/02 , H03G3/20 , H03G3/00 , H03G3/32

CPC分类号： H03G3/32

摘要： A method for improving the performance of a noise cancellation device, the method includes determining whether one or more noise making objects (NMO) are near an audible range of the noise cancellation device and receiving a signal from the one or more NMOs indicative of a kind of noise the one or more NMOs is generating. The method also includes selecting a specific noise cancellation model to reduce an expected noise in response to the received kind of noise the one or more NMOs is generating.

摘要翻译： 一种用于改善噪声消除装置的性能的方法，所述方法包括确定一个或多个噪声产生对象（NMO）是否在噪声消除装置的可听范围附近并且接收来自指示某种类型的一个或多个NMO的信号一个或多个NMO正在产生噪音。该方法还包括选择特定的噪声消除模型以响应于所接收的一个或多个NMO产生的噪声种类来减少期望的噪声。

7.

发明申请
Autonomous system and method for creating readable scripts for concatenative text-to-speech synthesis (TTS) corpora 有权
标题翻译：用于创建用于并行文本到语音合成（TTS）语料库的可读脚本的自动系统和方法

公开(公告)号：US20070168193A1

公开(公告)日：2007-07-19

申请号：US11332292

申请日：2006-01-17

申请人： Andrew Aaron , David Ferrucci , John Pitrelli

发明人： Andrew Aaron , David Ferrucci , John Pitrelli

IPC分类号： G10L13/08

CPC分类号： G10L13/04

摘要： A method (and system) which autonomously generates a cohesive script from a text database for creating a speech corpus for concatenative text-to-speech, and more particularly, which generates cohesive scripts having fluency and natural prosody that can be used to generate compact text-to-speech recordings that cover a plurality of phonetic events.

摘要翻译： 一种方法（和系统），其自动地从文本数据库生成一个连贯的脚本，用于创建用于并排的文本到语音的语音语料库，更具体地，其产生具有可用于生成紧凑文本的流畅性和自然韵律的连贯的脚本覆盖多个语音事件的语音录音。

8.

发明申请
Operation of a Noise Cancellation Device 有权
标题翻译：噪音消除装置的操作

公开(公告)号：US20120207316A1

公开(公告)日：2012-08-16

申请号：US13448428

申请日：2012-04-17

申请人： Andrew Aaron , Dimitri Kanevsky , Edward E. Kelley , Bhuvana Ramabhadran

发明人： Andrew Aaron , Dimitri Kanevsky , Edward E. Kelley , Bhuvana Ramabhadran

IPC分类号： G10K11/16

CPC分类号： H03G3/32

摘要： A method for improving the performance of a noise cancellation device, the method includes determining whether one or more noise making objects (NMO) are near an audible range of the noise cancellation device and receiving a signal from the one or more NMOs indicative of a kind of noise the one or more NMOs is generating. The method also includes selecting a specific noise cancellation model to reduce an expected noise in response to the received kind of noise the one or more NMOs is generating.

摘要翻译： 一种用于改善噪声消除装置的性能的方法，所述方法包括确定一个或多个噪声产生对象（NMO）是否在噪声消除装置的可听范围附近并且接收来自指示某种类型的一个或多个NMO的信号一个或多个NMO正在产生噪音。该方法还包括选择特定的噪声消除模型以响应于所接收的一个或多个NMO产生的噪声种类来减少期望的噪声。

9.

发明申请
SYSTEM AND METHOD FOR A DEVICE SOUND INTERFACE MANAGER 有权
标题翻译：用于设备声音接口管理器的系统和方法

公开(公告)号：US20090190767A1

公开(公告)日：2009-07-30

申请号：US12019153

申请日：2008-01-24

申请人： Andrew Aaron , Dimitri Kanevsky , Edward E. Kelley , Bhuvana Ramabhadran

发明人： Andrew Aaron , Dimitri Kanevsky , Edward E. Kelley , Bhuvana Ramabhadran

IPC分类号： H03G3/20

CPC分类号： H03G3/32

摘要： A system for regulating the volume and frequency content of audio producing devices, the system includes: one or more noise making objects (NMO) configured with individual sound control devices in electrical communication with a noise management server and one or more audio producing devices configured with individual sound control devices; wherein the sound control devices have electronic logic processing, storage, and communication capabilities; wherein the noise management server utilize the sound control devices to: determine whether the NMO are producing noise in the audible range of one or more audio producing devices; determine a noise characteristic of the one or more NMO; command the one or more NMO to send the noise characteristic to the one or more audio producing devices; and wherein the volume and frequency content of audio produced by the one or more audio producing devices is adjusted in response to the received noise characteristic.

摘要翻译： 一种用于调节音频产生装置的音量和频率内容的系统，所述系统包括：一个或多个噪声制作对象（NMO），其配置有与噪声管理服务器电通信的各个声音控制装置和配置有个人声控装置; 声音控制装置具有电子逻辑处理，存储和通信能力; 其中所述噪声管理服务器利用所述声音控制装置来：确定所述NMO是否在一个或多个音频产生装置的可听范围内产生噪声; 确定一个或多个NMO的噪声特性; 命令所述一个或多个NMO将所述噪声特性发送到所述一个或多个音频产生设备; 并且其中响应于所接收的噪声特性来调整由所述一个或多个音频产生装置产生的音频的音量和频率内容。

10.

发明授权
Apparatus, program storage device and method for testing speech recognition in the mobile environment of a vehicle 有权
标题翻译：用于在车辆的移动环境中测试语音识别的装置，程序存储装置和方法

公开(公告)号：US07487084B2

公开(公告)日：2009-02-03

申请号：US10210667

申请日：2002-07-31

申请人： Andrew Aaron , Subrata K. Das , David M. Lubensky

发明人： Andrew Aaron , Subrata K. Das , David M. Lubensky

IPC分类号： G10L15/00

CPC分类号： G10L15/01

摘要： A testing arrangement provided for speech recognition systems in vehicles. Preferably included are a “mobile client” secured in the vehicle and driven around at a desired speed, an audio system and speaker which plays back a set of prerecorded utterances stored digitally in a computer arrangement such that the speech of a human being is simulated, transmission of the speech signal to a server, followed by speech recognition and signal-to-noise ratio (SNR) computation. Here, the acceptability of the vehicular speech recognition system is preferably determined via comparison with pre-specified standards of recognition accuracy and SNR values.

摘要翻译： 为车辆中的语音识别系统提供的测试装置。优选地包括固定在车辆中并以所需速度驱动的“移动客户端”，音频系统和扬声器，其回放一组预先存储的话语，其以数字方式存储在计算机装置中，使得人的语音被模拟，将语音信号传输到服务器，随后进行语音识别和信噪比（SNR）计算。这里，车辆语音识别系统的可接受性优选地通过与预先指定的识别精度和SNR值的标准相比较来确定。

搜索结果

国家/区域

专利有效性

申请日

公布(公告)日

申请人

申请人所在国/区域

发明人

IPC

IPC部

IPC大类

IPC小类

IPC大组

IPC小组

外观分类