Patent search: ap:("Yong Zhao" OR "Frank Kao-ping Soong" OR "Min Chu" OR "Lijuan Wang") AND inv:"Min Chu" (page 1)

1.

Granted patent
Unnatural prosody detection in speech synthesis (in force)

Publication No.: US08583438B2

Publication date: 2013-11-12

Application No.: US11903020

Filing date: 2007-09-20

Applicants: Yong Zhao, Frank Kao-ping Soong, Min Chu, Lijuan Wang

Inventors: Yong Zhao, Frank Kao-ping Soong, Min Chu, Lijuan Wang

IPC classification: G10L13/00

CPC classification: G10L13/10

Abstract: Described is a technology by which synthesized speech generated from text is evaluated against a prosody model (trained offline) to determine whether the speech will sound unnatural. If so, the speech is regenerated with modified data. The evaluation and regeneration may be iterative until deemed natural sounding. For example, text is built into a lattice that is then (e.g., Viterbi) searched to find a best path. The sections (e.g., units) of data on the path are evaluated via a prosody model. If the evaluation deems a section to correspond to unnatural prosody, that section is replaced, e.g., by modifying/pruning the lattice and re-performing the search. Replacement may be iterative until all sections pass the evaluation. Unnatural prosody detection may be biased such that during evaluation, unnatural prosody is falsely detected at a higher rate relative to a rate at which unnatural prosody is missed.
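The search, evaluate, prune, and re-search loop in this abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the lattice layout (one list of candidate units per position), the names `viterbi` and `synthesize`, the cost functions, the `is_unnatural` predicate standing in for the trained prosody model, and the `max_rounds` cap are all assumptions made for illustration.

```python
from typing import Callable, Dict, List, Tuple

Unit = str
Lattice = List[List[Unit]]  # hypothetical layout: candidate units per position


def viterbi(
    lattice: Lattice,
    target_cost: Callable[[int, Unit], float],
    join_cost: Callable[[Unit, Unit], float],
) -> List[Unit]:
    """Return the lowest-cost path, choosing one unit per lattice position."""
    # best[u] = (cost of the cheapest path ending in unit u, that path)
    best: Dict[Unit, Tuple[float, List[Unit]]] = {
        u: (target_cost(0, u), [u]) for u in lattice[0]
    }
    for pos in range(1, len(lattice)):
        nxt: Dict[Unit, Tuple[float, List[Unit]]] = {}
        for u in lattice[pos]:
            # Cheapest way to reach u from any unit at the previous position.
            prev_cost, prev_path = min(
                ((c + join_cost(p[-1], u), p) for c, p in best.values()),
                key=lambda t: t[0],
            )
            nxt[u] = (prev_cost + target_cost(pos, u), prev_path + [u])
        best = nxt
    return min(best.values(), key=lambda t: t[0])[1]


def synthesize(
    lattice: Lattice,
    target_cost: Callable[[int, Unit], float],
    join_cost: Callable[[Unit, Unit], float],
    is_unnatural: Callable[[int, Unit], bool],  # stand-in for the prosody model
    max_rounds: int = 10,  # assumed safety cap, not taken from the abstract
) -> List[Unit]:
    """Search, evaluate each unit on the path, prune failures, and repeat."""
    lattice = [list(column) for column in lattice]  # do not mutate the input
    path = viterbi(lattice, target_cost, join_cost)
    for _ in range(max_rounds):
        # Only positions with an alternative candidate can be pruned.
        bad = [
            i for i, u in enumerate(path)
            if is_unnatural(i, u) and len(lattice[i]) > 1
        ]
        if not bad:
            break
        for i in bad:
            lattice[i].remove(path[i])
        path = viterbi(lattice, target_cost, join_cost)
    return path
```

The bias mentioned at the end of the abstract would live inside `is_unnatural` in this sketch: a predicate tuned to flag borderline units produces more false alarms but misses fewer genuinely unnatural ones.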

2.

Patent application
Unnatural prosody detection in speech synthesis (in force)

Publication No.: US20090083036A1

Publication date: 2009-03-26

Application No.: US11903020

Filing date: 2007-09-20

Applicants: Yong Zhao, Frank Kao-ping Soong, Min Chu, Lijuan Wang

Inventors: Yong Zhao, Frank Kao-ping Soong, Min Chu, Lijuan Wang

IPC classification: G10L13/08, G06F17/30

CPC classification: G10L13/10

Abstract: Described is a technology by which synthesized speech generated from text is evaluated against a prosody model (trained offline) to determine whether the speech will sound unnatural. If so, the speech is regenerated with modified data. The evaluation and regeneration may be iterative until deemed natural sounding. For example, text is built into a lattice that is then (e.g., Viterbi) searched to find a best path. The sections (e.g., units) of data on the path are evaluated via a prosody model. If the evaluation deems a section to correspond to unnatural prosody, that section is replaced, e.g., by modifying/pruning the lattice and re-performing the search. Replacement may be iterative until all sections pass the evaluation. Unnatural prosody detection may be biased such that during evaluation, unnatural prosody is falsely detected at a higher rate relative to a rate at which unnatural prosody is missed.

3.

Patent application
Identifying language of origin for words using estimates of normalized appearance frequency (in force)

Publication No.: US20080059151A1

Publication date: 2008-03-06

Application No.: US11515468

Filing date: 2006-09-01

Applicants: Yi Ning Chen, Min Chu, Jiali You, Frank Kao-Ping Soong

Inventors: Yi Ning Chen, Min Chu, Jiali You, Frank Kao-Ping Soong

IPC classification: G06F17/27

CPC classification: G06F17/278, G06F17/275

Abstract: The language of origin of a word or named entity is predicted using estimates of frequency of occurrence of the word or named entity in different languages. In one embodiment, the normalized frequency of occurrence of the word or named entity in a variety of different languages is estimated and the values are used as features in a feature vector which is scored and used to identify language of origin.
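As a rough illustration of the feature construction this abstract describes, the Python sketch below builds one normalized-frequency feature per language and then picks a language. The data layout (`counts`, `totals`), the function names, and the use of a plain argmax as the scoring step are assumptions; the abstract does not say how the frequencies are estimated or which scorer is used.

```python
from typing import Dict


def normalized_frequencies(
    word: str,
    counts: Dict[str, Dict[str, int]],  # hypothetical: counts[lang][word]
    totals: Dict[str, int],             # hypothetical: corpus size per language
) -> Dict[str, float]:
    """Build the feature vector: one normalized frequency per language."""
    return {
        lang: counts.get(lang, {}).get(word, 0) / totals[lang]
        for lang in totals
    }


def predict_origin(
    word: str,
    counts: Dict[str, Dict[str, int]],
    totals: Dict[str, int],
) -> str:
    """Score the feature vector; argmax is a stand-in for a trained scorer."""
    features = normalized_frequencies(word, counts, totals)
    return max(features, key=lambda lang: features[lang])
```

Normalizing by corpus size keeps a language with a much larger corpus from dominating purely through raw counts.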

4.

Granted patent
Unsupervised labeling of sentence level accent (in force)

Publication No.: US07844457B2

Publication date: 2010-11-30

Application No.: US11708442

Filing date: 2007-02-20

Applicants: YiNing Chen, Frank Kao-ping Soong, Min Chu

Inventors: YiNing Chen, Frank Kao-ping Soong, Min Chu

IPC classification: G10L15/06

CPC classification: G10L13/08

Abstract: Methods are disclosed for automatic accent labeling without manually labeled data. The methods are designed to exploit accent distribution between function and content words.

5.

Granted patent
Voice persona service for embedding text-to-speech features into software programs (in force)

Publication No.: US07689421B2

Publication date: 2010-03-30

Application No.: US11823169

Filing date: 2007-06-27

Applicants: Yusheng Li, Min Chu, Xin Zou, Frank Kao-ping Soong

Inventors: Yusheng Li, Min Chu, Xin Zou, Frank Kao-ping Soong

IPC classification: G10L13/08

CPC classification: G10L13/08, G10L13/033

Abstract: Described is a voice persona service by which users convert text into speech waveforms, based on user-provided parameters and voice data from a service data store. The service may be remotely accessed, such as via the Internet. The user may provide text tagged with parameters, with the text sent to a text-to-speech engine along with base or custom voice data, and the resulting waveform morphed based on the tags. The user may also provide speech. Once created, a voice persona corresponding to the speech waveform may be persisted, exchanged, made public, shared and so forth. In one example, the voice persona service receives user input and parameters, and retrieves a base or custom voice that may be edited by the user via a morphing algorithm. The service outputs a waveform, such as a .wav file for embedding in a software program, and persists the voice persona corresponding to that waveform.

6.

Patent application
Unsupervised labeling of sentence level accent (in force)

Publication No.: US20080201145A1

Publication date: 2008-08-21

Application No.: US11708442

Filing date: 2007-02-20

Applicants: YiNing Chen, Frank Kao-ping Soong, Min Chu

Inventors: YiNing Chen, Frank Kao-ping Soong, Min Chu

IPC classification: G10L15/00

CPC classification: G10L13/08

Abstract: Methods are disclosed for automatic accent labeling without manually labeled data. The methods are designed to exploit accent distribution between function and content words.

7.

Granted patent
Refining of segmental boundaries in speech waveforms using contextual-dependent models (lapsed)

Publication No.: US07496512B2

Publication date: 2009-02-24

Application No.: US10823129

Filing date: 2004-04-13

Applicants: Yong Zhao, Min Chu, Jian-lai Zhou, Lijuan Wang

Inventors: Yong Zhao, Min Chu, Jian-lai Zhou, Lijuan Wang

IPC classification: G10L17/00

CPC classification: G10L15/02, G10L2015/022

Abstract: A method and apparatus are provided for refining segmental boundaries in speech waveforms. Contextual acoustic feature similarities are used as a basis for clustering adjacent phoneme speech units, where each adjacent pair phoneme speech units include a segmental boundary. A refining model is trained for each cluster and used to refine boundaries of contextual phoneme speech units forming the clusters.

8.

Patent application
Refining of segmental boundaries in speech waveforms using contextual-dependent models (lapsed)

Publication No.: US20050228664A1

Publication date: 2005-10-13

Application No.: US10823129

Filing date: 2004-04-13

Applicants: Yong Zhao, Min Chu, Jian-lai Zhou, Lijuan Wang

Inventors: Yong Zhao, Min Chu, Jian-lai Zhou, Lijuan Wang

IPC classification: G10L15/02, G10L15/06

CPC classification: G10L15/02, G10L2015/022

Abstract: A method and apparatus are provided for refining segmental boundaries in speech waveforms. Contextual acoustic feature similarities are used as a basis for clustering adjacent phoneme speech units, where each adjacent pair phoneme speech units include a segmental boundary. A refining model is trained for each cluster and used to refine boundaries of contextual phoneme speech units forming the clusters.

9.

Granted patent
Name synthesis (in force)

Publication No.: US08719027B2

Publication date: 2014-05-06

Application No.: US11712298

Filing date: 2007-02-28

Applicants: Yining Chen, Yusheng Li, Min Chu, Frank Kao-Ping Soong

Inventors: Yining Chen, Yusheng Li, Min Chu, Frank Kao-Ping Soong

IPC classification: G10L13/00, G10L15/00

CPC classification: G10L13/08

Abstract: An automated method of providing a pronunciation of a word to a remote device is disclosed. The method includes receiving an input indicative of the word to be pronounced. The method further includes searching a database having a plurality of records. Each of the records has an indication of a textual representation and an associated indication of an audible representation. At least one output is provided to the remote device of an audible representation of the word to be pronounced.

10.

Granted patent
Identifying language of origin for words using estimates of normalized appearance frequency (in force)

Publication No.: US07689408B2

Publication date: 2010-03-30

Application No.: US11515468

Filing date: 2006-09-01

Applicants: Yi Ning Chen, Min Chu, Jiali You, Frank Kao-Ping Soong

Inventors: Yi Ning Chen, Min Chu, Jiali You, Frank Kao-Ping Soong

IPC classification: G06F17/20, G10L21/00

CPC classification: G06F17/278, G06F17/275

Abstract: The language of origin of a word or named entity is predicted using estimates of frequency of occurrence of the word or named entity in different languages. In one embodiment, the normalized frequency of occurrence of the word or named entity in a variety of different languages is estimated and the values are used as features in a feature vector which is scored and used to identify language of origin.
