Providing personalized voice font for text-to-speech applications
    6.
    Granted invention patent
    Status: lapsed

    Publication No.: US07693719B2

    Publication date: 2010-04-06

    Application No.: US10977178

    Filing date: 2004-10-29

    IPC classification: G10L21/00 G10L13/00 G06F3/16

    CPC classification: G10L13/033 G10L2021/0135

    Abstract: A method for synthesizing speech from text includes receiving one or more waveforms characteristic of a voice of a person selected by a user, generating a personalized voice font based on the one or more waveforms, and delivering the personalized voice font to the user's computer, whereby speech can be synthesized from text, the speech being in the voice of the selected person, the speech being synthesized using the personalized voice font. A system includes a text-to-speech (TTS) application operable to generate a voice font based on speech waveforms transmitted from a client computer remotely accessing the TTS application.

    Refining of segmental boundaries in speech waveforms using contextual-dependent models
    7.
    Granted invention patent
    Status: lapsed

    Publication No.: US07496512B2

    Publication date: 2009-02-24

    Application No.: US10823129

    Filing date: 2004-04-13

    IPC classification: G10L17/00

    CPC classification: G10L15/02 G10L2015/022

    Abstract: A method and apparatus are provided for refining segmental boundaries in speech waveforms. Contextual acoustic feature similarities are used as a basis for clustering adjacent phoneme speech units, where each pair of adjacent phoneme speech units includes a segmental boundary. A refining model is trained for each cluster and used to refine the boundaries of the contextual phoneme speech units forming the clusters.

    Optimization of an objective measure for estimating mean opinion score of synthesized speech
    8.
    Granted invention patent
    Status: lapsed

    Publication No.: US07386451B2

    Publication date: 2008-06-10

    Application No.: US10660388

    Filing date: 2003-09-11

    IPC classification: G10L13/08 G10L13/00

    CPC classification: G10L25/69 G10L13/00

    Abstract: A method is provided for optimizing an objective measure used to estimate mean opinion score or naturalness of synthesized speech from a speech synthesizer. The method includes using an objective measure that has components derived directly from textual information used to form synthesized utterances. The objective measure has a high correlation with mean opinion score such that a relationship can be formed between the objective measure and corresponding mean opinion score. The objective measure is altered to provide a different function of textual information derived from the utterances so as to improve the relationship between the scores of the objective measure and subjective ratings of the synthesized utterances.
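
    Illustrative sketch (not taken from the patent): one plausible way to compare alternative forms of an objective measure is to check how strongly each correlates with listener mean opinion scores (MOS) and keep the best one. The function names, the use of Pearson correlation, and the sample numbers below are all assumptions made for illustration; the abstract does not say which statistic or search procedure is used.

```python
from math import sqrt
from typing import Dict, List, Tuple


def pearson(xs: List[float], ys: List[float]) -> float:
    """Pearson correlation coefficient between two equal-length sequences."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mx = sum(xs) / len(xs)
    my = sum(ys) / len(ys)
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    var_x = sum((x - mx) ** 2 for x in xs)
    var_y = sum((y - my) ** 2 for y in ys)
    return cov / sqrt(var_x * var_y)


def best_measure(candidates: Dict[str, List[float]],
                 mos: List[float]) -> Tuple[str, float]:
    """Return the candidate measure whose scores correlate most strongly
    (in absolute value) with the MOS ratings. Hypothetical selection rule."""
    scored = {name: pearson(vals, mos) for name, vals in candidates.items()}
    name = max(scored, key=lambda k: abs(scored[k]))
    return name, scored[name]


# Made-up example: per-utterance scores of two candidate measures
# and the corresponding subjective MOS ratings.
mos = [2.0, 3.0, 3.5, 4.5]
candidates = {
    "measure_a": [10.0, 8.0, 7.0, 5.0],   # falls steadily as MOS rises
    "measure_b": [1.0, 5.0, 2.0, 4.0],    # only loosely related to MOS
}
name, r = best_measure(candidates, mos)
print(name, round(r, 3))
```

    Absolute correlation is used here so that a measure that falls as quality rises (a cost-like measure) is treated the same as one that rises with quality.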

    Speech unit selection using HMM acoustic models
    9.
    Invention patent application
    Status: under examination (published)

    Publication No.: US20080059190A1

    Publication date: 2008-03-06

    Application No.: US11508093

    Filing date: 2006-08-22

    IPC classification: G10L13/00

    CPC classification: G10L13/06

    Abstract: A concatenating speech synthesizer concatenates selected speech units to obtain the desired synthesized speech. When desired speech units of phonetic and/or prosodic context are not available, the synthesizer selects replacement speech units based on measures representative of the difference between the HMM acoustic models of the desired speech unit and available speech units.
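
    Illustrative sketch (not taken from the patent): the selection step can be pictured as picking the available unit whose HMM is closest to the HMM of the missing unit. The abstract does not name the difference measure, so the code below assumes one: each HMM state is a diagonal-covariance Gaussian, and the distance is the symmetric Kullback-Leibler divergence summed over aligned states. All type and function names are hypothetical.

```python
from typing import Dict, List, Tuple

# Assumed representation: a state is (means, variances) of a diagonal
# Gaussian, and an HMM is a list of such states in left-to-right order.
State = Tuple[List[float], List[float]]
HMM = List[State]


def state_divergence(a: State, b: State) -> float:
    """Symmetric KL divergence, KL(a||b) + KL(b||a), between two
    diagonal-covariance Gaussians."""
    (mean_a, var_a), (mean_b, var_b) = a, b
    return sum(
        0.5 * (va / vb + vb / va - 2.0
               + (ma - mb) ** 2 * (1.0 / va + 1.0 / vb))
        for ma, va, mb, vb in zip(mean_a, var_a, mean_b, var_b)
    )


def hmm_distance(p: HMM, q: HMM) -> float:
    """Sum of state-wise divergences; assumes both models have the same
    number of states (a simplification for this sketch)."""
    if len(p) != len(q):
        raise ValueError("models must have the same number of states")
    return sum(state_divergence(a, b) for a, b in zip(p, q))


def select_replacement(desired: HMM, available: Dict[str, HMM]) -> str:
    """Name of the available unit whose model is closest to the desired one."""
    return min(available, key=lambda name: hmm_distance(desired, available[name]))


# Made-up one-dimensional, two-state models.
desired = [([0.0], [1.0]), ([1.0], [1.0])]
inventory = {
    "unit_x": [([0.1], [1.0]), ([1.2], [1.0])],
    "unit_y": [([2.0], [1.0]), ([3.0], [2.0])],
}
print(select_replacement(desired, inventory))
```

    Other model distances would fit the same interface; only `state_divergence` would need to change.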

    Identifying language of origin for words using estimates of normalized appearance frequency
    10.
    Invention patent application
    Status: in force

    Publication No.: US20080059151A1

    Publication date: 2008-03-06

    Application No.: US11515468

    Filing date: 2006-09-01

    IPC classification: G06F17/27

    CPC classification: G06F17/278 G06F17/275

    Abstract: The language of origin of a word or named entity is predicted using estimates of frequency of occurrence of the word or named entity in different languages. In one embodiment, the normalized frequency of occurrence of the word or named entity in a variety of different languages is estimated and the values are used as features in a feature vector which is scored and used to identify language of origin.
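
    Illustrative sketch (not taken from the patent): the embodiment can be pictured as building one feature per language from normalized occurrence counts and then scoring the resulting vector. The abstract does not specify the normalization or the scorer, so this sketch assumes occurrences per million tokens and a simple per-language weighted score with an argmax decision. The function names, weights, and counts are hypothetical.

```python
from typing import Dict, Optional, Tuple


def normalized_frequencies(counts: Dict[str, int],
                           corpus_sizes: Dict[str, int]) -> Dict[str, float]:
    """Occurrences per million tokens in each language's corpus
    (an assumed normalization)."""
    return {
        lang: 1e6 * counts.get(lang, 0) / size
        for lang, size in corpus_sizes.items()
    }


def predict_origin(counts: Dict[str, int],
                   corpus_sizes: Dict[str, int],
                   weights: Optional[Dict[str, float]] = None
                   ) -> Tuple[str, Dict[str, float]]:
    """Score the feature vector and return (best language, all scores).
    With no weights given, every language is weighted equally."""
    feats = normalized_frequencies(counts, corpus_sizes)
    if weights is None:
        weights = {lang: 1.0 for lang in feats}
    scores = {lang: weights[lang] * value for lang, value in feats.items()}
    return max(scores, key=lambda lang: scores[lang]), scores


# Made-up counts for one name in three corpora of different sizes.
counts = {"en": 12, "ja": 480, "zh": 3}
corpus_sizes = {"en": 2_000_000, "ja": 500_000, "zh": 1_000_000}
language, scores = predict_origin(counts, corpus_sizes)
print(language, scores)
```

    Normalizing by corpus size matters here: without it, a language with a much larger corpus could dominate simply because it has more text.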
