Identifying language of origin for words using estimates of normalized appearance frequency
    1.
    发明申请
    Identifying language of origin for words using estimates of normalized appearance frequency 有权
    使用归一化出现频率的估计来识别词语的起源语言

    公开(公告)号:US20080059151A1

    公开(公告)日:2008-03-06

    申请号:US11515468

    申请日:2006-09-01

    IPC分类号: G06F17/27

    CPC分类号: G06F17/278 G06F17/275

    摘要: The language of origin of a word or named entity is predicted using estimates of frequency of occurrence of the word or named entity in different languages. In one embodiment, the normalized frequency of occurrence of the word or named entity in a variety of different languages is estimated and the values are used as features in a feature vector which is scored and used to identify language of origin.

    摘要翻译: 使用不同语言的单词或命名实体的出现频率的估计来预测单词或命名实体的起始语言。 在一个实施例中,估计各种不同语言的单词或命名实体的归一化出现频率,并将该值用作特征向量中的特征,该特征向量被打分并用于识别原始语言。

    Identifying language of origin for words using estimates of normalized appearance frequency
    2.
    发明授权
    Identifying language of origin for words using estimates of normalized appearance frequency 有权
    使用归一化出现频率的估计来识别词语的起源语言

    公开(公告)号:US07689408B2

    公开(公告)日:2010-03-30

    申请号:US11515468

    申请日:2006-09-01

    IPC分类号: G06F17/20 G10L21/00

    CPC分类号: G06F17/278 G06F17/275

    摘要: The language of origin of a word or named entity is predicted using estimates of frequency of occurrence of the word or named entity in different languages. In one embodiment, the normalized frequency of occurrence of the word or named entity in a variety of different languages is estimated and the values are used as features in a feature vector which is scored and used to identify language of origin.

    摘要翻译: 使用不同语言的单词或命名实体的出现频率的估计来预测单词或命名实体的起始语言。 在一个实施例中,估计各种不同语言的单词或命名实体的归一化出现频率,并将该值用作特征向量中的特征,该特征向量被打分并用于识别原始语言。

    Unnatural prosody detection in speech synthesis

    公开(公告)号:US08583438B2

    公开(公告)日:2013-11-12

    申请号:US11903020

    申请日:2007-09-20

    IPC分类号: G10L13/00

    CPC分类号: G10L13/10

    摘要: Described is a technology by which synthesized speech generated from text is evaluated against a prosody model (trained offline) to determine whether the speech will sound unnatural. If so, the speech is regenerated with modified data. The evaluation and regeneration may be iterative until deemed natural sounding. For example, text is built into a lattice that is then (e.g., Viterbi) searched to find a best path. The sections (e.g., units) of data on the path are evaluated via a prosody model. If the evaluation deems a section to correspond to unnatural prosody, that section is replaced, e.g., by modifying/pruning the lattice and re-performing the search. Replacement may be iterative until all sections pass the evaluation. Unnatural prosody detection may be biased such that during evaluation, unnatural prosody is falsely detected at a higher rate relative to a rate at which unnatural prosody is missed.

    Voice persona service for embedding text-to-speech features into software programs
    5.
    发明授权
    Voice persona service for embedding text-to-speech features into software programs 有权
    语音人物服务,用于将文本到语音功能嵌入到软件程序中

    公开(公告)号:US07689421B2

    公开(公告)日:2010-03-30

    申请号:US11823169

    申请日:2007-06-27

    IPC分类号: G10L13/08

    CPC分类号: G10L13/08 G10L13/033

    摘要: Described is a voice persona service by which users convert text into speech waveforms, based on user-provided parameters and voice data from a service data store. The service may be remotely accessed, such as via the Internet. The user may provide text tagged with parameters, with the text sent to a text-to-speech engine along with base or custom voice data, and the resulting waveform morphed based on the tags. The user may also provide speech. Once created, a voice persona corresponding to the speech waveform may be persisted, exchanged, made public, shared and so forth. In one example, the voice persona service receives user input and parameters, and retrieves a base or custom voice that may be edited by the user via a morphing algorithm. The service outputs a waveform, such as a .wav file for embedding in a software program, and persists the voice persona corresponding to that waveform.

    摘要翻译: 描述了基于用户提供的参数和来自服务数据存储器的语音数据的用户将文本转换为语音波形的语音人物服务。 该服务可以被远程访问,例如通过因特网。 用户可以提供标有参数的文本,文本发送到文本到语音引擎以及基本或自定义语音数据,并且基于标签变形的结果波形。 用户还可以提供语音。 一旦创建,对应于语音波形的语音人物可以被持续,交换,公开,共享等等。 在一个示例中,语音人物服务接收用户输入和参数,并且检索可由用户通过变形算法编辑的基本或自定义语音。 该服务输出一个波形,例如.wav文件,用于嵌入到软件程序中,并保持对应于该波形的语音人物角色。

    Name synthesis
    7.
    发明授权
    Name synthesis 有权
    名称综合

    公开(公告)号:US08719027B2

    公开(公告)日:2014-05-06

    申请号:US11712298

    申请日:2007-02-28

    IPC分类号: G10L13/00 G10L15/00

    CPC分类号: G10L13/08

    摘要: An automated method of providing a pronunciation of a word to a remote device is disclosed. The method includes receiving an input indicative of the word to be pronounced. The method further includes searching a database having a plurality of records. Each of the records has an indication of a textual representation and an associated indication of an audible representation. At least one output is provided to the remote device of an audible representation of the word to be pronounced.

    摘要翻译: 公开了一种向远程设备提供单词发音的自动化方法。 该方法包括接收指示要发音的单词的输入。 该方法还包括搜索具有多个记录的数据库。 每个记录具有文本表示的指示和可听见的表示的相关指示。 至少一个输出被提供给要发音的单词的可听表示的远程设备。

    Unnatural prosody detection in speech synthesis
    8.
    发明申请
    Unnatural prosody detection in speech synthesis 有权
    语言合成中的非自然韵律检测

    公开(公告)号:US20090083036A1

    公开(公告)日:2009-03-26

    申请号:US11903020

    申请日:2007-09-20

    IPC分类号: G10L13/08 G06F17/30

    CPC分类号: G10L13/10

    摘要: Described is a technology by which synthesized speech generated from text is evaluated against a prosody model (trained offline) to determine whether the speech will sound unnatural. If so, the speech is regenerated with modified data. The evaluation and regeneration may be iterative until deemed natural sounding. For example, text is built into a lattice that is then (e.g., Viterbi) searched to find a best path. The sections (e.g., units) of data on the path are evaluated via a prosody model. If the evaluation deems a section to correspond to unnatural prosody, that section is replaced, e.g., by modifying/pruning the lattice and re-performing the search. Replacement may be iterative until all sections pass the evaluation. Unnatural prosody detection may be biased such that during evaluation, unnatural prosody is falsely detected at a higher rate relative to a rate at which unnatural prosody is missed.

    摘要翻译: 描述了一种技术,通过该技术,从文本产生的合成语音针对韵律模型(离线训练)进行评估,以确定语音是否会听起来不自然。 如果是,则使用修改的数据重新生成语音。 评估和再生可能是迭代的,直到被认为是自然的声音。 例如,文本被内置到一个格子中,然后(例如,维特比)被搜索以找到最佳路径。 通过韵律模型评估路径上的数据的部分(例如,单位)。 如果评估认为一部分对应于非自然韵律,则该部分被替换,例如通过修改/修剪格子并重新执行搜索。 替换可能是迭代的,直到所有部分通过评估。 不自然的韵律检测可能有偏差,使得在评估期间,相对于错过非自然韵律的速率,以较高的速率错误地检测到非自然韵律。

    Voice persona service for embedding text-to-speech features into software programs
    9.
    发明申请
    Voice persona service for embedding text-to-speech features into software programs 有权
    语音人物服务,用于将文本到语音功能嵌入到软件程序中

    公开(公告)号:US20090006096A1

    公开(公告)日:2009-01-01

    申请号:US11823169

    申请日:2007-06-27

    IPC分类号: G10L13/08

    CPC分类号: G10L13/08 G10L13/033

    摘要: Described is a voice persona service by which users convert text into speech waveforms, based on user-provided parameters and voice data from a service data store. The service may be remotely accessed, such as via the Internet. The user may provide text tagged with parameters, with the text sent to a text-to-speech engine along with base or custom voice data, and the resulting waveform morphed based on the tags. The user may also provide speech. Once created, a voice persona corresponding to the speech waveform may be persisted, exchanged, made public, shared and so forth. In one example, the voice persona service receives user input and parameters, and retrieves a base or custom voice that may be edited by the user via a morphing algorithm. The service outputs a waveform, such as a .wav file for embedding in a software program, and persists the voice persona corresponding to that waveform.

    摘要翻译: 描述了基于用户提供的参数和来自服务数据存储器的语音数据的用户将文本转换为语音波形的语音人物服务。 该服务可以被远程访问,例如通过因特网。 用户可以提供标有参数的文本,文本发送到文本到语音引擎以及基本或自定义语音数据,并且基于标签变形的结果波形。 用户还可以提供语音。 一旦创建,对应于语音波形的语音人物可以被持续,交换,公开,共享等等。 在一个示例中,语音人物服务接收用户输入和参数,并且检索可由用户通过变形算法编辑的基本或自定义语音。 该服务输出一个波形,例如.wav文件,用于嵌入到软件程序中,并保持对应于该波形的语音人物角色。

    Name synthesis
    10.
    发明申请
    Name synthesis 有权
    名称综合

    公开(公告)号:US20080208574A1

    公开(公告)日:2008-08-28

    申请号:US11712298

    申请日:2007-02-28

    IPC分类号: G10L21/00

    CPC分类号: G10L13/08

    摘要: An automated method of providing a pronunciation of a word to a remote device is disclosed. The method includes receiving an input indicative of the word to be pronounced. The method further includes searching a database having a plurality of records. Each of the records has an indication of a textual representation and an associated indication of an audible representation. At least one output is provided to the remote device of an audible representation of the word to be pronounced.

    摘要翻译: 公开了一种向远程设备提供单词发音的自动化方法。 该方法包括接收指示要发音的单词的输入。 该方法还包括搜索具有多个记录的数据库。 每个记录具有文本表示的指示和可听见的表示的相关指示。 至少一个输出被提供给要发音的单词的可听表示的远程设备。