METHOD AND APPARATUS FOR ALIGNING TEXTS
    1.
    发明申请
    METHOD AND APPARATUS FOR ALIGNING TEXTS 有权
    方法和设备用于对齐TEXTS

    公开(公告)号:US20110054901A1

    公开(公告)日:2011-03-03

    申请号:US12869921

    申请日:2010-08-27

    IPC分类号: G10L15/04

    CPC分类号: G06F17/2827

    摘要: A method and apparatus for aligning texts. The method includes acquiring a target text and a reference text and aligning the target text and the reference text at word level based on phoneme similarity. The method can be applied to automatically archiving a multimedia resource and a method of automatically searching a multimedia resource.

    摘要翻译: 一种用于对齐文本的方法和装置。 该方法包括获取目标文本和参考文本,并且基于音素相似性将目标文本和文本级别的参考文本对准。 该方法可以应用于自动归档多媒体资源和自动搜索多媒体资源的方法。

    Method and apparatus for aligning texts
    2.
    发明授权
    Method and apparatus for aligning texts 有权
    用于对齐文本的方法和装置

    公开(公告)号:US08527272B2

    公开(公告)日:2013-09-03

    申请号:US12869921

    申请日:2010-08-27

    IPC分类号: G10L15/04

    CPC分类号: G06F17/2827

    摘要: A method and apparatus for aligning texts. The method includes acquiring a target text and a reference text and aligning the target text and the reference text at word level based on phoneme similarity. The method can be applied to automatically archiving a multimedia resource and a method of automatically searching a multimedia resource.

    摘要翻译: 一种用于对齐文本的方法和装置。 该方法包括获取目标文本和参考文本,并且基于音素相似性将目标文本和文本级别的参考文本对准。 该方法可以应用于自动归档多媒体资源和自动搜索多媒体资源的方法。

    Assessing speech prosody
    3.
    发明授权
    Assessing speech prosody 有权
    评估言语韵律

    公开(公告)号:US09368126B2

    公开(公告)日:2016-06-14

    申请号:US13097191

    申请日:2011-04-29

    IPC分类号: G10L25/48

    CPC分类号: G10L25/48

    摘要: A method, system and computer readable storage medium for assessing speech prosody. The method includes the steps of: receiving input speech data; acquiring a prosody constraint; assessing prosody of the input speech data according to the prosody constraint; and providing assessment result where at least of the steps is carried out using a computer device.

    摘要翻译: 一种用于评估言语韵律的方法,系统和计算机可读存储介质。 该方法包括以下步骤:接收输入语音数据; 获得韵律约束; 根据韵律约束评估输入语音数据的韵律; 并且提供使用计算机设备进行至少步骤的评估结果。

    Method and system for achieving emotional text to speech utilizing emotion tags assigned to text data
    4.
    发明授权
    Method and system for achieving emotional text to speech utilizing emotion tags assigned to text data 有权
    使用分配给文本数据的情感标签来实现情感文本到文本的方法和系统

    公开(公告)号:US09117446B2

    公开(公告)日:2015-08-25

    申请号:US13221953

    申请日:2011-08-31

    IPC分类号: G10L13/08 G10L13/10

    CPC分类号: G10L13/10 G10L13/02 G10L13/08

    摘要: A method and system for achieving emotional text to speech. The method includes: receiving text data; generating emotion tag for the text data by a rhythm piece; and achieving TTS to the text data corresponding to the emotion tag, where the emotion tags are expressed as a set of emotion vectors; where each emotion vector includes a plurality of emotion scores given based on a plurality of emotion categories. A system for the same includes: a text data receiving module; an emotion tag generating module; and a TTS module for achieving TTS, wherein the emotion tag is expressed as a set of emotion vectors; and wherein emotion vector includes a plurality of emotion scores given based on a plurality of emotion categories.

    摘要翻译: 用于实现情感文字到语音的方法和系统。 该方法包括:接收文本数据; 通过节奏片产生文本数据的情感标签; 并且对于与情感标签相对应的文本数据实现TTS,其中情感标签被表达为一组情绪向量; 其中每个情绪向量包括基于多个情绪类别给出的多个情感评分。 一种系统,包括:文本数据接收模块; 情感标签生成模块; 以及用于实现TTS的TTS模块,其中所述情感标签被表达为一组情感向量; 并且其中情绪向量包括基于多个情绪类别给出的多个情绪评分。

    ASSESSING SPEECH PROSODY
    6.
    发明申请
    ASSESSING SPEECH PROSODY 有权
    评估语音预言

    公开(公告)号:US20110270605A1

    公开(公告)日:2011-11-03

    申请号:US13097191

    申请日:2011-04-29

    IPC分类号: G06F17/27 G10L15/00

    CPC分类号: G10L25/48

    摘要: A method, system and computer readable storage medium for assessing speech prosody. The method includes the steps of: receiving input speech data; acquiring a prosody constraint; assessing prosody of the input speech data according to the prosody constraint; and providing assessment result where at least of the steps is carried out using a computer device.

    摘要翻译: 一种用于评估言语韵律的方法,系统和计算机可读存储介质。 该方法包括以下步骤:接收输入语音数据; 获得韵律约束; 根据韵律约束评估输入语音数据的韵律; 并且提供使用计算机设备进行至少步骤的评估结果。

    Method and system for speech synthesis using dynamically updated acoustic unit sets
    7.
    发明授权
    Method and system for speech synthesis using dynamically updated acoustic unit sets 有权
    使用动态更新的声学单元组进行语音合成的方法和系统

    公开(公告)号:US08321223B2

    公开(公告)日:2012-11-27

    申请号:US12472724

    申请日:2009-05-27

    IPC分类号: G10L13/08

    CPC分类号: G10L13/04 G10L13/08 G10L15/30

    摘要: A method for performing speech synthesis on textual content at a client. The method includes the steps of: performing speech synthesis on the textual content based on a current acoustical unit set Scurrent in a corpus at the client; analyzing the textual content and generating a list of target units with corresponding context features, selecting multiple acoustical unit candidates for each target unit according to the context features based on an acoustical unit set Stotal that is more plentiful than the current acoustical unit set Scurrent in the corpus at the client, and determining acoustical units suitable for speech synthesis for the textual content according to the multiple unit candidates; and updating the current acoustical unit set Scurrent in the corpus at the client based on the determined acoustical units.

    摘要翻译: 一种在客户端对文本内容执行语音合成的方法。 该方法包括以下步骤:基于客户端的语料库中的当前声学单元集Scurrent对文本内容执行语音合成; 分析文本内容并生成具有相应上下文特征的目标单元的列表,根据上下文特征,基于声音单元组Stotal选择多个声学单元候选,所述声学单元组Stotal比当前声学单元组S Current更丰富 客户端的语料库,并根据多个单元候选确定适用于文本内容的语音合成的声学单元; 以及基于所确定的声学单元来更新客户端处的语料库中的当前声学单元组Scurrent。

    Speaker and call characteristic sensitive open voice search
    8.
    发明授权
    Speaker and call characteristic sensitive open voice search 有权
    扬声器和呼叫特性敏感开放语音搜索

    公开(公告)号:US08630860B1

    公开(公告)日:2014-01-14

    申请号:US13039467

    申请日:2011-03-03

    摘要: Techniques disclosed herein include systems and methods for open-domain voice-enabled searching that is speaker sensitive. Techniques include using speech information, speaker information, and information associated with a spoken query to enhance open voice search results. This includes integrating a textual index with a voice index to support the entire search cycle. Given a voice query, the system can execute two matching processes simultaneously. This can include a text matching process based on the output of speech recognition, as well as a voice matching process based on characteristics of a caller or user voicing a query. Characteristics of the caller can include output of voice feature extraction and metadata about the call. The system clusters callers according to these characteristics. The system can use specific voice and text clusters to modify speech recognition results, as well as modifying search results.

    摘要翻译: 本文公开的技术包括用于开放式语音启用搜索的扬声器敏感的系统和方法。 技术包括使用语音信息,说话者信息和与口语查询相关联的信息来增强开放语音搜索结果。 这包括将文本索引与语音索引集成以支持整个搜索周期。 给定语音查询,系统可以同时执行两个匹配过程。 这可以包括基于语音识别输出的文本匹配过程,以及基于呼叫者或用户发出查询的特征的语音匹配过程。 呼叫者的特征可以包括语音特征提取的输出和关于呼叫的元数据。 根据这些特点,系统集群呼叫者。 系统可以使用特定的语音和文本集群来修改语音识别结果,以及修改搜索结果。

    SPEAKER AND CALL CHARACTERISTIC SENSITIVE OPEN VOICE SEARCH
    9.
    发明申请
    SPEAKER AND CALL CHARACTERISTIC SENSITIVE OPEN VOICE SEARCH 有权
    扬声器和呼叫特征敏感开放语音搜索

    公开(公告)号:US20140129220A1

    公开(公告)日:2014-05-08

    申请号:US14152136

    申请日:2014-01-10

    IPC分类号: G10L15/26

    摘要: Techniques disclosed herein include systems and methods for open-domain voice-enabled searching that is speaker sensitive. Techniques include using speech information, speaker information, and information associated with a spoken query to enhance open voice search results. This includes integrating a textual index with a voice index to support the entire search cycle. Given a voice query, the system can execute two matching processes simultaneously. This can include a text matching process based on the output of speech recognition, as well as a voice matching process based on characteristics of a caller or user voicing a query. Characteristics of the caller can include output of voice feature extraction and metadata about the call. The system clusters callers according to these characteristics. The system can use specific voice and text clusters to modify speech recognition results, as well as modifying search results.

    摘要翻译: 本文公开的技术包括用于开放式语音启用搜索的扬声器敏感的系统和方法。 技术包括使用语音信息,说话者信息和与口语查询相关联的信息来增强开放语音搜索结果。 这包括将文本索引与语音索引集成以支持整个搜索周期。 给定语音查询,系统可以同时执行两个匹配过程。 这可以包括基于语音识别输出的文本匹配过程,以及基于呼叫者或用户发出查询的特征的语音匹配过程。 呼叫者的特征可以包括语音特征提取的输出和关于呼叫的元数据。 根据这些特点,系统集群呼叫者。 系统可以使用特定的语音和文本集群来修改语音识别结果,以及修改搜索结果。

    METHOD AND SYSTEM FOR SPEECH SYNTHESIS
    10.
    发明申请
    METHOD AND SYSTEM FOR SPEECH SYNTHESIS 有权
    语音合成方法与系统

    公开(公告)号:US20090299746A1

    公开(公告)日:2009-12-03

    申请号:US12472724

    申请日:2009-05-27

    IPC分类号: G10L13/08

    CPC分类号: G10L13/04 G10L13/08 G10L15/30

    摘要: A method for performing speech synthesis to a textual content at a client. The method includes the steps of: performing speech synthesis to the textual content based on a current acoustical unit set Scurrent in a corpus at the client; analyzing the textual content and generating a list of target units with corresponding context features, selecting multiple acoustical unit candidates for each target unit according to the context features based on an acoustical unit set Stotal that is more plentiful than the current acoustical unit set Scurrent in the corpus at the client, and determining acoustical units suitable for speech synthesis for the textual content according to the multiple unit candidates; and updating the current acoustical unit set Scurrent in the corpus at the client based on the determined acoustical units.

    摘要翻译: 一种用于对客户端的文本内容执行语音合成的方法。 该方法包括以下步骤:基于在客户端的语料库中的当前声学单元集Scurrent对文本内容执行语音合成; 分析文本内容并生成具有相应上下文特征的目标单元的列表,根据上下文特征,基于声音单元组Stotal选择多个声学单元候选,该声音单元组Stotal比当前声学单元组S Current 客户端的语料库,并根据多个单元候选确定适用于文本内容的语音合成的声学单元; 以及基于所确定的声学单元来更新客户端处的语料库中的当前声学单元组Scurrent。