Method and system for achieving emotional text to speech utilizing emotion tags assigned to text data
    11.
    发明授权
    Method and system for achieving emotional text to speech utilizing emotion tags assigned to text data 有权
    使用分配给文本数据的情感标签来实现情感文本到文本的方法和系统

    公开(公告)号:US09117446B2

    公开(公告)日:2015-08-25

    申请号:US13221953

    申请日:2011-08-31

    IPC分类号: G10L13/08 G10L13/10

    CPC分类号: G10L13/10 G10L13/02 G10L13/08

    摘要: A method and system for achieving emotional text to speech. The method includes: receiving text data; generating emotion tag for the text data by a rhythm piece; and achieving TTS to the text data corresponding to the emotion tag, where the emotion tags are expressed as a set of emotion vectors; where each emotion vector includes a plurality of emotion scores given based on a plurality of emotion categories. A system for the same includes: a text data receiving module; an emotion tag generating module; and a TTS module for achieving TTS, wherein the emotion tag is expressed as a set of emotion vectors; and wherein emotion vector includes a plurality of emotion scores given based on a plurality of emotion categories.

    摘要翻译: 用于实现情感文字到语音的方法和系统。 该方法包括:接收文本数据; 通过节奏片产生文本数据的情感标签; 并且对于与情感标签相对应的文本数据实现TTS,其中情感标签被表达为一组情绪向量; 其中每个情绪向量包括基于多个情绪类别给出的多个情感评分。 一种系统,包括:文本数据接收模块; 情感标签生成模块; 以及用于实现TTS的TTS模块,其中所述情感标签被表达为一组情感向量; 并且其中情绪向量包括基于多个情绪类别给出的多个情绪评分。

    Audio archive generation and presentation

    公开(公告)号:US09025736B2

    公开(公告)日:2015-05-05

    申请号:US12025535

    申请日:2008-02-04

    摘要: A method, information processing system, and computer program storage product for automatically generating auditory archives in a customer service environment are disclosed. A communication link with an end user is established. An information form is retrieved. The information form includes at least a category choice information set and at least one audio recoding information set. The end user is prompted to answer a set of questions based on information in the information form. A data set associated with each answer to each question in the set of questions given by the end user is stored. The data is stored under a set of fields corresponding to a question. Each data set stored under the set of fields for each question in the set of questions are combined with each other. An audio archive file is generated including the data sets that have been combined.

    Method and apparatus for automatically converting voice
    15.
    发明授权
    Method and apparatus for automatically converting voice 有权
    自动转换语音的方法和装置

    公开(公告)号:US08170878B2

    公开(公告)日:2012-05-01

    申请号:US12181553

    申请日:2008-07-29

    IPC分类号: G10L19/00

    摘要: The invention proposes a method and apparatus for significantly improving the quality of voice morphing and guaranteeing the similarity of converted voice. The invention sets several standard speakers in a TTS database, and selects the voices of different standard speakers for speech synthesis according to different roles, wherein the voice of the selected standard speaker is similar to the original role to a certain extent. Then the invention further performs voice morphing on the standard voice similar to the original voice to a certain extent, in order to accurately mimic the voice of the original speaker, so as to make the converted voice closer to the original voice features while guaranteeing the similarity.

    摘要翻译: 本发明提出了一种显着提高语音变形质量并保证转换语音相似性的方法和装置。 本发明在TTS数据库中设置几个标准扬声器,并根据不同的角色选择不同标准扬声器的语音合成语音,其中所选标准扬声器的声音在一定程度上与原始角色相似。 然后本发明在一定程度上进一步对与原始语音相似的标准语音进行声音变形,以便准确地模拟原始扬声器的声音,以便使转换的语音更接近原始语音特征,同时保证相似性 。

    Method and system for generating synthesized speech based on human recording
    16.
    发明申请
    Method and system for generating synthesized speech based on human recording 有权
    基于人类记录生成合成语音的方法和系统

    公开(公告)号:US20070033049A1

    公开(公告)日:2007-02-08

    申请号:US11475820

    申请日:2006-06-27

    IPC分类号: G10L13/08

    CPC分类号: G10L13/04

    摘要: A method and system that incorporates human recording with a TTS system to generate synthesized speech with high quality by searching over a database of pre-recorded utterances to select an utterance best matching text content to be synthesized into speech; dividing the best-matched utterance into a plurality of segments to generate remaining segments that are the same as corresponding parts of the text content and difference segments that are different from corresponding parts of the text content; synthesizing speech for the parts of the text content corresponding to the difference segments; and splicing the synthesized speech segments with the remaining segments of the best-matched utterance.

    摘要翻译: 一种将人类记录与TTS系统相结合的方法和系统,通过在数据库上搜索预先录制的话语来选择要合成语音的最佳匹配文本内容,从而产生高质量的合成语音; 将最佳匹配的话语划分成多个段以产生与文本内容的对应部分和与文本内容的对应部分不同的差异段的剩余段; 对与差分片段相对应的文本内容的部分合成语音; 以及将合成的语音片段与最佳匹配的话语的剩余片段拼接。

    Generating and relating text to audio segments
    17.
    发明申请
    Generating and relating text to audio segments 审中-公开
    生成并将文本与音频片段相关联

    公开(公告)号:US20060100877A1

    公开(公告)日:2006-05-11

    申请号:US11268367

    申请日:2005-11-07

    IPC分类号: G10L13/08

    CPC分类号: G06Q10/10 G10L15/26

    摘要: A method, apparatus and system for generating speech minutes. The method comprises the steps of displaying status indicators of respective audio (speech) stream chunks received and text information thereof on a GUI display and establishing the tagging between each audio stream chunk and the corresponding text information by dragging and dropping the status signs of the respective speech stream chunks onto the corresponding text information on the GUI, such that the speech stream, the text information and the corresponding tagging relation form voice tagged meeting minutes.

    摘要翻译: 一种用于产生语音分钟的方法,装置和系统。 该方法包括以下步骤:在GUI显示器上显示各自接收的音频(语音)流块的状态指示符及其文本信息,并通过拖放各个音频(语音)流块的状态标志来标识每个音频流块和对应的文本信息 语音流块在GUI上的对应文本信息上,使得语音流,文本信息和相应的标记关系形成语音标记的会议记录。

    Large-Scale Lateral Nanowire Arrays Nanogenerators
    18.
    发明申请
    Large-Scale Lateral Nanowire Arrays Nanogenerators 有权
    大型横向纳米线阵列纳米发生器

    公开(公告)号:US20110107569A1

    公开(公告)日:2011-05-12

    申请号:US12943499

    申请日:2010-11-10

    IPC分类号: H02N2/18

    CPC分类号: H01L41/316 H02N2/18

    摘要: In a method of making a generating device, a plurality of spaced apart elongated seed members are deposited onto a surface of a flexible non-conductive substrate. An elongated conductive layer is applied to a top surface and a first side of each seed member, thereby leaving an exposed second side opposite the first side. A plurality of elongated piezoelectric nanostructures is grown laterally from the second side of each seed layer. A second conductive material is deposited onto the substrate adjacent each elongated first conductive layer so as to be coupled the distal end of each of the plurality of elongated piezoelectric nanostructures. The second conductive material is selected so as to form a Schottky barrier between the second conductive material and the distal end of each of the plurality of elongated piezoelectric nanostructures and so as to form an electrical contact with the first conductive layer.

    摘要翻译: 在制造发生装置的方法中,多个间隔开的细长种子构件沉积在柔性非导电基底的表面上。 将细长的导电层施加到每个种子构件的顶表面和第一侧,从而留下与第一侧相对的暴露的第二侧。 多个细长的压电纳米结构从每个种子层的第二侧横向生长。 第二导电材料沉积在每个细长的第一导电层上的衬底上,以便耦合到多个细长压电纳米结构中的每一个的远端。 选择第二导电材料以在第二导电材料和多个细长压电纳米结构中的每一个的远端之间形成肖特基势垒,并且与第一导电层形成电接触。

    Method and system for generating synthesized speech based on human recording
    19.
    发明授权
    Method and system for generating synthesized speech based on human recording 有权
    基于人类记录生成合成语音的方法和系统

    公开(公告)号:US07899672B2

    公开(公告)日:2011-03-01

    申请号:US11475820

    申请日:2006-06-27

    IPC分类号: G10L13/08 G10L13/00

    CPC分类号: G10L13/04

    摘要: A method and system that incorporates human recording with a TTS system to generate synthesized speech with high quality by searching over a database of pre-recorded utterances to select an utterance best matching text content to be synthesized into speech; dividing the best-matched utterance into a plurality of segments to generate remaining segments that are the same as corresponding parts of the text content and difference segments that are different from corresponding parts of the text content; synthesizing speech for the parts of the text content corresponding to the difference segments; and splicing the synthesized speech segments with the remaining segments of the best-matched utterance.

    摘要翻译: 一种将人类记录与TTS系统相结合的方法和系统,通过在数据库上搜索预先录制的话语来选择要合成语音的最佳匹配文本内容,从而产生高质量的合成语音; 将最佳匹配的话语划分成多个段以产生与文本内容的对应部分和与文本内容的对应部分不同的差异段的剩余段; 对与差分片段相对应的文本内容的部分合成语音; 以及将合成的语音片段与最佳匹配的话语的剩余片段拼接。

    Flexible Nanogenerators
    20.
    发明申请
    Flexible Nanogenerators 有权
    柔性纳米发生器

    公开(公告)号:US20090066195A1

    公开(公告)日:2009-03-12

    申请号:US12209310

    申请日:2008-09-12

    IPC分类号: H02N2/18 H01L41/22

    CPC分类号: H02N2/18 H01L41/317 Y10T29/42

    摘要: A small scale electrical generator includes an elongated substrate and a first piezoelectric fine wire. The first piezoelectric fine wire is disposed along a surface of the substrate. The first piezoelectric fine wire has a first end and a spaced-apart second end. A first conductive contact secures the first end of the fine wire to a first portion of the substrate and a second conductive contact secures the second end of the fine wire to a second portion of the substrate. A fabric made of interwoven strands that includes fibers from which piezoelectric nanowires extend radially therefrom and conductive nanostructures extend therefrom is configured to generate electricity.

    摘要翻译: 小型发电机包括细长基板和第一压电细线。 第一压电细线沿着基板的表面设置。 第一压电细线具有第一端和间隔开的第二端。 第一导电接触件将细线的第一端固定到基板的第一部分,并且第二导电接触件将细线的第二端固定到基板的第二部分。 由交织股线构成的织物,其包括压电纳米线从其径向延伸的纤维和从其延伸的导电纳米结构的纤维构造成发电。