SPEECH TRANSLATION METHOD AND APPARATUS
    1.
    发明申请
    SPEECH TRANSLATION METHOD AND APPARATUS 有权
    语音翻译方法和设备

    公开(公告)号:US20100114556A1

    公开(公告)日:2010-05-06

    申请号:US12609647

    申请日:2009-10-30

    IPC分类号: G06F17/28

    摘要: A method and apparatus for speech translation. The method includes: receiving a source speech; extracting non-text information in the source speech; translating the source speech into a target speech; and adjusting the translated target speech according to the extracted non-text information so that the target speech preserves the non-text information in the source speech. The apparatus includes: a receiving module for receiving source speech; an extracting module for extracting non-text information in the source speech; a translation module for translating the source speech into a target speech; and an adjusting module for adjusting the translated target speech according to the extracted non-text information so that the target speech preserves the non-text information in the source speech.

    摘要翻译: 一种用于语音翻译的方法和装置。 该方法包括:接收源语音; 提取源语音中的非文本信息; 将源语言翻译成目标语音; 以及根据提取的非文本信息调整翻译的目标语音,使得目标语音保留源语音中的非文本信息。 该装置包括:用于接收源语音的接收模块; 提取模块,用于提取源语音中的非文本信息; 用于将源语音翻译成目标语音的翻译模块; 以及调整模块,用于根据提取的非文本信息调整翻译的目标语音,使得目标语音保留源语音中的非文本信息。

    Speech translation method and apparatus utilizing prosodic information
    2.
    发明授权
    Speech translation method and apparatus utilizing prosodic information 有权
    语音翻译方法和装置利用韵律信息

    公开(公告)号:US09342509B2

    公开(公告)日:2016-05-17

    申请号:US12609647

    申请日:2009-10-30

    摘要: A method and apparatus for speech translation. The method includes: receiving a source speech; extracting non-text information in the source speech; translating the source speech into a target speech; and adjusting the translated target speech according to the extracted non-text information so that the target speech preserves the non-text information in the source speech. The apparatus includes: a receiving module for receiving source speech; an extracting module for extracting non-text information in the source speech; a translation module for translating the source speech into a target speech; and an adjusting module for adjusting the translated target speech according to the extracted non-text information so that the target speech preserves the non-text information in the source speech.

    摘要翻译: 一种用于语音翻译的方法和装置。 该方法包括:接收源语音; 提取源语音中的非文本信息; 将源语言翻译成目标语音; 以及根据提取的非文本信息调整翻译的目标语音,使得目标语音保留源语音中的非文本信息。 该装置包括:用于接收源语音的接收模块; 提取模块,用于提取源语音中的非文本信息; 用于将源语音翻译成目标语音的翻译模块; 以及调整模块,用于根据提取的非文本信息调整翻译的目标语音,使得目标语音保留源语音中的非文本信息。

    Method and apparatus for speech analysis and synthesis
    3.
    发明授权
    Method and apparatus for speech analysis and synthesis 有权
    用于语音分析和综合的方法和装置

    公开(公告)号:US08280739B2

    公开(公告)日:2012-10-02

    申请号:US12061645

    申请日:2008-04-03

    IPC分类号: G10L13/00

    CPC分类号: G10L13/04 G10L25/48

    摘要: The present invention provides a speech analysis method comprising steps of obtaining a speech signal and a corresponding DEGG/EGG signal; regarding the speech signal as the output of a vocal tract filter in a source-filter model taking the DEGG/EGG signal as the input; and estimating the features of the vocal tract filter from the speech signal as the output and the DEGG/EGG signal as the input, wherein the features of the vocal tract filter are expressed by the state vectors of the vocal tract filter at selected time points, and the step of estimating is performed using Kalman filtering.

    摘要翻译: 本发明提供一种语音分析方法,包括获得语音信号和对应的DEGG / EGG信号的步骤; 将语音信号作为采用DEGG / EGG信号作为输入的源滤波器模型中的声道滤波器的输出; 并从语音信号作为输出和DEGG / EGG信号作为输入来估计声道滤波器的特征,其中声道滤波器的特征由选择的时间点处的声道滤波器的状态矢量表示, 并且使用卡尔曼滤波执行估计步骤。

    Audio archive generation and presentation

    公开(公告)号:US09025736B2

    公开(公告)日:2015-05-05

    申请号:US12025535

    申请日:2008-02-04

    摘要: A method, information processing system, and computer program storage product for automatically generating auditory archives in a customer service environment are disclosed. A communication link with an end user is established. An information form is retrieved. The information form includes at least a category choice information set and at least one audio recoding information set. The end user is prompted to answer a set of questions based on information in the information form. A data set associated with each answer to each question in the set of questions given by the end user is stored. The data is stored under a set of fields corresponding to a question. Each data set stored under the set of fields for each question in the set of questions are combined with each other. An audio archive file is generated including the data sets that have been combined.

    Voice conversion method and system
    5.
    发明授权
    Voice conversion method and system 有权
    语音转换方法和系统

    公开(公告)号:US08234110B2

    公开(公告)日:2012-07-31

    申请号:US12240148

    申请日:2008-09-29

    IPC分类号: G10L19/06

    CPC分类号: G10L21/00 G10L2021/0135

    摘要: A method, system and computer program product for voice conversion. The method includes performing speech analysis on the speech of a source speaker to achieve speech information; performing spectral conversion based on said speech information, to at least achieve a first spectrum similar to the speech of a target speaker; performing unit selection on the speech of said target speaker at least using said first spectrum as a target; replacing at least part of said first spectrum with the spectrum of the selected target speaker's speech unit; and performing speech reconstruction at least based on the replaced spectrum.

    摘要翻译: 用于语音转换的方法,系统和计算机程序产品。 该方法包括对源扬声器的语音执行语音分析以实现语音信息; 基于所述语音信息执行频谱转换,至少实现与目标说话者的语音相似的第一频谱; 至少使用所述第一光谱作为目标对所述目标说话者的语音进行单元选择; 用所选择的目标说话者的语音单元的频谱代替所述第一频谱的至少一部分; 并至少基于所替换的频谱执行语音重建。

    VOICE CONVERSION METHOD AND SYSTEM
    6.
    发明申请
    VOICE CONVERSION METHOD AND SYSTEM 有权
    语音转换方法与系统

    公开(公告)号:US20090089063A1

    公开(公告)日:2009-04-02

    申请号:US12240148

    申请日:2008-09-29

    IPC分类号: G10L21/00

    CPC分类号: G10L21/00 G10L2021/0135

    摘要: A method, system and computer program product for voice conversion. The method includes performing speech analysis on the speech of a source speaker to achieve speech information; performing spectral conversion based on said speech information, to at least achieve a first spectrum similar to the speech of a target speaker; performing unit selection on the speech of said target speaker at least using said first spectrum as a target; replacing at least part of said first spectrum with the spectrum of the selected target speaker's speech unit; and performing speech reconstruction at least based on the replaced spectrum.

    摘要翻译: 用于语音转换的方法,系统和计算机程序产品。 该方法包括对源扬声器的语音执行语音分析以实现语音信息; 基于所述语音信息执行频谱转换,至少实现与目标说话者的语音相似的第一频谱; 至少使用所述第一光谱作为目标对所述目标说话者的语音进行单元选择; 用所选择的目标说话者的语音单元的频谱代替所述第一频谱的至少一部分; 并至少基于所替换的频谱执行语音重建。

    METHOD AND APPARATUS FOR SPEECH ANALYSIS AND SYNTHESIS
    7.
    发明申请
    METHOD AND APPARATUS FOR SPEECH ANALYSIS AND SYNTHESIS 有权
    用于语音分析和合成的方法和装置

    公开(公告)号:US20080288258A1

    公开(公告)日:2008-11-20

    申请号:US12061645

    申请日:2008-04-03

    IPC分类号: G10L13/00

    CPC分类号: G10L13/04 G10L25/48

    摘要: The present invention provides a speech analysis method comprising steps of obtaining a speech signal and a corresponding DEGG/EGG signal; regarding the speech signal as the output of a vocal tract filter in a source-filter model taking the DEGG/EGG signal as the input; and estimating the features of the vocal tract filter from the speech signal as the output and the DEGG/EGG signal as the input, wherein the features of the vocal tract filter are expressed by the state vectors of the vocal tract filter at selected time points, and the step of estimating is performed using Kalman filtering.

    摘要翻译: 本发明提供一种语音分析方法,包括获得语音信号和对应的DEGG / EGG信号的步骤; 将语音信号作为采用DEGG / EGG信号作为输入的源滤波器模型中的声道滤波器的输出; 并从语音信号作为输出和DEGG / EGG信号作为输入来估计声道滤波器的特征,其中声道滤波器的特征由选择的时间点处的声道滤波器的状态矢量表示, 并且使用卡尔曼滤波执行估计步骤。

    AUDIO ARCHIVE GENERATION AND PRESENTATION
    8.
    发明申请
    AUDIO ARCHIVE GENERATION AND PRESENTATION 有权
    音频存档生成和演示

    公开(公告)号:US20080187109A1

    公开(公告)日:2008-08-07

    申请号:US12025535

    申请日:2008-02-04

    IPC分类号: H04M1/64

    摘要: A method, information processing system, and computer program storage product for automatically generating auditory archives in a customer service environment are disclosed. A communication link with an end user is established. An information form is retrieved. The information form includes at least a category choice information set and at least one audio recoding information set. The end user is prompted to answer a set of questions based on information in the information form. A data set associated with each answer to each question in the set of questions given by the end user is stored. The data is stored under a set of fields corresponding to a question. Each data set stored under the set of fields for each question in the set of questions are combined with each other. An audio archive file is generated including the data sets that have been combined.

    摘要翻译: 公开了一种在客户服务环境中自动生成听觉档案的方法,信息处理系统和计算机程序存储产品。 建立与终端用户的通信链路。 检索一个信息表单。 所述信息表格至少包括类别选择信息集和至少一个音频重新编码信息集。 提示最终用户基于信息表单中的信息来回答一组问题。 存储与最终用户给出的一组问题中的每个问题的每个答案相关联的数据集。 数据存储在与问题相对应的一组字段下。 存储在该组问题中的每个问题的字段集下的每个数据集彼此组合。 生成包括已组合的数据集的音频档案文件。

    Method and apparatus for automatically converting voice
    9.
    发明授权
    Method and apparatus for automatically converting voice 有权
    自动转换语音的方法和装置

    公开(公告)号:US08170878B2

    公开(公告)日:2012-05-01

    申请号:US12181553

    申请日:2008-07-29

    IPC分类号: G10L19/00

    摘要: The invention proposes a method and apparatus for significantly improving the quality of voice morphing and guaranteeing the similarity of converted voice. The invention sets several standard speakers in a TTS database, and selects the voices of different standard speakers for speech synthesis according to different roles, wherein the voice of the selected standard speaker is similar to the original role to a certain extent. Then the invention further performs voice morphing on the standard voice similar to the original voice to a certain extent, in order to accurately mimic the voice of the original speaker, so as to make the converted voice closer to the original voice features while guaranteeing the similarity.

    摘要翻译: 本发明提出了一种显着提高语音变形质量并保证转换语音相似性的方法和装置。 本发明在TTS数据库中设置几个标准扬声器,并根据不同的角色选择不同标准扬声器的语音合成语音,其中所选标准扬声器的声音在一定程度上与原始角色相似。 然后本发明在一定程度上进一步对与原始语音相似的标准语音进行声音变形,以便准确地模拟原始扬声器的声音,以便使转换的语音更接近原始语音特征,同时保证相似性 。

    Method and Apparatus for Automatically Converting Voice
    10.
    发明申请
    Method and Apparatus for Automatically Converting Voice 有权
    自动转换语音的方法和装置

    公开(公告)号:US20090037179A1

    公开(公告)日:2009-02-05

    申请号:US12181553

    申请日:2008-07-29

    IPC分类号: G10L13/08

    摘要: The invention proposes a method and apparatus for significantly improving the quality of voice morphing and guaranteeing the similarity of converted voice. The invention sets several standard speakers in a TTS database, and selects the voices of different standard speakers for speech synthesis according to different roles, wherein the voice of the selected standard speaker is similar to the original role to a certain extent. Then the invention further performs voice morphing on the standard voice similar to the original voice to a certain extent, in order to accurately mimic the voice of the original speaker, so as to make the converted voice closer to the original voice features while guaranteeing the similarity.

    摘要翻译: 本发明提出了一种显着提高语音变形质量并保证转换语音相似性的方法和装置。 本发明在TTS数据库中设置几个标准扬声器,并根据不同的角色选择不同标准扬声器的语音合成语音,其中所选标准扬声器的声音在一定程度上与原始角色相似。 然后本发明在一定程度上进一步对与原始语音相似的标准语音进行声音变形,以便准确地模拟原始扬声器的声音,以便使转换的语音更接近原始语音特征,同时保证相似性 。