-
公开(公告)号:US20130054244A1
公开(公告)日:2013-02-28
申请号:US13221953
申请日:2011-08-31
申请人: Shenghua Bao , Jian Chen , Yong Qin , Qin Shi , Zhiwei Shuang , Zhong Su , Liu Wen , Shi Lei Zhang
发明人: Shenghua Bao , Jian Chen , Yong Qin , Qin Shi , Zhiwei Shuang , Zhong Su , Liu Wen , Shi Lei Zhang
IPC分类号: G10L13/08
摘要: A method and system for achieving emotional text to speech. The method includes: receiving text data; generating emotion tag for the text data by a rhythm piece; and achieving TTS to the text data corresponding to the emotion tag, where the emotion tags are expressed as a set of emotion vectors; where each emotion vector includes a plurality of emotion scores given based on a plurality of emotion categories. A system for the same includes: a text data receiving module; an emotion tag generating module; and a TTS module for achieving TTS, wherein the emotion tag is expressed as a set of emotion vectors; and wherein emotion vector includes a plurality of emotion scores given based on a plurality of emotion categories.
-
公开(公告)号:US08280739B2
公开(公告)日:2012-10-02
申请号:US12061645
申请日:2008-04-03
申请人: Dan Ning Jiang , Fan Ping Meng , Yong Qin , Zhi Wei Shuang
发明人: Dan Ning Jiang , Fan Ping Meng , Yong Qin , Zhi Wei Shuang
IPC分类号: G10L13/00
摘要: The present invention provides a speech analysis method comprising steps of obtaining a speech signal and a corresponding DEGG/EGG signal; regarding the speech signal as the output of a vocal tract filter in a source-filter model taking the DEGG/EGG signal as the input; and estimating the features of the vocal tract filter from the speech signal as the output and the DEGG/EGG signal as the input, wherein the features of the vocal tract filter are expressed by the state vectors of the vocal tract filter at selected time points, and the step of estimating is performed using Kalman filtering.
摘要翻译: 本发明提供一种语音分析方法,包括获得语音信号和对应的DEGG / EGG信号的步骤; 将语音信号作为采用DEGG / EGG信号作为输入的源滤波器模型中的声道滤波器的输出; 并从语音信号作为输出和DEGG / EGG信号作为输入来估计声道滤波器的特征,其中声道滤波器的特征由选择的时间点处的声道滤波器的状态矢量表示, 并且使用卡尔曼滤波执行估计步骤。
-
公开(公告)号:US20110270605A1
公开(公告)日:2011-11-03
申请号:US13097191
申请日:2011-04-29
申请人: Yong Qin , Qin Shi , Zhiwei Shuang , Shi Lei Zhang
发明人: Yong Qin , Qin Shi , Zhiwei Shuang , Shi Lei Zhang
CPC分类号: G10L25/48
摘要: A method, system and computer readable storage medium for assessing speech prosody. The method includes the steps of: receiving input speech data; acquiring a prosody constraint; assessing prosody of the input speech data according to the prosody constraint; and providing assessment result where at least of the steps is carried out using a computer device.
摘要翻译: 一种用于评估言语韵律的方法,系统和计算机可读存储介质。 该方法包括以下步骤:接收输入语音数据; 获得韵律约束; 根据韵律约束评估输入语音数据的韵律; 并且提供使用计算机设备进行至少步骤的评估结果。
-
公开(公告)号:US20110054901A1
公开(公告)日:2011-03-03
申请号:US12869921
申请日:2010-08-27
申请人: Yong Qin , Qin Shi , Zhiwei Shuang , Shi Lei Zhang , Jie Zhou
发明人: Yong Qin , Qin Shi , Zhiwei Shuang , Shi Lei Zhang , Jie Zhou
IPC分类号: G10L15/04
CPC分类号: G06F17/2827
摘要: A method and apparatus for aligning texts. The method includes acquiring a target text and a reference text and aligning the target text and the reference text at word level based on phoneme similarity. The method can be applied to automatically archiving a multimedia resource and a method of automatically searching a multimedia resource.
摘要翻译: 一种用于对齐文本的方法和装置。 该方法包括获取目标文本和参考文本,并且基于音素相似性将目标文本和文本级别的参考文本对准。 该方法可以应用于自动归档多媒体资源和自动搜索多媒体资源的方法。
-
公开(公告)号:US20100114556A1
公开(公告)日:2010-05-06
申请号:US12609647
申请日:2009-10-30
申请人: Fan Ping Meng , Yong Qin , Zhi Wei Shuang , Shi Lei Zhang
发明人: Fan Ping Meng , Yong Qin , Zhi Wei Shuang , Shi Lei Zhang
IPC分类号: G06F17/28
CPC分类号: G06F17/289 , G06F17/27 , G10L13/033 , G10L13/08 , G10L21/00
摘要: A method and apparatus for speech translation. The method includes: receiving a source speech; extracting non-text information in the source speech; translating the source speech into a target speech; and adjusting the translated target speech according to the extracted non-text information so that the target speech preserves the non-text information in the source speech. The apparatus includes: a receiving module for receiving source speech; an extracting module for extracting non-text information in the source speech; a translation module for translating the source speech into a target speech; and an adjusting module for adjusting the translated target speech according to the extracted non-text information so that the target speech preserves the non-text information in the source speech.
摘要翻译: 一种用于语音翻译的方法和装置。 该方法包括:接收源语音; 提取源语音中的非文本信息; 将源语言翻译成目标语音; 以及根据提取的非文本信息调整翻译的目标语音,使得目标语音保留源语音中的非文本信息。 该装置包括:用于接收源语音的接收模块; 提取模块,用于提取源语音中的非文本信息; 用于将源语音翻译成目标语音的翻译模块; 以及调整模块,用于根据提取的非文本信息调整翻译的目标语音,使得目标语音保留源语音中的非文本信息。
-
公开(公告)号:US20090299746A1
公开(公告)日:2009-12-03
申请号:US12472724
申请日:2009-05-27
申请人: Fan Ping Meng , Yong Qin , Qin Shi , Zhiwei Shuang
发明人: Fan Ping Meng , Yong Qin , Qin Shi , Zhiwei Shuang
IPC分类号: G10L13/08
摘要: A method for performing speech synthesis to a textual content at a client. The method includes the steps of: performing speech synthesis to the textual content based on a current acoustical unit set Scurrent in a corpus at the client; analyzing the textual content and generating a list of target units with corresponding context features, selecting multiple acoustical unit candidates for each target unit according to the context features based on an acoustical unit set Stotal that is more plentiful than the current acoustical unit set Scurrent in the corpus at the client, and determining acoustical units suitable for speech synthesis for the textual content according to the multiple unit candidates; and updating the current acoustical unit set Scurrent in the corpus at the client based on the determined acoustical units.
摘要翻译: 一种用于对客户端的文本内容执行语音合成的方法。 该方法包括以下步骤:基于在客户端的语料库中的当前声学单元集Scurrent对文本内容执行语音合成; 分析文本内容并生成具有相应上下文特征的目标单元的列表,根据上下文特征,基于声音单元组Stotal选择多个声学单元候选,该声音单元组Stotal比当前声学单元组S Current 客户端的语料库,并根据多个单元候选确定适用于文本内容的语音合成的声学单元; 以及基于所确定的声学单元来更新客户端处的语料库中的当前声学单元组Scurrent。
-
-
38.
公开(公告)号:US09342509B2
公开(公告)日:2016-05-17
申请号:US12609647
申请日:2009-10-30
申请人: Fan Ping Meng , Yong Qin , Zhi Wei Shuang , Shi Lei Zhang
发明人: Fan Ping Meng , Yong Qin , Zhi Wei Shuang , Shi Lei Zhang
IPC分类号: G06F17/27 , G10L13/033 , G10L13/08 , G10L21/00 , G06F17/28
CPC分类号: G06F17/289 , G06F17/27 , G10L13/033 , G10L13/08 , G10L21/00
摘要: A method and apparatus for speech translation. The method includes: receiving a source speech; extracting non-text information in the source speech; translating the source speech into a target speech; and adjusting the translated target speech according to the extracted non-text information so that the target speech preserves the non-text information in the source speech. The apparatus includes: a receiving module for receiving source speech; an extracting module for extracting non-text information in the source speech; a translation module for translating the source speech into a target speech; and an adjusting module for adjusting the translated target speech according to the extracted non-text information so that the target speech preserves the non-text information in the source speech.
摘要翻译: 一种用于语音翻译的方法和装置。 该方法包括:接收源语音; 提取源语音中的非文本信息; 将源语言翻译成目标语音; 以及根据提取的非文本信息调整翻译的目标语音,使得目标语音保留源语音中的非文本信息。 该装置包括:用于接收源语音的接收模块; 提取模块,用于提取源语音中的非文本信息; 用于将源语音翻译成目标语音的翻译模块; 以及调整模块,用于根据提取的非文本信息调整翻译的目标语音,使得目标语音保留源语音中的非文本信息。
-
公开(公告)号:US20130268270A1
公开(公告)日:2013-10-10
申请号:US13440176
申请日:2012-04-05
申请人: Dan Ning Jiang , Vaibhava Goel , Dimitri Kanevsky , Yong Qin
发明人: Dan Ning Jiang , Vaibhava Goel , Dimitri Kanevsky , Yong Qin
IPC分类号: G10L15/00
CPC分类号: G10L15/07
摘要: A method is described for use with automatic speech recognition using discriminative criteria for speaker adaptation. An adaptation evaluation is performed of speech recognition performance data for speech recognition system users. Adaptation candidate users are identified based on the adaptation evaluation for whom an adaptation process is likely to improve system performance.
摘要翻译: 描述了一种使用自动语音识别的方法,使用用于说话者适应的歧视性标准。 对语音识别系统用户的语音识别性能数据进行适应评估。 适应候选用户是根据对自适应过程有可能提高系统性能的适应性评估来确定的。
-
公开(公告)号:US08527272B2
公开(公告)日:2013-09-03
申请号:US12869921
申请日:2010-08-27
申请人: Yong Qin , Qin Shi , Zhiwei Shuang , Shi Lei Zhang , Jie Zhou
发明人: Yong Qin , Qin Shi , Zhiwei Shuang , Shi Lei Zhang , Jie Zhou
IPC分类号: G10L15/04
CPC分类号: G06F17/2827
摘要: A method and apparatus for aligning texts. The method includes acquiring a target text and a reference text and aligning the target text and the reference text at word level based on phoneme similarity. The method can be applied to automatically archiving a multimedia resource and a method of automatically searching a multimedia resource.
摘要翻译: 一种用于对齐文本的方法和装置。 该方法包括获取目标文本和参考文本,并且基于音素相似性将目标文本和文本级别的参考文本对准。 该方法可以应用于自动归档多媒体资源和自动搜索多媒体资源的方法。
-
-
-
-
-
-
-
-
-