SPEECH ANALYSIS DEVICE, SPEECH ANALYSIS AND SYNTHESIS DEVICE, CORRECTION RULE INFORMATION GENERATION DEVICE, SPEECH ANALYSIS SYSTEM, SPEECH ANALYSIS METHOD, CORRECTION RULE INFORMATION GENERATION METHOD, AND PROGRAM
    1.
    发明申请
    SPEECH ANALYSIS DEVICE, SPEECH ANALYSIS AND SYNTHESIS DEVICE, CORRECTION RULE INFORMATION GENERATION DEVICE, SPEECH ANALYSIS SYSTEM, SPEECH ANALYSIS METHOD, CORRECTION RULE INFORMATION GENERATION METHOD, AND PROGRAM 审中-公开
    语音分析设备,语音分析和合成设备,校正规则信息生成设备,语音分析系统,语音分析方法,校正规则信息生成方法和程序

    公开(公告)号:US20100217584A1

    公开(公告)日:2010-08-26

    申请号:US12773168

    申请日:2010-05-04

    CPC classification number: G10L21/0208 G10L19/0204

    Abstract: A speech analysis device which accurately analyzes an aperiodic component included in speech in a practical environment where there is background noise includes: a frequency band division unit which divides, into bandpass signals each associated with a corresponding one of frequency bands, an input signal representing a mixed sound of background noise and speech; a noise interval identification unit which identifies a noise interval and a speech interval of the input signal; an SNR calculation unit which calculates an SN ratio; a correlation function calculation unit which calculates an autocorrelation function of each bandpass signal; a correction amount determination unit which determines a correction amount for an aperiodic component ratio, based on the calculated SN ratio; and an aperiodic component ratio calculation unit which calculates, for each frequency band, an aperiodic component ratio of the aperiodic component, based on the determined correction amount and the calculated autocorrelation function.

    Abstract translation: 在具有背景噪声的实际环境中精确地分析包含在语音中的非周期成分的语音分析装置包括:频带分割单元,将分别与频带中相应的一个频带相关联的带通信号分成代表 混合声音的背景噪音和言语; 噪声间隔识别单元,其识别输入信号的噪声间隔和语音间隔; SNR计算单元,其计算SN比; 相关函数计算单元,其计算每个带通信号的自相关函数; 校正量确定单元,其基于所计算的SN比确定非周期分量比的校正量; 以及非周期分量比计算单元,其基于所确定的校正量和所计算的自相关函数,针对每个频带计算非周期性分量的非周期分量比。

    Speech synthesizer, speech synthesizing method, and program
    3.
    发明申请
    Speech synthesizer, speech synthesizing method, and program 有权
    语音合成器,语音合成方法和程序

    公开(公告)号:US20070203702A1

    公开(公告)日:2007-08-30

    申请号:US11783855

    申请日:2007-04-12

    CPC classification number: G10L13/06 G10L13/04

    Abstract: A speech synthesizer that provides high-quality sound along with stable sound quality, including: a target parameter generation unit; a speech element DB; an element selection unit; a mixed parameter judgment unit which determines an optimum parameter combination of target parameters and speech elements; a parameter integration unit which integrates the parameters; and a waveform generation unit which generates synthetic speech. High-quality and stable synthetic speech is generated by combining, per parameter dimension, the parameters with stable sound quality generated by the target parameter generation unit with speech elements with high sound quality and a sense of true speech selected by the element selection unit.

    Abstract translation: 一种提供高质量声音以及稳定音质的语音合成器,包括:目标参数产生单元; 语音元件DB; 元件选择单元; 混合参数判断单元,其确定目标参数和语音元素的最佳参数组合; 参数集成单元,其集成参数; 以及生成合成语音的波形生成单元。 通过将参数尺寸与由目标参数生成单元生成的稳定声音的参数与具有高音质的语音元素和由元素选择单元选择的真实语音感觉相结合,产生高质量和稳定的合成语音。

    Speech analyzer and speech analysis method
    4.
    发明授权
    Speech analyzer and speech analysis method 有权
    语音分析仪和语音分析方法

    公开(公告)号:US08370153B2

    公开(公告)日:2013-02-05

    申请号:US12772439

    申请日:2010-05-03

    CPC classification number: G10L19/06 G10L25/12 G10L25/90

    Abstract: A speech analyzer includes a vocal tract and sound source separating unit which separates a vocal tract feature and a sound source feature from an input speech, based on a speech generation model, a fundamental frequency stability calculating unit which calculates a temporal stability of a fundamental frequency of the input speech in the sound source feature, from the separated sound source feature, a stable analyzed period extracting unit which extracts time information of a stable period, based on the temporal stability, and a vocal tract feature interpolation unit which interpolates a vocal tract feature which is not included in the stable period, using a vocal tract feature included in the extracted stable period.

    Abstract translation: 语音分析器包括基于语音产生模型分离声道特征和声源特征的声道和声源分离单元,基本频率稳定性计算单元,其计算基频的时间稳定性 声源特征中的输入语音,来自分离的声源特征,稳定的分析周期提取单元,其基于时间稳定性提取稳定周期的时间信息;以及声道特征插值单元,其插入声道 特征,不包括在稳定期间,使用包括在提取的稳定期间中的声道特征。

    Speech separating apparatus, speech synthesizing apparatus, and voice quality conversion apparatus
    5.
    发明授权
    Speech separating apparatus, speech synthesizing apparatus, and voice quality conversion apparatus 有权
    语音分离装置,语音合成装置和语音质量转换装置

    公开(公告)号:US08255222B2

    公开(公告)日:2012-08-28

    申请号:US12447519

    申请日:2008-08-06

    CPC classification number: G10L21/02 G10L13/04 G10L19/04 G10L19/06 G10L19/08

    Abstract: A speech separating apparatus includes: a PARCOR calculating unit that extracts vocal tract information from an input speech signal; a filter smoothing unit that smoothes, in a first time constant, the vocal tract information extracted by the PARCOR calculating unit; an inverse filtering unit that calculates a filter coefficient of a filter having a frequency amplitude response characteristic inverse to the vocal tract information smoothed by the filter smoothing unit, so as to filter the input speech signal using the filter having the calculated filter coefficient; and a voicing source modeling unit that cuts out, from the input speech signal filtered by the inverse filtering unit, a waveform included in a second time constant shorter than the first time constant, so as to calculate, for each waveform that is taken, voicing source information from the each waveform.

    Abstract translation: 语音分离装置包括:PARCOR计算单元,从输入语音信号中提取声道信息; 滤波器平滑单元,其以第一时间常数平滑由PARCOR计算单元提取的声道信息; 逆滤波单元,计算具有与由滤波平滑单元平滑化的声道信息相反的频率振幅响应特性的滤波器的滤波器系数,以便使用具有计算滤波器系数的滤波器对输入的语音信号进行滤波; 以及发声源建模单元,其从所述逆滤波单元滤波的输入语音信号中切出包含在比所述第一时间常数短的第二时间常数中的波形,以便针对所拍摄的每个波形计算发声 源信息从每个波形。

    VOICE QUALITY CONVERSION DEVICE, METHOD OF MANUFACTURING THE VOICE QUALITY CONVERSION DEVICE, VOWEL INFORMATION GENERATION DEVICE, AND VOICE QUALITY CONVERSION SYSTEM
    6.
    发明申请
    VOICE QUALITY CONVERSION DEVICE, METHOD OF MANUFACTURING THE VOICE QUALITY CONVERSION DEVICE, VOWEL INFORMATION GENERATION DEVICE, AND VOICE QUALITY CONVERSION SYSTEM 审中-公开
    语音质量转换装置,制造语音质量转换装置的方法,VOWEL信息生成装置和语音质量转换系统

    公开(公告)号:US20120095767A1

    公开(公告)日:2012-04-19

    申请号:US13334119

    申请日:2011-12-22

    CPC classification number: G10L13/033 G10L2021/0135

    Abstract: A device includes: an input speech separation unit which separates an input speech into vocal tract information and voicing source information; a mouth opening degree calculation unit which calculates a mouth opening degree from the vocal tract information; a target vowel database storage unit which stores pieces of vowel information on a target speaker; an agreement degree calculation unit which calculates a degree of agreement between the calculated mouth opening degree and a mouth opening degree included in the vowel information; a target vowel selection unit which selects the vowel information from among the pieces of vowel information, based on the calculated agreement degree; a vowel transformation unit which transforms the vocal tract information on the input speech, using vocal tract information included in the selected vowel information; and a synthesis unit which generates a synthetic speech using the transformed vocal tract information and the voicing source information.

    Abstract translation: 一种设备包括:输入语音分离单元,其将输入语音分离成声道信息和发声源信息; 嘴开度计算单元,从声道信息计算开口度; 目标元音数据库存储单元,其在目标说话者上存储元音信息; 协调度计算单元,计算计算出的开口程度与包含在元音信息中的开口度之间的一致程度; 目标元音选择单元,根据计算出的协议程度从元音信息中选出元音信息; 元音变换单元,其使用包括在所选择的元音信息中的声道信息来变换输入语音的声道信息; 以及合成单元,其使用变换的声道信息和发声源信息来生成合成语音。

    VOICE QUALITY EDIT DEVICE AND VOICE QUALITY EDIT METHOD
    7.
    发明申请
    VOICE QUALITY EDIT DEVICE AND VOICE QUALITY EDIT METHOD 有权
    语音质量编辑设备和语音质量编辑方法

    公开(公告)号:US20100250257A1

    公开(公告)日:2010-09-30

    申请号:US12438642

    申请日:2008-06-04

    CPC classification number: G10L13/033 G10L13/04

    Abstract: This invention includes: a voice quality feature database (101) holding voice quality features; a speaker attribute database (106) holding, for each voice quality feature, an identifier enabling a user to expect a voice quality of the voice quality feature; a weight setting unit (103) setting a weight for each acoustic feature of a voice quality; a scaling unit (105) calculating display coordinates of each voice quality feature based on the acoustic features in the voice quality feature and the weights set by the weight setting unit (103); a display unit (107) displaying the identifier of each voice quality feature on the calculated display coordinates; a position input unit (108) receiving designated coordinates; and a voice quality mix unit (110) (i) calculating a distance between (1) the received designated coordinates and (2) the display coordinates of each of a part or all of the voice quality features, and (ii) mixing the acoustic features of the part or all of the voice quality features together based on a ratio between the calculated distances in order to generate a new voice quality feature.

    Abstract translation: 本发明包括:保持语音质量特征的语音质量特征数据库(101); 扬声器属性数据库(106),其针对每个语音质量特征保持使得用户期望语音质量特征的语音质量的标识符; 权重设定单元,设定语音质量的每个声学特征的权重; 缩放单元(105),基于语音质量特征中的声学特征和由权重设置单元(103)设置的权重来计算每个语音质量特征的显示坐标; 显示单元(107),其在所计算的显示坐标上显示每个语音质量特征的标识符; 接收指定坐标的位置输入单元(108) 以及语音质量混合单元(110)(i)计算(1)接收的指定坐标之间的距离和(2)声音质量特征的一部分或全部中的每一个的显示坐标,以及(ii)混合声学 基于计算出的距离之间的比例,将部分或全部声音质量特征的特征组合在一起,以便产生新的语音质量特征。

    SPEECH ANALYZER AND SPEECH ANALYSYS METHOD
    8.
    发明申请
    SPEECH ANALYZER AND SPEECH ANALYSYS METHOD 有权
    语音分析和语音分析方法

    公开(公告)号:US20100204990A1

    公开(公告)日:2010-08-12

    申请号:US12772439

    申请日:2010-05-03

    CPC classification number: G10L19/06 G10L25/12 G10L25/90

    Abstract: A speech analyzer includes a vocal tract and sound source separating unit which separates a vocal tract feature and a sound source feature from an input speech, based on a speech generation model, a fundamental frequency stability calculating unit which calculates a temporal stability of a fundamental frequency of the input speech in the sound source feature, from the separated sound source feature, a stable analyzed period extracting unit which extracts time information of a stable period, based on the temporal stability, and a vocal tract feature interpolation unit which interpolates a vocal tract feature which is not included in the stable period, using a vocal tract feature included in the extracted stable period.

    Abstract translation: 语音分析器包括基于语音产生模型分离声道特征和声源特征的声道和声源分离单元,基本频率稳定性计算单元,其计算基频的时间稳定性 声源特征中的输入语音,来自分离的声源特征,稳定的分析周期提取单元,其基于时间稳定性提取稳定周期的时间信息;以及声道特征插值单元,其插入声道 特征,不包括在稳定期间,使用包括在提取的稳定期间中的声道特征。

    Speech synthesis apparatus and speech synthesis method
    9.
    发明申请
    Speech synthesis apparatus and speech synthesis method 有权
    语音合成装置和语音合成方法

    公开(公告)号:US20060136213A1

    公开(公告)日:2006-06-22

    申请号:US11352380

    申请日:2006-02-13

    CPC classification number: G10L13/033 G10L13/04

    Abstract: A speech synthesis apparatus which can appropriately transform a voice characteristic of a speech is provided. The speech synthesis apparatus includes an element storing unit in which speech elements are stored, a function storing unit in which transformation functions are stored, an adaptability judging unit which derives a degree of similarity by comparing a speech element stored in the element storing unit with an acoustic characteristic of the speech element used for generating a transformation function stored in the function storing unit, and a selecting unit and voice characteristic transforming unit which transforms, for each speech element stored in the element storing unit, based on the degree of similarity derived by the adaptability judging unit, a voice characteristic of the speech element by applying one of the transformation functions stored in the function storing unit.

    Abstract translation: 提供了可以适当地变换语音的语音特性的语音合成装置。 语音合成装置包括存储有语音元素的元素存储单元,存储变换函数的函数存储单元,通过将存储在元素存储单元中的语音元素与存储单元存储单元的语音元素进行比较而导出相似度的适应性判断单元 用于产生存储在功能存储单元中的变换函数的语音元素的声学特性,以及选择单元和语音特征变换单元​​,用于对存储在元素存储单元中的每个语音元素,基于由 适应性判断单元,通过应用存储在功能存储单元中的一个变换函数来实现语音元素的语音特征。

    Emotion recognition apparatus
    10.
    发明授权
    Emotion recognition apparatus 有权
    情感识别装置

    公开(公告)号:US08204747B2

    公开(公告)日:2012-06-19

    申请号:US11997458

    申请日:2007-05-21

    CPC classification number: G10L17/26 G10L2015/025

    Abstract: An emotion recognition apparatus performs accurate and stable speech-based emotion recognition, irrespective of individual, regional, and language differences of prosodic information. The emotion recognition apparatus includes: a speech recognition unit which recognizes types of phonemes included in the input speech; a characteristic tone detection unit which detects a characteristic tone that relates to a specific emotion, in the input speech; a characteristic tone occurrence indicator computation unit which computes a characteristic tone occurrence indicator for each of the phonemes, based on the types of the phonemes recognized by the speech recognition unit, the characteristic tone occurrence indicator relating to an occurrence frequency of the characteristic tone; and an emotion judgment unit which judges an emotion of the speaker in a phoneme at which the characteristic tone occurs in the input speech, based on the characteristic tone occurrence indicator computed by the characteristic tone occurrence indicator computing unit.

    Abstract translation: 情感识别装置执行准确和稳定的基于语音的情感识别,而不管韵律信息的个体,区域和语言差异。 情感识别装置包括:语音识别单元,其识别输入语音中包括的音素的类型; 在输入语音中检测与特定情感相关的特征音的特征音检测单元; 特征音发生指示符计算单元,其基于由语音识别单元识别的音素的类型来计算每个音素的特征音发生指示符,与特征音的发生频率相关的特征音发生指示符; 以及情绪判断单元,其基于由特征音发生指示符计算单元计算的特征音发生指标,判断在输入语音中出现特征音的音素中的说话者的情感。

Patent Agency Ranking