Voice synthesis apparatus
    2.
    发明公开
    Voice synthesis apparatus 有权
    Sprachsynthese

    公开(公告)号:EP2530671A2

    公开(公告)日:2012-12-05

    申请号:EP12169235.4

    申请日:2012-05-24

    IPC分类号: G10L13/06

    CPC分类号: G10L13/06 G10L25/93

    摘要: In a voice synthesis apparatus, a phoneme piece interpolator part acquires first phoneme piece data of a phoneme piece corresponding to a first value of sound characteristic, and acquires second phoneme piece data of the phoneme piece corresponding to a second value of the sound characteristic. The first phoneme piece data and the second phoneme piece data indicate a spectrum of each frame of the phoneme piece. The phoneme piece interpolator interpolates between each frame of the first phoneme piece data and each frame of the second phoneme piece data corresponding to each frame of the first phoneme piece data so as to create phoneme piece data of the phoneme piece corresponding to a target value of the sound characteristic which is different from the first value and the second value of the sound characteristic. A voice synthesizer generates a voice signal having the target value of the sound characteristic based on the created phoneme piece data.

    摘要翻译: 在语音合成装置中,音素片内插器部分获取与声音特性的第一值对应的音素片段的第一音素数据,并获取与声音特性的第二值对应的音素片段的第二音素片数据。 第一音素片数据和第二音素片数据表示音素片的每一帧的频谱。 音素片内插器在第一音素片数据的每一帧和对应于第一音素片数据的每个帧的第二音素片数据的每一帧之间插值,以便产生与目标值相对应的音素片段的音素片数据 与声音特性的第一值和第二值不同的声音特性。 语音合成器基于所创建的音素片段数据产生具有声音特征的目标值的语音信号。

    Graphical audio signal control
    4.
    发明公开
    Graphical audio signal control 审中-公开
    Grafische Audiosignalsteuerung

    公开(公告)号:EP2485218A2

    公开(公告)日:2012-08-08

    申请号:EP12154218.7

    申请日:2012-02-07

    摘要: Signal processing section (100) of a terminal converts acquired audio signals of a plurality of channels into frequency spectra set, calculates sound image positions corresponding to individual frequency components, and displays, on a display screen, the calculated sound image positions results by use of a coordinate system having coordinate axes of the frequency components and sound image positions. User-designated partial region of the coordinate system is set as a designated region and an amplitude-level adjusting amount is set for the designated region, so that the signal processing section adjusts amplitude levels of frequency components included in the frequency spectra and in the designated region, converts the adjusted frequency components into audio signals and outputs the converted audio signals.

    摘要翻译: 终端的信号处理部(100)将获取的多个频道的音频信号变换为频谱集,计算与各个频率成分对应的声像位置,并在显示画面上显示计算出的声像位置结果, 具有频率分量和声像位置的坐标轴的坐标系。 坐标系的用户指定的部分区域被设置为指定区域,并且为指定区域设置幅度电平调整量,使得信号处理部分调整频谱中包括的频率成分的幅度水平和指定的 将经调整的频率分量转换成音频信号并输出​​转换的音频信号。

    Voice converter with extraction and modification of attribute data
    5.
    发明公开
    Voice converter with extraction and modification of attribute data 审中-公开
    on on。。。。。。。。。。。。。。。。

    公开(公告)号:EP2450887A1

    公开(公告)日:2012-05-09

    申请号:EP12000670.5

    申请日:1999-06-07

    IPC分类号: G10L21/00

    摘要: An apparatus is constructed for converting an input voice signal into an output voice signal according to a target voice signal. In the apparatus, an input device provides the input voice signal composed of original sinusoidal components and original residual components other than the original sinusoidal components. An extracting device extracts original attribute data from at least the sinusoidal components of the input voice signal. The original attribute data is characteristic of the input voice signal. A synthesizing device synthesizes new attribute data based on both of the original attribute data derived from the input voice signal and target attribute data being characteristic of the target voice signal composed of target sinusoidal components and target residual components other than the sinusoidal components.; The target attribute data is derived from at least the target sinusoidal components. An output device operates based on the new attribute data and either of the original residual component and the target residual component for producing the output voice signal.

    摘要翻译: 一种装置,用于根据目标语音信号将输入语音信号转换为输出语音信号。 在该装置中,输入装置提供由原始正弦分量和除原始正弦分量之外的原始剩余分量组成的输入声音信号。 提取装置从输入语音信号的至少正弦分量中提取原始属性数据。 原始属性数据是输入语音信号的特征。 合成装置基于从输入语音信号导出的原始属性数据和由目标正弦分量和除了正弦分量之外的目标残差分量组成的目标语音信号的特征的目标属性数据,合成新的属性数据。 目标属性数据至少从目标正弦分量导出。 输出装置基于新的属性数据和原始剩余分量中的任一个和用于产生输出语音信号的目标剩余分量进行操作。

    Voice synthesizer of multi sounds
    6.
    发明公开
    Voice synthesizer of multi sounds 有权
    语音合成器与多个声音

    公开(公告)号:EP1688912A3

    公开(公告)日:2008-06-25

    申请号:EP06101138.3

    申请日:2006-02-01

    IPC分类号: G10L13/06 G10L21/00

    摘要: In a voice synthesizer, an envelope acquisition portion obtains a spectral envelope of a reference frequency spectrum of a given voice. A spectrum acquisition portion obtains a collective frequency spectrum of a plurality of voices which are generated in parallel to one another. An envelope adjustment portion adjusts a spectral envelope of the collective frequency spectrum obtained by the spectrum acquisition portion so as to approximately match with the spectral envelope of the reference frequency spectrum obtained by the envelope acquisition portion. A voice generation portion generates an output voice signal from the collective frequency spectrum having the spectral envelope adjusted by the envelope adjustment portion.

    Sound signal processing apparatus, sound signal processing method and sound signal processing program
    7.
    发明公开
    Sound signal processing apparatus, sound signal processing method and sound signal processing program 审中-公开
    声音信号处理装置,声音信号处理方法和声音信号处理程序

    公开(公告)号:EP1727123A1

    公开(公告)日:2006-11-29

    申请号:EP06114518.1

    申请日:2006-05-24

    IPC分类号: G10H1/36 G10L15/14

    摘要: A sound signal processing apparatus which is capable of correctly detecting expression modes and expression transitions of a song or performance from an input sound signal. A sound signal produced by performance or singing of musical tones is input and divided into frames of predetermined time periods. Characteristic parameters of the input sound signal are detected on a frame-by-frame basis. An expression determining process is carried out in which a plurality of expression modes of a performance or song are modeled as respective states, the probability that a section including a frame or a plurality of continuous frames lies in a specific state is calculated with respect to a predetermined observed section based on the characteristic parameters, and the optimum route of state transition in the predetermined observed section is determined based on the calculated probabilities so as to determine expression modes of the sound signal and lengths thereof.

    摘要翻译: 一种声音信号处理设备,其能够正确地检测来自输入声音信号的歌曲或演奏的表情模式和表情转换。 通过演奏或唱歌音乐音调产生的声音信号被输入并被分成预定时间段的帧。 输入声音信号的特征参数在逐帧的基础上被检测。 执行表达确定过程,其中表演或歌曲的多个表现模式被建模为相应的状态,包括一个帧或多个连续帧的部分处于特定状态的概率被计算为相对于 基于特征参数确定预定观察截面,并且基于计算出的概率确定预定观察截面中的最佳状态转换路线,以确定声音信号的表达模式及其长度。