VOICE SYNTHESIS
    3.
    发明公开
    VOICE SYNTHESIS 有权
    SPRACHSYNTHESE

    公开(公告)号:EP3065130A1

    公开(公告)日:2016-09-07

    申请号:EP16158430.5

    申请日:2016-03-03

    IPC分类号: G10L13/033

    摘要: A voice synthesis method for generating a voice signal through connection of a phonetic piece extracted from a reference voice, includes selecting, by a piece selection unit, the phonetic piece sequentially; setting, by a pitch setting unit, a pitch transition in which a fluctuation of an observed pitch of the phonetic piece is reflected based on a degree corresponding to a difference value between a reference pitch being a reference of sound generation of the reference voice and the observed pitch of the phonetic piece selected by the piece selection unit; and generating, by a voice synthesis unit, the voice signal by adjusting a pitch of the phonetic piece selected by the piece selection unit based on the pitch transition generated by the pitch setting unit.

    摘要翻译: 一种语音合成方法,用于通过连接从参考语音提取的语音片段来生成语音信号,包括由片选择单元依次选择语音片段; 通过音高设定单元,基于对应于作为参考语音的声音产生的参考音调的参考音调之间的差值的程度来反映音标的观察音高的波动的音调转变, 由片选择单元选择的语音片的观察间距; 以及由语音合成单元通过基于音调设置单元产生的音调转换来调整由片选择单元选择的语音片段的节距来产生语音信号。

    VOICE ANALYSIS METHOD AND DEVICE, VOICE SYNTHESIS METHOD AND DEVICE AND MEDIUM STORING VOICE ANALYSIS PROGRAM
    4.
    发明公开
    VOICE ANALYSIS METHOD AND DEVICE, VOICE SYNTHESIS METHOD AND DEVICE AND MEDIUM STORING VOICE ANALYSIS PROGRAM 有权
    语音分析方法和装置,语音合成方法和设备以及介质其上具有语音分析程序存储

    公开(公告)号:EP2983168A1

    公开(公告)日:2016-02-10

    申请号:EP15185624.2

    申请日:2014-08-07

    发明人: TACHIBANA, Makoto

    摘要: A voice analysis method comprises generating a time series of a relative pitch (R), which is a difference between a pitch (PB) generated from music track data (XB) designating respective notes of a music track in time series, and a pitch (PA) of a reference voice. The music track is divided into unit sections (UA) of a predetermined duration, and singing characteristics data (Z) is generated, which includes, for each of a plurality of statuses (St) of a model (M), classification information for classifying the unit sections (UA) into a plurality of sets and variable information defining a probability distribution of the time series of the relative pitch (R) within each of the classified unit sections (UA). The classification information is generated based on a condition relating to an attribute of the note and based on the condition relating to an attribute of the each of the unit sections (UA).

    摘要翻译: 一种语音分析方法包括产生一个时间序列的相对间距(R),所有这些是从音乐轨道数据(XB)指定在时间序列上一个音乐曲目的respectivement音符产生的音调(PB)之间的差,并且一个间距( PA)的基准语音的。 音乐曲目被划分成预定的持续时间的单位区间(UA),和歌唱特征数据(Z)被产生,它包括,对于每个状态的多个A模型的(ST)(M),分类信息的用于分类 单位区间(UA)成组和多个可变信息定义所述时间序列内的每个所分类的单位区间(UA)的相对间距(R)中的概率分布。 基于与属性的说明和基于与属性到每个单元(UA)的各部分的状况的条件,产生的分类信息。

    CODING, MODIFICATION AND SYNTHESIS OF SPEECH SEGMENTS
    5.
    发明授权
    CODING, MODIFICATION AND SYNTHESIS OF SPEECH SEGMENTS 有权
    加密,修改和语段合成

    公开(公告)号:EP2517197B1

    公开(公告)日:2014-12-17

    申请号:EP10801161.0

    申请日:2010-12-21

    申请人: Telefónica, S.A.

    摘要: The invention relates to a method for speech signal analysis, modification and synthesis comprising a phase for the location of analysis windows by means of an iterative process for the determination of the phase of the first sinusoidal component and comparison between the phase value of said component and a predetermined value, a phase for the selection of analysis frames corresponding to an allophone and readjustment of the duration and the fundamental frequency according to certain thresholds and a phase for the generation of synthetic speech from synthesis frames taking the information of the closest analysis frame as spectral information of the synthesis frame and taking as many synthesis frames as periods that the synthetic signal has. The method allows a coherent location of the analysis windows within the periods of the signal and the exact generation of the synthesis instants in a manner synchronous with the fundamental period.

    Voice conversion device and method
    7.
    发明公开
    Voice conversion device and method 有权
    装置和方法用于语音转换

    公开(公告)号:EP2431967A3

    公开(公告)日:2013-10-23

    申请号:EP11181174.1

    申请日:2011-09-14

    IPC分类号: G10L13/02 G10L21/00 G10L13/06

    摘要: In voice processing, a first distribution generation unit approximates a distribution of feature information representative of voice of a first speaker per a unit interval thereof as a mixed probability distribution which is a mixture of a plurality of first probability distributions corresponding to a plurality of different phones. A second distribution generation unit also approximates a distribution of feature information representative of voice of a second speaker as a mixed probability distribution which is a mixture of a plurality of second probability distributions. A function generation unit generates, for each phone, a conversion function for converting the feature information of voice of the first speaker to that of the second speaker based on respective statistics of the first and second probability distributions that correspond to the phone.

    SYSTEM AND METHOD FOR HYBRID SPEECH SYNTHESIS
    9.
    发明授权
    SYSTEM AND METHOD FOR HYBRID SPEECH SYNTHESIS 有权
    系统和方法混合语音合成

    公开(公告)号:EP2140447B1

    公开(公告)日:2010-12-01

    申请号:EP08742827.2

    申请日:2008-04-14

    申请人: Novaspeech LLC

    IPC分类号: G10L13/02 G10L13/06

    摘要: A speech synthesis system receives symbolic input describing an utterance to be synthesized. In one embodiment, different portions of the utterance are constructed from different sources, one of which is a speech corpus recorded from a human speaker whose voice is to be modeled. The other sources may include other human speech corpora or speech produced using Rule-Based Speech Synthesis (RBSS). At least some portions of the utterance may be constructed by modifying prototype speech units to produce adapted speech units that are contextually appropriate for the utterance. The system concatenates the adapted speech units with the other speech units to produce a speech waveform. In another embodiment, a speech unit of a speech corpus recorded from a human speaker lacks transitions at one or both of its edges. A transition is synthesized using RBSS and concatenated with the speech unit in producing a speech waveform for the utterance.