METHOD OF SYNTHESIS FOR A STEADY SOUND SIGNAL
    2.
    发明公开
    METHOD OF SYNTHESIS FOR A STEADY SOUND SIGNAL 有权
    方法合成一个固定的声音信号

    公开(公告)号:EP1543497A1

    公开(公告)日:2005-06-22

    申请号:EP03797393.0

    申请日:2003-08-05

    发明人: GIGI, Ercan, F.

    IPC分类号: G10L13/06 G10L21/04

    CPC分类号: G10L13/07 G10L13/08 G10L21/01

    摘要: The present invention relates to a method of synthesizing a first sound signal based on a second sound signal, the first sound signal having a required first fundamental frequency and the second sound signal having a second fundamental frequency, the method comprising the steps of, a) determining of required pitch bell locations in the time domain of the first sound signal, the pitch bell locations being distanced by one period of the first fundamental frequency, b) providing of pitch bells by windowing the second sound signal on pitch bell locations in the time domain of the second sound signal, the pitch bell locations being distanced by one period of the second fundamental frequency, c) randomly selecting of a pitch bell from the provided pitch bells for each of the required pitch bell locations, d) performing an overlap and add operation on the selected pitch bells for synthesizing the first signal.

    AUTOMATED PERFORMANCE TECHNOLOGY USING AUDIO WAVEFORM DATA
    3.
    发明公开
    AUTOMATED PERFORMANCE TECHNOLOGY USING AUDIO WAVEFORM DATA 有权
    技术FÜRAUTOMATISIERTE LEISTUNG UNTER VERWENDUNG VON WELLENFORMDATEN

    公开(公告)号:EP2866223A4

    公开(公告)日:2015-12-30

    申请号:EP13809959

    申请日:2013-06-26

    申请人: YAMAHA CORP

    IPC分类号: G10H1/22 G10H1/00 G10H7/02

    摘要: In order to play waveform data back at a variable performance tempo by using waveform data which complies with a desired reference tempo, the present invention performs a timeline-expansion/contraction control on the waveform data to be played back, according to the relationship between the performance tempo and the reference tempo. The present invention also determines whether to limit the playback of the waveform data according to the relationship between the performance tempo and the reference tempo. In the case that playback is to be limited, the present invention stops playback of the waveform data, or reduces the resolution of playback processing and continues playback of the waveform data. The present invention stops playback of the waveform data when, for example, the relationship between the performance tempo and the reference tempo is a relationship in which the waveform data would be played back at a performance tempo which would cause a processing delay or a deterioration of sound quality. As a result, it is possible to preemptively prevent a system freeze and solve problems such as the generation of music which has a slower tempo than the desired performance tempo, or the generation of music which includes the intermittent cutting out of sound due to noise, or a significant reduction to sound quality.

    摘要翻译: 为了通过使用符合所需参考节奏的波形数据以可变的演奏速度播放波形数据,本发明根据所要求的参考节奏的关系对要重放的波形数据执行时间线扩展/收缩控制 演奏速度和参考节奏。 本发明还根据演奏速度与基准速度之间的关系确定是否限制波形数据的重放。 在限制播放的情况下,本发明停止对波形数据的重放,或降低重放处理的分辨率并继续重放波形数据。 本发明在例如性能速度和基准速度之间的关系是以将导致处理延迟或劣化的性能节奏播放波形数据的关系时停止对波形数据的重放 音质。 结果,可以预先防止系统冻结并且解决诸如产生具有比期望的性能节奏更慢的速度的音乐的问题,或者产生包括由于噪声而间断地切断声音的音乐, 或显着降低音质。

    Digital audio processing
    4.
    发明公开
    Digital audio processing 有权
    Digitale Audioverarbeitung

    公开(公告)号:EP2026331A1

    公开(公告)日:2009-02-18

    申请号:EP08162264.9

    申请日:2008-08-12

    IPC分类号: G10L21/04

    CPC分类号: G10L21/01

    摘要: The invention concerns digital audio processing and in particular the detection of periods where samples can be deleted or repeated unobtrusively so as to change the average sample-rate or to provide time delay modification. Differences between succeeding sample values are evaluated and compared with a threshold and samples are deleted or repeated where two or more consecutive sample value differences are less than the said threshold value.

    摘要翻译: 本发明涉及数字音频处理,特别是检测可以删除或重复采样的周期,以便改变平均采样率或提供时间延迟修改。 评估后续采样值之间的差异并与阈值进行比较,并且在两个或更多个连续采样值差小于所述阈值时删除或重复采样。

    Method and apparatus for reproducing speech signals, method and apparatus for decoding the speech, method and apparatus for synthesizing the speech and portable radio terminal apparatus
    5.
    发明授权

    公开(公告)号:EP0770987B1

    公开(公告)日:2003-01-22

    申请号:EP96307741.7

    申请日:1996-10-25

    申请人: SONY CORPORATION

    IPC分类号: G10L13/02 G10L21/04

    摘要: A method for reproducing speech signals at a controlled speed whereby rate conversion of the time axis may be facilitated, and a method for synthesizing the speech whereby pitch conversion can be realized by a simplified structure based on the encoded speech data without changing the phoneme. With the speech reproducing method, an encoding unit 2 discriminates whether an input speech signal is voiced or unvoiced. Based on the results of discrimination, the encoding unit 2 performs sinusoidal synthesis and encoding for a signal portion found to be voiced, while performing vector quantization by closed-loop search for an optimum vector for a portion found to be unvoiced using an analysis-by-synthesis method, in order to find encoded parameters. The decoding unit 3 compands the time axis of the encoded parameters obtained every pre-set frames at a period modification unit 4 for modifying the output period of the parameters for creating modified encoded parameters associated with different time points corresponding to the pre-set frames. A speech synthesis unit 6 synthesizes the voiced speech portion and the unvoiced speech portion based on the modified encoded parameters. With the speech synthesizing unit, an encoded bit stream or encoded data is outputted by an encoded data outputting unit 301., Of these data, at least pitch data and amplitude data of the spectral envelope are sent via a data conversion unit 302 to a waveform synthesis unit 302, where the number of amplitude data of the spectral envelope is changed without changing the shape of the spectral envelope depending on a pitch desired pitch value. A waveform synthesis unit 303 synthesizes the speech waveform based on the converted spectral envelope data and pitch data.

    Subband decoding allowing for high-speed reproducing
    6.
    发明公开
    Subband decoding allowing for high-speed reproducing 失效
    子带的解码,高速重放允许

    公开(公告)号:EP0763905A3

    公开(公告)日:2001-01-03

    申请号:EP96114544.8

    申请日:1996-09-11

    发明人: Miyasaka, Shuji

    IPC分类号: H04B1/66

    CPC分类号: G10L21/01 H04B1/667

    摘要: The reproducing apparatus of the invention reproduces a plurality of band signals which have been subjected to a band division and includes a time-scale modifier which receives the plurality of band signals and time-axis compresses the respective band signals at the same ratio, thereby outputting a plurality of time-axis compressed band signals and a synthesis filter bank for synthesizing the plurality of time-axis compressed band signals.

    TIME-WARPING FRAMES OF WIDEBAND VOCODER
    7.
    发明公开
    TIME-WARPING FRAMES OF WIDEBAND VOCODER 审中-公开
    时间翘曲的宽带VOCODER框架

    公开(公告)号:EP2059925A2

    公开(公告)日:2009-05-20

    申请号:EP07813815.3

    申请日:2007-08-06

    IPC分类号: G10L21/04 G10L19/14

    摘要: A method of communicating speech comprising time-warping a residual low band speech signal to an expanded or compressed version of the residual low band speech signal, time-warping a high band speech signal to an expanded or compressed version of the high band speech signal, and merging the time-warped low band and high band speech signals to give an entire time-warped speech signal. In the low band, the residual low band speech signal is synthesized after time-warping of the residual low band signal while in the high band, an unwarped high band signal is synthesized before time-warping of the high band speech signal. The method may further comprise classifying speech segments and encoding the speech segments. The encoding of the speech segments may be one of code-excited linear prediction, noise-excited linear prediction or 1/8 frame (silence) coding.

    摘要翻译: 一种传送语音的方法,包括将残余低频带语音信号时间扭曲到残余低频带语音信号的扩展或压缩版本,将高频带语音信号时间扭曲成高频带语音信号的扩展或压缩版本, 并合并时间扭曲的低频带和高频带语音信号以给出整个时间扭曲的语音信号。 在低频带中,在残余低频带信号的时间扭曲之后合成残余低频带语音信号,而在高频带中,在高频带语音信号的时间扭曲之前合成未扭曲的高频带信号。 该方法可以进一步包括对语音片段进行分类并对语音片段进行编码。 语音片段的编码可以是码激励线性预测,噪声激励线性预测或1/8帧(静音)编码中的一种。

    METHOD AND SYSTEM FOR ENABLING AUDIO SPEED CONVERSION
    8.
    发明公开
    METHOD AND SYSTEM FOR ENABLING AUDIO SPEED CONVERSION 有权
    方法和系统,用于使声音速度的转换

    公开(公告)号:EP1309965A1

    公开(公告)日:2003-05-14

    申请号:EP01945551.8

    申请日:2001-06-29

    IPC分类号: G10L21/04

    CPC分类号: G10L21/01

    摘要: The present invention provides a method and system for processing an audio signal. According to an exemplary method, an audio signal such as a digital voice signal is received and divided into one or more individual unit cycles. An audio speed conversion operation is enabled by repeating or removing one or more of the individual unit cycles. In particular, repeating one or more of the individual unit cycles decreases audio speed, and removing one or more of the individual unit cycles increases audio speed.

    Apparatus for processing audio signal
    9.
    发明公开
    Apparatus for processing audio signal 失效
    Vorrichtung zur Verarbeitung von Audiosignalen

    公开(公告)号:EP0907161A1

    公开(公告)日:1999-04-07

    申请号:EP97307248.1

    申请日:1997-09-18

    发明人: Shudo, Katsuyuki

    IPC分类号: G10L3/02

    CPC分类号: G10L21/01

    摘要: An apparatus for processing audio signals is provided with a memory for storing the audio signals. The audio signals are written in the memory at write addresses in the memory. The audio signals are read from the memory in accordance with reading addresses at a speed lower than a speed for writing the audio signals into the memory. It is determined whether an amount of audio signals stored in the memory and not yet read therefrom is increasing. The write addresses are then updated when the amount of audio signals not yet read is increasing. When small signals levels of which are lower than a reference level are detected among the audio signals, updating of the write addresses of the small signals may be halted.

    摘要翻译: 用于处理音频信号的装置设置有用于存储音频信号的存储器。 音频信号以存储器中的写入地址写入存储器。 音频信号根据读取地址以比将音频信号写入存储器的速度低的速度从存储器中读取。 确定存储在存储器中并且尚未从其读取的音频信号的量是否在增加。 然后当尚未读取的音频信号的量增加时,更新写入地址。 当音频信号中检测到低于参考电平的小信号电平时,可能会停止更新小信号的写入地址。

    Method and apparatus for reproducing speech signals, method and apparatus for decoding the speech, method and apparatus for synthesizing the speech and portable radio terminal apparatus
    10.
    发明公开

    公开(公告)号:EP0770987A3

    公开(公告)日:1998-07-29

    申请号:EP96307741

    申请日:1996-10-25

    申请人: SONY CORP

    摘要: A method for reproducing speech signals at a controlled speed whereby rate conversion of the time axis may be facilitated, and a method for synthesizing the speech whereby pitch conversion can be realized by a simplified structure based on the encoded speech data without changing the phoneme. With the speech reproducing method, an encoding unit 2 discriminates whether an input speech signal is voiced or unvoiced. Based on the results of discrimination, the encoding unit 2 performs sinusoidal synthesis and encoding for a signal portion found to be voiced, while performing vector quantization by closed-loop search for an optimum vector for a portion found to be unvoiced using an analysis-by-synthesis method, in order to find encoded parameters. The decoding unit 3 compands the time axis of the encoded parameters obtained every pre-set frames at a period modification unit 4 for modifying the output period of the parameters for creating modified encoded parameters associated with different time points corresponding to the pre-set frames. A speech synthesis unit 6 synthesizes the voiced speech portion and the unvoiced speech portion based on the modified encoded parameters. With the speech synthesizing unit, an encoded bit stream or encoded data is outputted by an encoded data outputting unit 301., Of these data, at least pitch data and amplitude data of the spectral envelope are sent via a data conversion unit 302 to a waveform synthesis unit 302, where the number of amplitude data of the spectral envelope is changed without changing the shape of the spectral envelope depending on a pitch desired pitch value. A waveform synthesis unit 303 synthesizes the speech waveform based on the converted spectral envelope data and pitch data.

    摘要翻译: 一种用于以受控的速度,由此在时间轴的速率转换可有利于再现语音信号的方法,以及用于合成语音,由此音调转换可通过基于所述编码的语音数据的简化的结构,而不改变音素来实现的方法。 与语音再现方法与编码单元2判断上是浊音或输入的语音信号不发声无论。 根据鉴别结果,编码单元2采用以执行用于发现为浊音的信号部分正弦合成和编码,而在最佳矢量用于通过闭环搜索执行矢量量化为发现了一个部分到清音分析逐 - 合成方法中,为了找到编码参数。 解码单元3个CompandS在时期修改单元4获得的每个预先设定的帧用于修改参数的输出周期,用于创建具有不同的时间点对应于预先设定的帧相关联的修改的已编码参数的编码参数的时间轴。 的语音合成单元6合成浊音部分以及基于所述修改的已编码参数清音部分。 与语音合成单元到编码比特流或编码的数据的通过在编码数据输出开始单元301输出,论文数据中,至少,音调数据和谱包络的幅度数据通过数据转换单元302发送到波形 合成单元302,在那里频谱包络线的振幅数据的数目而不改变谱包络取决于间距期望间距值的形状改变。 波形合成单元303合成基于转换后的频谱包络数据和音调数据的语音波形。