Automated performance technology using audio waveform data

    公开(公告)号:US09613635B2

    公开(公告)日:2017-04-04

    申请号:US14411094

    申请日:2013-06-26

    Abstract: In order to play waveform data back at a variable performance tempo by using waveform data which complies with a desired reference tempo, the present invention performs a timeline-expansion/contraction control on the waveform data to be played back, according to the relationship between the performance tempo and the reference tempo. The present invention also determines whether to limit the playback of the waveform data according to the relationship between the performance tempo and the reference tempo. In the case that playback is to be limited, the present invention stops playback of the waveform data, or reduces the resolution of playback processing and continues playback of the waveform data. The present invention stops playback of the waveform data when, for example, the relationship between the performance tempo and the reference tempo is a relationship in which the waveform data would be played back at a performance tempo which would cause a processing delay or a deterioration of sound quality. As a result, it is possible to preemptively prevent a system freeze and solve problems such as the generation of music which has a slower tempo than the desired performance tempo, or the generation of music which includes the intermittent cutting out of sound due to noise, or a significant reduction to sound quality.

    Coherent pitch and intensity modification of speech signals

    公开(公告)号:US09922661B2

    公开(公告)日:2018-03-20

    申请号:US15378100

    申请日:2016-12-14

    Inventor: Alexander Sorin

    Abstract: A method comprising: receiving an utterance, an original pitch contour of the utterance, and a target pitch contour for the utterance, wherein the utterance comprises a plurality of consecutive frames, and wherein at least one of said frames is a voiced frame; calculating an original intensity contour of said utterance; generating a pitch modified utterance based on the target pitch contour; calculating an intensity modification factor for each of said frames, based on said original pitch contour and on said target pitch contour, to produce a sequence of intensity modification factors corresponding to said plurality of consecutive frames; calculating a final intensity contour for said utterance by applying said intensity modification factors to said original intensity contour; and generating a coherently modified speech signal by time dependent scaling of the intensity of said pitch modified utterance according to said final intensity contour.

    Pitch marking in speech processing

    公开(公告)号:US09685170B2

    公开(公告)日:2017-06-20

    申请号:US14918601

    申请日:2015-10-21

    Inventor: Slava Shechtman

    CPC classification number: G10L21/01 G10L21/013 G10L25/06 G10L25/90

    Abstract: According to some embodiments of the present invention, there is provided a computerized method for selecting and correcting pitch marks in speech processing and modification. The method comprises an action of receiving a continuous speech signal representing audible speech recorded by a microphone, where a sequence of pitch values and two or more pitch mark temporal values are computed from the continuous speech signal. The method comprises an action of computing for each of the pitch mark temporal values a lower limit temporal value and an upper limit temporal value by a cross-correlation function of the continuous speech signal around the pitch mark temporal values associated with pairs of elements in the sequence and replacing one or more of the pitch mark temporal values with one or more new temporal value between the lower limit temporal value and the upper limit temporal value.

    PITCH MARKING IN SPEECH PROCESSING

    公开(公告)号:US20170117001A1

    公开(公告)日:2017-04-27

    申请号:US14918601

    申请日:2015-10-21

    Inventor: Slava Shechtman

    CPC classification number: G10L21/01 G10L21/013 G10L25/06 G10L25/90

    Abstract: According to some embodiments of the present invention, there is provided a computerized method for selecting and correcting pitch marks in speech processing and modification. The method comprises an action of receiving a continuous speech signal representing audible speech recorded by a microphone, where a sequence of pitch values and two or more pitch mark temporal values are computed from the continuous speech signal. The method comprises an action of computing for each of the pitch mark temporal values a lower limit temporal value and an upper limit temporal value by a cross-correlation function of the continuous speech signal around the pitch mark temporal values associated with pairs of elements in the sequence and replacing one or more of the pitch mark temporal values with one or more new temporal value between the lower limit temporal value and the upper limit temporal value.

    Clock compensation techniques for audio decoding
    9.
    发明授权
    Clock compensation techniques for audio decoding 有权
    音频解码的时钟补偿技术

    公开(公告)号:US09420332B2

    公开(公告)日:2016-08-16

    申请号:US11691688

    申请日:2007-03-27

    Abstract: This disclosure describes audio decoding techniques for decoding audio information that needs to be properly clocked. In accordance with this disclosure, the number of audio samples in decoded audio output can be adjusted to compensate for an estimated error the in decoder clock. That is to say, rather than adjust the decoder clock to synchronize the decoder clock to the encoder clock, this disclosure proposes adding or removing audio samples from the decoded audio output in order to ensure that the decoded audio output is properly timed. In this way, the techniques of this disclosure can eliminate the need for an adjustable or controllable clock at the decoding device, which can save cost and/or allow legacy devices that do not include an adjustable or controllable clock to decode and output audio information that needs to be properly clocked.

    Abstract translation: 本公开描述了用于解码需要正确计时的音频信息的音频解码技术。 根据本公开,可以调整解码音频输出中的音频样本的数量以补偿解码器时钟中的估计误差。 也就是说,而不是调整解码器时钟以将解码器时钟同步到编码器时钟,本公开提议从解码的音频输出中添加或去除音频样本,以便确保解码的音频输出被正确定时。 以这种方式,本公开的技术可以消除在解码设备处对可调节或可控制的时钟的需要,这可以节省成本和/或允许不包括可调节或可控时钟的传统设备来解码和输出音频信息, 需要正确计时。

    VOICE SIGNAL PROCESSING APPARATUS AND VOICE SIGNAL PROCESSING METHOD
    10.
    发明申请
    VOICE SIGNAL PROCESSING APPARATUS AND VOICE SIGNAL PROCESSING METHOD 审中-公开
    语音信号处理设备和语音信号处理方法

    公开(公告)号:US20160217805A1

    公开(公告)日:2016-07-28

    申请号:US14736289

    申请日:2015-06-11

    Abstract: A voice signal processing apparatus and a voice signal processing method are provided. Calculate a value of an interpolation parametric function corresponding to a sampling signal frame according to three consecutive sample values in the sampling signal frame, and calculate an interpolated value between two adjacent sampling points in a frequency-lowered signal frame according to the value of the interpolation parametric function.

    Abstract translation: 提供语音信号处理装置和语音信号处理方法。 根据采样信号帧中的三个连续采样值,计算与采样信号帧对应的插值参数函数的值,并根据插补值计算降频信号帧中两个相邻采样点之间的内插值 参数函数。

Patent Agency Ranking