Abstract:
An audio signal compression apparatus for compressively coding an input audio signal comprises a time-to-frequency transformation unit for transforming the input audio signal to a frequency domain signal; a spectrum envelope calculation unit for calculating a spectrum envelope having different resolutions for different frequencies, from the input audio signal, using a weighting function on frequency based on human auditory characteristics; a normalization unit for normalizing the frequency domain signal using the spectrum envelope to obtain a residual signal; a power normalization unit for normalizing the residual signal by the power; an auditory weighting calculation unit for calculating weighting coefficients on frequency, based on the spectrum of the input audio signal and human auditory characteristics; and a multi-stage quantization device having plural stages of vector quantizers connected in series, to which the normalized residual signal is input, and at least one of the vector quantizers quantizing the residual signal using the weighting coefficients. Therefore, a low frequency band, which is auditively important, can be analyzed with a higher frequency resolution as compared with a high frequency band, whereby efficient signal compression utilizing human auditory characteristics is realized.
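The series-connected quantizer stages described above can be illustrated with a minimal sketch. It assumes a weighted squared-error distortion measure and tiny hand-written codebooks; the function names `weighted_nearest` and `multistage_vq` are hypothetical, and the abstract does not specify the distortion measure or the codebook design.

```python
def weighted_nearest(target, codebook, weights):
    """Return the index of the codevector with the smallest weighted squared error."""
    best_index, best_error = 0, float("inf")
    for index, codevector in enumerate(codebook):
        # Assumed distortion: sum of w * (target - codevector)^2 per coefficient.
        error = sum(w * (t - c) ** 2 for w, t, c in zip(weights, target, codevector))
        if error < best_error:
            best_index, best_error = index, error
    return best_index


def multistage_vq(residual, codebooks, weights):
    """Quantize `residual` with vector quantizers connected in series.

    Each stage quantizes the error left by the previous stage.
    Returns the chosen index per stage and the final quantization error.
    """
    indices = []
    remaining = list(residual)
    for codebook in codebooks:
        index = weighted_nearest(remaining, codebook, weights)
        indices.append(index)
        remaining = [r - c for r, c in zip(remaining, codebook[index])]
    return indices, remaining


# Hypothetical two-stage example with two-dimensional vectors.
codebooks = [
    [[1.0, 0.0], [0.0, -1.0]],   # stage 1
    [[0.0, -0.5], [0.0, 0.5]],   # stage 2
]
weights = [2.0, 1.0]             # larger weight = perceptually more important
indices, error = multistage_vq([1.0, -0.5], codebooks, weights)
```

In this sketch the weights only bias the codevector search; the decoder would rebuild the vector by summing the selected codevectors and needs no knowledge of the weights.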
Abstract:
A temporal processing apparatus includes: a splitter splitting an audio signal in the sub-band domain into diffuse signals indicating reverberating components and direct signals indicating non-reverberating components; a downmix unit generating a downmix signal by downmixing the direct signals; BPFs respectively generating a bandpass downmix signal and bandpass diffuse signals; normalization processing units respectively generating a normalized downmix signal and normalized diffuse signals; a scale computation processing unit computing, on a predetermined time slot basis, a scale factor indicating the magnitude of energy of the normalized downmix signal with respect to energy of the normalized diffuse signals; a calculating unit generating scale diffuse signals; an HPF generating high-pass diffuse signals; an adding unit generating addition signals; and a synthesis filter bank performing synthesis filter processing on the addition signals and transforming the addition signals into the time domain.
Abstract:
A temporal processing apparatus (energy shaping apparatus) (600a) includes: a splitter (601) splitting an audio signal in the sub-band domain, obtained through a hybrid time and frequency transformation, into diffuse signals indicating reverberating components and direct signals indicating non-reverberating components; a downmix unit (604) generating a downmix signal by downmixing the direct signals; BPFs (605 and 606) respectively generating a bandpass downmix signal and bandpass diffuse signals by performing bandpass processing, sub-band by sub-band, on the downmix signal and the diffuse signals, which are split on a sub-band basis; normalization processing units (607 and 608) respectively generating a normalized downmix signal and normalized diffuse signals by normalizing the bandpass downmix signal and the bandpass diffuse signals with regard to their respective energy; a scale computation processing unit (609) computing, on a predetermined time slot basis, a scale factor indicating the magnitude of energy of the normalized downmix signal with respect to energy of the normalized diffuse signals; a calculating unit (611) generating scale diffuse signals by multiplying the normalized diffuse signals by the scale factor; an HPF (612) generating high-pass diffuse signals by performing high-pass processing on the scale diffuse signals; an adding unit (613) generating addition signals by adding the high-pass diffuse signals and the direct signals; and a synthesis filter bank (614) performing synthesis filter processing on the addition signals and transforming the addition signals into the time domain.
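The per-time-slot scale factor and its application to the diffuse signal might be sketched as follows. This is an illustration only: it assumes the scale factor is the square root of the energy ratio between the downmix and the diffuse signal within each slot, and it leaves out the bandpass, normalization, and high-pass stages. The names `slot_scale_factors`, `apply_scales`, and the `eps` guard are hypothetical.

```python
import math


def slot_scale_factors(downmix, diffuse, slot_len, eps=1e-12):
    """Compute one scale factor per time slot.

    Assumption: scale = sqrt(E_downmix / E_diffuse) over the slot, so that
    the scaled diffuse signal follows the downmix energy envelope.
    `eps` avoids division by zero for silent slots.
    """
    scales = []
    for start in range(0, len(downmix), slot_len):
        e_down = sum(x * x for x in downmix[start:start + slot_len])
        e_diff = sum(x * x for x in diffuse[start:start + slot_len])
        scales.append(math.sqrt(e_down / (e_diff + eps)))
    return scales


def apply_scales(diffuse, scales, slot_len):
    """Multiply each diffuse sample by the scale factor of its time slot."""
    return [x * scales[i // slot_len] for i, x in enumerate(diffuse)]


# Hypothetical signals: two slots of two samples each.
downmix = [2.0, 2.0, 0.0, 0.0]
diffuse = [1.0, 1.0, 1.0, 1.0]
scales = slot_scale_factors(downmix, diffuse, slot_len=2)
shaped = apply_scales(diffuse, scales, slot_len=2)
```

Here the first slot of the diffuse signal is boosted and the second slot is muted, mirroring the temporal energy envelope of the downmix.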
Abstract:
An encoding device (200) includes an MDCT unit (202) that transforms an input signal in a time domain into a frequency spectrum including a lower frequency spectrum, a BWE encoding unit (204) that generates extension data which specifies a higher frequency spectrum at a higher frequency than the lower frequency spectrum, and an encoded data stream generating unit (205) that encodes to output the lower frequency spectrum obtained by the MDCT unit (202) and the extension data obtained by the BWE encoding unit (204). The BWE encoding unit (204) generates as the extension data (i) a first parameter which specifies a lower subband which is to be copied as the higher frequency spectrum from among a plurality of the lower subbands which form the lower frequency spectrum obtained by the MDCT unit (202) and (ii) a second parameter which specifies a gain of the lower subband after being copied.
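One way to picture the two extension parameters is the sketch below: for a target higher-frequency subband, it searches the lower subbands for the one to copy and computes the gain to apply after copying. The selection criterion (least-squares gain with minimum squared error) is an assumption made for illustration, since the abstract does not say how the parameters are chosen. The names `choose_copy_source` and `decode_high_band` are hypothetical.

```python
def choose_copy_source(high_band, low_subbands):
    """Return (subband index, gain) describing `high_band` as a scaled copy.

    Assumption: for each candidate lower subband, use the least-squares gain
    and keep the candidate with the smallest squared error.
    """
    best = None
    for index, low in enumerate(low_subbands):
        energy = sum(x * x for x in low)
        if energy == 0.0:
            continue  # a silent subband cannot be scaled to match anything
        gain = sum(h * l for h, l in zip(high_band, low)) / energy
        error = sum((h - gain * l) ** 2 for h, l in zip(high_band, low))
        if best is None or error < best[2]:
            best = (index, gain, error)
    if best is None:
        return 0, 0.0  # all candidates silent: copy anything with zero gain
    return best[0], best[1]


def decode_high_band(low_subbands, index, gain):
    """Decoder side: copy the indicated lower subband and apply the gain."""
    return [gain * x for x in low_subbands[index]]


# Hypothetical spectra: two lower subbands of two coefficients each.
low_subbands = [[1.0, 0.0], [1.0, 1.0]]
index, gain = choose_copy_source([0.5, 0.5], low_subbands)
rebuilt = decode_high_band(low_subbands, index, gain)
```

Only the index and the gain would need to be transmitted per higher subband in this sketch, which is far less data than the higher-frequency coefficients themselves.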
Abstract:
An encoding device (100) includes (i) a first encoding unit (132) that encodes spectral data in the lower frequency band represented by a plurality of parameters, out of the spectral data obtained by transforming an audio signal inputted for a fixed time length, (ii) a second quantizing unit (133) that generates sub information representing characteristics of the spectral data in the higher frequency band by fewer parameters than those for the lower frequency band, out of the spectral data obtained by the transformation, (iii) a second encoding unit (134) that encodes the generated sub information, and (iv) a stream output unit (140) that outputs the data encoded by the first encoding unit (132) and the data encoded by the second encoding unit (134).
Abstract:
An encoding device (200) comprises a band dividing unit (201) that divides an input signal (207) into a low frequency signal (208) representing a signal in the lower frequency band and a high frequency signal (209) representing a signal in the higher frequency band, a lower frequency band encoding unit (202) that encodes the low frequency signal (208) and generates a low frequency code (213), a similarity judging unit (203) that judges similarity between the high frequency signal (209) and the low frequency signal (208) and generates switching information (210), “n” higher frequency band encoding units (205) that encode the high frequency signal (209) through respective encoding methods and generate a high frequency code (212), a switching unit (204) that selects one of the higher frequency band encoding units (205) and has the selected higher frequency band encoding unit (205) perform encoding, and a code multiplexing unit (206) that multiplexes the low frequency code (213), the high frequency code (212) and the switching information (210), and generates an output code (214).
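The similarity judgment and encoder switching could look roughly like the sketch below. It assumes cosine similarity between the two band signals as the similarity measure and a list of ascending thresholds that maps the similarity to one of the "n" encoders; neither choice comes from the abstract. The names `cosine_similarity` and `select_encoder` are hypothetical.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity of two equal-length vectors (0.0 if either is silent)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm > 0.0 else 0.0


def select_encoder(low, high, thresholds):
    """Return switching information: an encoder index in 0..len(thresholds).

    Assumption: `thresholds` is ascending, and a higher similarity selects a
    higher-numbered encoder (for example, one that reuses more of the low band).
    """
    similarity = cosine_similarity(low, high)
    return sum(1 for t in thresholds if similarity >= t)


# Hypothetical case with n = 3 encoders, hence two thresholds.
thresholds = [0.3, 0.7]
similar = select_encoder([1.0, 0.0], [2.0, 0.0], thresholds)
dissimilar = select_encoder([1.0, 0.0], [0.0, 1.0], thresholds)
```

The returned index plays the role of the switching information: the decoder would read it from the multiplexed stream to pick the matching higher frequency band decoder.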