摘要:
The present invention relates to a method and system for audio encoding and decoding and a method for estimating a noise level, and the method for estimating a noise level in the present invention comprises: estimating a power spectrum of an audio signal to be encoded according to a frequency domain coefficient of the audio signal to be encoded; and estimating a noise level of a zero bit encoding subband audio signal according to the power spectrum obtained by calculating, and this noise level for controlling an energy proportion of noise filling to spectral band replication during decoding; wherein a zero bit encoding subband refers to an encoding subband of which allocated bit number is zero. The present invention can well reconstruct the uncoded frequency domain coefficients.
摘要:
Hierarchical audio coding and decoding method and system and hierarchical audio coding and decoding method for transient signals are provided. In the present invention, by introducing a processing method for transient signal frames in the hierarchical audio coding and decoding methods, a segmented time-frequency transform is performed on the transient signal frames, and then the frequency-domain coefficients obtained by transformation are rearranged respectively within the core layer and within the extended layer, so as to perform the same subsequent coding processes, such as bit allocation, frequency-domain coefficient coding, etc., as those on the steady-state signal frames, thus enhancing the coding efficiency of the transient signal frames and improving the quality of the hierarchical audio coding and decoding.
摘要:
The present invention relates to a method and device for spectral band replication, and a method and system for audio decoding, and the method for spectral band replication comprises: A. searching for the position of a certain tone of an audio signal in MDCT frequency domain coefficients; B. according to the tone position, determining a spectral band replication period which is a bandwidth from a 0 frequency point to a frequency point of tone position, and a source frequency segment which is a frequency segment from a frequency point of the 0 frequency point shifting copyband_offset frequency points backwards to a frequency point of the frequency point of the tone position shifting the copyband_offset frequency points backwards, wherein said offset copyband_offset is greater than or equal to 0; and C. according to the spectral band replication period, carrying out spectral band replication on zero bit encoding subbands.
摘要:
Hierarchical audio coding and decoding method and system and hierarchical audio coding and decoding method for transient signals are provided. In the present invention, by introducing a processing method for transient signal frames in the hierarchical audio coding and decoding methods, a segmented time-frequency transform is performed on the transient signal frames, and then the frequency-domain coefficients obtained by transformation are rearranged respectively within the core layer and within the extended layer, so as to perform the same subsequent coding processes, such as bit allocation, frequency-domain coefficient coding, etc., as those on the steady-state signal frames, thus enhancing the coding efficiency of the transient signal frames and improving the quality of the hierarchical audio coding and decoding.
摘要:
The present invention relates to a method and system for audio encoding and decoding and a method for estimating a noise level, and the method for estimating a noise level in the present invention comprises: estimating a power spectrum of an audio signal to be encoded according to a frequency domain coefficient of the audio signal to be encoded; and estimating a noise level of a zero bit encoding subband audio signal according to the power spectrum obtained by calculating, and this noise level for controlling an energy proportion of noise filling to spectral band replication during decoding; wherein a zero bit encoding subband refers to an encoding subband of which allocated bit number is zero. The present invention can well reconstruct the uncoded frequency domain coefficients.
摘要:
The audio coding method and system of lattice vector quantization is provided in the invention. The method comprises: dividing frequency domain coefficients of an audio signal for which a modified discrete cosine transform (MDCT) has been performed into a plurality of coding sub-bands, and quantizing and coding an amplitude envelope value of each coding sub-band to obtain coded bits of amplitude envelopes; performing bit allocation on each coding sub-band, and performing normalization, quantization and coding respectively on vectors in a low bit coding sub-band with pyramid lattice vector quantization and on vectors in a high bit coding sub-band with sphere lattice vector quantization to obtain coded bits of the frequency domain coefficients; multiplexing and packing the coded bits of the amplitude envelope and the coded bits of the frequency domain coefficients of each coding sub-band, then sending them to a decoding side.
摘要:
The invention provides a compensation method for audio frame loss in a MDCT domain, the method comprising: when a frame currently lost is a Pth frame, obtaining a set of frequencies to be predicted, and for each frequency in the set, using phases and amplitudes of a plurality of frames before a (P−1)th frame in a MDCT-MDST domain to predict a phase and an amplitude of the Pth frame, and using the predicted phase and amplitude to obtain a MDCT coefficient of the Pth frame at each corresponding frequency; for a frequency outside the set, using MDCT coefficients of a plurality of frames before the Pth frame to calculate a MDCT coefficient value of the Pth frame at the frequency; performing an IMDCT for the MDCT coefficients of the Pth frame to obtain a time domain signal of the Pth frame.
摘要:
The invention provides a compensation method for audio frame loss in a MDCT domain, the method comprising: when a frame currently lost is a Pth frame, obtaining a set of frequencies to be predicted, and for each frequency in the set, using phases and amplitudes of a plurality of frames before a (P−1)th frame in a MDCT-MDST domain to predict a phase and an amplitude of the Pth frame, and using the predicted phase and amplitude to obtain a MDCT coefficient of the Pth frame at each corresponding frequency; for a frequency outside the set, using MDCT coefficients of a plurality of frames before the Pth frame to calculate a MDCT coefficient value of the Pth frame at the frequency; performing an IMDCT for the MDCT coefficients of the Pth frame to obtain a time domain signal of the Pth frame.
摘要:
A hierarchical audio coding, decoding method and system are provided. The method includes dividing frequency domain coefficients of an audio signal after MDCT into a plurality of coding sub-bands, quantizing and coding amplitude envelope values of coding sub-bands; allocating bits to each coding sub-band of the core layer, quantizing and coding core layer frequency domain coefficients to obtain coded bits of core layer frequency domain coefficients; calculating the amplitude envelope value of each coding sub-band of the core layer residual signal; allocating bits to each coding sub-band of the extended layer, quantizing and coding the extended layer coding signal to obtain coded bits of the extended layer coding signal; multiplexing and packing amplitude value envelope coded bits of each coding sub-band composed by core layer and extended layer frequency domain coefficients, core layer frequency coefficients coded bits, and extended layer coding signal coded bits, then transmitting to the decoding end.
摘要:
A hierarchical audio coding, decoding method and system are provided. The method includes dividing frequency domain coefficients of an audio signal after MDCT into a plurality of coding sub-bands, quantizing and coding amplitude envelope values of coding sub-bands; allocating bits to each coding sub-band of the core layer, quantizing and coding core layer frequency domain coefficients to obtain coded bits of core layer frequency domain coefficients; calculating the amplitude envelope value of each coding sub-band of the core layer residual signal; allocating bits to each coding sub-band of the extended layer, quantizing and coding the extended layer coding signal to obtain coded bits of the extended layer coding signal; multiplexing and packing amplitude value envelope coded bits of each coding sub-band composed by core layer and extended layer frequency domain coefficients, core layer frequency coefficients coded bits, and extended layer coding signal coded bits, then transmitting to the decoding end.