摘要:
Embodiments are directed to a companding method and system for reducing coding noise in an audio codec. A compression process reduces an original dynamic range of an initial audio signal through a compression process that divides the initial audio signal into a plurality of segments using a defined window shape, calculates a wideband gain in the frequency domain using a non-energy based average of frequency domain samples of the initial audio signal, and applies individual gain values to amplify segments of relatively low intensity and attenuate segments of relatively high intensity. The compressed audio signal is then expanded back to substantially the original dynamic range that applies inverse gain values to amplify segments of relatively high intensity and attenuate segments of relatively low intensity. A QMF filterbank is used to analyze the initial audio signal to obtain a frequency domain representation.
摘要:
Embodiments are directed to a companding method and system for reducing coding noise in an audio codec. A compression process reduces an original dynamic range of an initial audio signal through a compression process that divides the initial audio signal into a plurality of segments using a defined window shape, calculates a wideband gain in the frequency domain using a non-energy based average of frequency domain samples of the initial audio signal, and applies individual gain values to amplify segments of relatively low intensity and attenuate segments of relatively high intensity. The compressed audio signal is then expanded back to substantially the original dynamic range that applies inverse gain values to amplify segments of relatively high intensity and attenuate segments of relatively low intensity. A QMF filterbank is used to analyze the initial audio signal to obtain a frequency domain representation.
摘要:
Embodiments are directed to a companding method and system for reducing coding noise in an audio codec. A compression process reduces an original dynamic range of an initial audio signal through a compression process that divides the initial audio signal into a plurality of segments using a defined window shape, calculates a wideband gain in the frequency domain using a non-energy based average of frequency domain samples of the initial audio signal, and applies individual gain values to amplify segments of relatively low intensity and attenuate segments of relatively high intensity. The compressed audio signal is then expanded back to the substantially the original dynamic range that applies inverse gain values to amplify segments of relatively high intensity and attenuating segments of relatively low intensity. A QMF filterbank is used to analyze the initial audio signal to obtain a frequency domain representation.
摘要:
The present invention relates to audio coding systems which make use of a harmonic transposition method for high frequency reconstruction (HFR). A system and a method for generating a high frequency component of a signal from a low frequency component of the signal is described. The system comprises an analysis filter bank providing a plurality of analysis subband signals of the low frequency component of the signal. It also comprises a non-linear processing unit to generate a synthesis subband signal with a synthesis frequency by modifying the phase of a first and a second of the plurality of analysis subband signals and by combining the phase-modified analysis subband signals. Finally, it comprises a synthesis filter bank for generating the high frequency component of the signal from the synthesis subband signal.
摘要:
The present invention relates to audio coding systems which make use of a harmonic transposition method for high frequency reconstruction (HFR). A system and a method for generating a high frequency component of a signal from a low frequency component of the signal is described. The system comprises an analysis filter bank providing a plurality of analysis subband signals of the low frequency component of the signal. It also comprises a non-linear processing unit to generate a synthesis subband signal with a synthesis frequency by modifying the phase of a first and a second of the plurality of analysis subband signals and by combining the phase-modified analysis subband signals. Finally, it comprises a synthesis filter bank for generating the high frequency component of the signal from the synthesis subband signal.
摘要:
The present document relates an audio encoding and decoding system (referred to as an audio codec system). In particular, the present document relates to a transform-based audio codec system which is particularly well suited for voice encoding/decoding. A transform-based speech encoder (100, 170) configured to encode a speech signal into a bitstream is described. The encoder (100, 170) comprises a framing unit (101) configured to receive a set (132, 332) of blocks; wherein the set (132, 332) of blocks comprises a plurality of sequential blocks (131) of transform coefficients; wherein the plurality of blocks (131) is indicative of samples of the speech signal; wherein a block (131) of transform coefficients comprises a plurality of transform coefficients for a corresponding plurality of frequency bins (301). Furthermore, the encoder (100, 170) comprises an envelope estimation unit (102) configured to determine a current envelope (133) based on the plurality of sequential blocks (131) of transform coefficients; wherein the current envelope (133) is indicative of a plurality of spectral energy values (303) for the corresponding plurality of frequency bins (301). In addition, the encoder (100, 170) comprises an envelope interpolation unit (104) configured to determine a plurality of interpolated envelopes (136) for the plurality of blocks (131) of transform coefficients, respectively, based on the current envelope (133); Furthermore, the encoder (100, 170) comprises a flattening unit (108) configured to determine a plurality of blocks (140) of flattened transform coefficients by flattening the corresponding plurality of blocks (131) of transform coefficients using the corresponding plurality of interpolated envelopes (136), respectively; wherein the bitstream is determined based on the plurality of blocks (140) of flattened transform coefficients.
摘要:
The present document relates an audio encoding and decoding system (referred to as an audio codec system). In particular, the present document relates to a transform-based audio codec system which is particularly well suited for voice encoding/decoding. A quantization unit (112) configured to quantize a first coefficient of a block (141) of coefficients is described. The block (141) of coefficients comprises a plurality of coefficients for a plurality of corresponding frequency bins (301). The quantization unit (112) is configured to provide a set (326, 327) of quantizers. The set (326, 327) of quantizers comprises a plurality of different quantizers (321, 322, 323) associated with a plurality of different signal-to-noise ratios, referred to as SNR, respectively. The plurality of different quantizers (321, 322, 323) includes a noise-filling quantizer (321); one or more dithered quantizers (322); and one or more undithered quantizers (323). The quantization unit (112) is further configured to determine an SNR indication indicative of a SNR attributed to the first coefficient, and to select a first quantizer from the set (326, 327) of quantizers, based on the SNR indication. In addition, the quantization unit (112) is configured to quantize the first coefficient using the first quantizer.