摘要:
An audio signal decoder (200) for providing a decoded representation (212) of an audio content on the basis of an encoded representation (310) of the audio content comprises a transform domain path (230, 240, 242, 250, 260) configured to obtain a time-domain representation (212) of a portion of the audio content encoded in a transform-domain mode on the basis of a first set (220) of spectral coefficients, a representation (224) of an aliasing-cancellation stimulus signal and a plurality of linear-prediction-domain parameters (222). The transform domain path comprises a spectrum processor (230) configured to apply a spectrum shaping to the first set of spectral coefficients in dependence on at least a subset of the linear-prediction-domain parameters, to obtain a spectrally-shaped version (232) of the first set of spectral coefficients. The transform domain path comprises a first frequency-domain-to-time-domain converter (240) configured to obtain a time-domain representation of the audio content on the basis of the spectrally-shaped version of the first set of spectral coefficients. The transform domain path comprises an aliasing-cancellation stimulus filter configured to filter (250) the aliasing-cancellation stimulus signal (324) in dependence on at least a subset of the linear-prediction-domain parameters (222), to derive an aliasing-cancellation synthesis signal (252) from the aliasing-cancellation stimulus signal. The transform domain path also comprises a combiner (260) configured to combine the time-domain representation (242) of the audio content with the aliasing-cancellation synthesis signal (252), or a post-processed version thereof, to obtain an aliasing reduced time-domain signal.
摘要:
An audio encoder for encoding an audio signal, comprises: a first encoding processor (600) for encoding a first audio signal portion in a frequency domain, wherein the first encoding processor (600) comprises: a time frequency converter for converting the first audio signal portion into a frequency domain representation having spectral lines up to a maximum frequency of the first audio signal portion; a spectral encoder for encoding the frequency domain representation; a second encoding processor for encoding a second different audio signal portion in the time domain; a cross-processor (700) for calculating, from the encoded spectral representation of the first audio signal portion, initialization data of the second encoding processor (610), so that the second encoding processing (610) is initialized to encode the second audio signal portion immediately following the first audio signal portion in time in the audio signal; a controller configured for analyzing the audio signal and for determining, which portion of the audio signal is the first audio signal portion encoded in the frequency domain and which portion of the audio signal is the second audio signal portion encoded in the time domain; and an encoded signal former for forming an encoded audio signal comprising a first encoded signal portion for the first audio signal portion and a second encoded signal portion for the second audio signal portion.
摘要:
An apparatus for generating a plurality of audio channels for a first speaker setup is characterized by an imaginary speaker determiner, an energy distribution calculator, a processor and a renderer. The imaginary speaker determiner is configured to determine a position of an imaginary speaker not contained in the first speaker setup to obtain a second speaker setup containing the imaginary speaker. The energy distribution calculator is configured to calculate an energy distribution from the imaginary speaker to the other speakers in the second speaker setup. The processor is configured to repeat the energy distribution to obtain a downmix information for a downmix from the second speaker setup to the first speaker setup. The renderer is configured to generate the plurality of audio channels using the downmix information.
摘要:
Embodiments of the present invention provide an encoder comprising a quantization stage, an entropy encoder, a residual quantization stage and a coded signal former. The quantization stage is configured to quantize an input signal using a dead zone in order to obtain a plurality of quantized values. The entropy encoder is configured to encode the plurality of quantized values using an entropy encoding scheme in order to obtain a plurality of entropy encoded values. The residual quantization stage is configured to quantize a residual signal caused by the quantization stage, wherein the residual quantization stage is configured to determine at least one quantized residual value in dependence on the dead zone of the quantization stage. The coded signal former is configured to form a coded signal from the plurality of entropy encoded values and the at least one quantized residual value.
摘要:
A decoder for generating an audio output signal comprising one or more audio output channels is provided. The decoder comprises a receiving interface (110) for receiving an audio input signal comprising a plurality of audio object signals, for receiving loudness information on the audio object signals, and for receiving rendering information indicating whether one or more of the audio object signals shall be amplified or attenuated. Moreover, the decoder comprises a signal processor (120) for generating the one or more audio output channels of the audio output signal. The signal processor (120) is configured to determine a loudness compensation value depending on the loudness information and depending on the rendering information. Furthermore, the signal processor (120) is configured to generate the one or more audio output channels of the audio output signal from the audio input signal depending on the rendering information and depending on the loudness compensation value. Moreover, an encoder is provided.
摘要:
An audio encoder for encoding an audio signal, comprises: a first encoding processor (600) for encoding a first audio signal portion in a frequency domain, wherein the first encoding processor (600) comprises: a time frequency converter for converting the first audio signal portion into a frequency domain representation having spectral lines up to a maximum frequency of the first audio signal portion; a spectral encoder for encoding the frequency domain representation; a second encoding processor for encoding a second different audio signal portion in the time domain; a cross-processor (700) for calculating, from the encoded spectral representation of the first audio signal portion, initialization data of the second encoding processor (610), so that the second encoding processing (610) is initialized to encode the second audio signal portion immediately following the first audio signal portion in time in the audio signal; a controller configured for analyzing the audio signal and for determining, which portion of the audio signal is the first audio signal portion encoded in the frequency domain and which portion of the audio signal is the second audio signal portion encoded in the time domain; and an encoded signal former for forming an encoded audio signal comprising a first encoded signal portion for the first audio signal portion and a second encoded signal portion for the second audio signal portion.
摘要:
An apparatus for generating four or more audio output signals has a panning gain determiner and a signal processor. The panning gain determiner is configured to determine a proper subset from a set of five or more loudspeaker positions, so that the proper subset has four or more of the five or more loudspeaker positions. Moreover, the panning gain determiner is configured to determine the proper subset depending on a panning position and on the five or more loudspeaker positions, and to determine a panning gain for each of the four or more audio output signals by determining the panning gain depending on the panning position and on the four or more loudspeaker positions of the proper subset. The signal processor is configured to generate each of the four or more audio output signals depending on the panning gain for the audio output signal and on an audio input signal.