摘要:
An apparatus for encoding an audio signal having a plurality of channels is provided. The apparatus comprises a downmixer (1010) for downmixing the plurality of channels to obtain a downmix signal. Moreover, the apparatus comprises a residual signal calculator (1020) adapted for calculating a residual signal. Furthermore, the apparatus comprises a phase information calculator (1030) adapted for calculating information on a phase difference between the downmix and the residual signal to obtain phase information. Moreover, the apparatus comprises an output generator (1040) for outputting the phase information.
摘要:
An audio encoder for providing an encoded audio information on the basis of an input audio information comprises a low frequency encoder configured to encode a low frequency portion of the input audio information to obtain an encoded representation of the low frequency portion, and a bandwidth extension information provider configured to provide bandwidth extension information on the basis of the input audio information. The audio encoder is configured to selectively include bandwidth extension information into the encoded audio information in a signal-adaptive manner. An audio decoder comprises a low frequency decoder configured to decode an encoded representation of a low frequency portion to obtain a decoded representation of the low frequency portion, and a bandwidth extension configured to obtain a bandwidth extension signal using a blind bandwidth extension for portions of an audio content for which no bandwidth extension parameters are included in the encoded audio information, and to obtain the bandwidth extension signal using a parameter-guided bandwidth extension for portions of the audio content for which bandwidth extension parameters are included in the encoded audio information.
摘要:
Zum Ermitteln eines Schätzwerts für einen Bedarf an Informationseinheiten zum Codieren eines Signals wird neben der erlaubten Störung für ein Frequenzband und einer Energie des Frequenzbands zusätzlich ein Maß (nl(b)) für die Verteilung der Energie in dem Frequenzband berücksichtigt (102, 104, 106). Damit wird ein besserer Schätzwert für den Bedarf an Informationseinheiten erhalten, so dass effizienter und genauer codiert werden kann.
摘要:
An audio decoder for providing at least four bandwidth-extended channel signals on the basis of an encoded representation is configured to provide a first downmix signal and a second downmix signal on the basis of a jointly encoded representation of the first downmix signal and the second downmix signal using a multi-channel decoding. The audio decoder is configured to provide at least a first audio channel signal and a second audio channel signal on the basis of the first downmix signal using a multi-channel decoding. The audio decoder is configured to provide at least a third audio channel signal and a fourth audio channel signal on the basis of the second downmix signal using a multi-channel decoding. The audio decoder is configured to perform a multi-channel bandwidth extension on the basis of the first audio channel signal and the third audio channel signal, to obtain a first bandwidth-extended channel signal and a third bandwidth-extended channel signal. The audio decoder is configured to perform a multi-channel bandwidth extension on the basis of the second audio channel signal and the fourth audio channel signal, to obtain a second bandwidth extended channel signal and a fourth bandwidth extended channel signal. An audio encoder uses a related concept.
摘要:
An audio or video encoder and an audio or video decoder are based on a combination of two audio or video channels (201, 202) to obtain a first combination signal (204) as a mid signal and a residual signal (205) which can be derived using a predicted side signal derived from the mid signal. The first combination signal and the prediction residual signal are encoded (209) and written (212) into a data stream (213) together with the prediction information (206) derived by an optimizer (207) based on an optimization target (208) and a prediction direction indicator indicating a prediction direction associated with the residual signal. A decoder uses the prediction residual signal, the first combination signal, the prediction direction indicator and the prediction information to derive a decoded first channel signal and a decoded second channel signal. In an encoder example or in a decoder example, a real-to-imaginary transform can be applied for estimating the imaginary part of the spectrum of the first combination signal. For calculating the prediction signal used in the derivation of the prediction residual signal, the real-valued first combination signal is multiplied by a real portion of the complex prediction information and the estimated imaginary part of the first combination signal is multiplied by an imaginary portion of the complex prediction information.
摘要:
An audio or video encoder and an audio or video decoder are based on a combination of two audio or video channels (201, 202) to obtain a first combination signal (204) as a mid signal and a residual signal (205) which can be derived using a predicted side signal derived from the mid signal. The first combination signal and the prediction residual signal are encoded (209) and written (212) into a data stream (213) together with the prediction information (206) derived by an optimizer (207) based on an optimization target (208) and a prediction direction indicator indicating a prediction direction associated with the residual signal. A decoder uses the prediction residual signal, the first combination signal, the prediction direction indicator and the prediction information to derive a decoded first channel signal and a decoded second channel signal. In an encoder example or in a decoder example, a real-to-imaginary transform can be applied for estimating the imaginary part of the spectrum of the first combination signal. For calculating the prediction signal used in the derivation of the prediction residual signal, the real-valued first combination signal is multiplied by a real portion of the complex prediction information and the estimated imaginary part of the first combination signal is multiplied by an imaginary portion of the complex prediction information.
摘要:
An audio or video encoder and an audio or video decoder are based on a combination of two audio or video channels (201, 202) to obtain a first combination signal (204) as a mid signal and a residual signal (205) which can be derived using a predicted side signal derived from the mid signal. The first combination signal and the prediction residual signal are encoded (209) and written (212) into a data stream (213) together with the prediction information (206) derived by an optimizer (207) based on an optimization target (208) and a prediction direction indicator indicating a prediction direction associated with the residual signal. A decoder uses the prediction residual signal, the first combination signal, the prediction direction indicator and the prediction information to derive a decoded first channel signal and a decoded second channel signal. In an encoder example or in a decoder example, a real-to-imaginary transform can be applied for estimating the imaginary part of the spectrum of the first combination signal. For calculating the prediction signal used in the derivation of the prediction residual signal, the real-valued first combination signal is multiplied by a real portion of the complex prediction information and the estimated imaginary part of the first combination signal is multiplied by an imaginary portion of the complex prediction information.
摘要:
Embodiments provide an apparatus for encoding a multi-channel signal having at least three channels. The apparatus comprises an iteration processor, a channel encoder and an output interface. The iteration processor is configured to calculate, in a first iteration step, inter-channel correlation values between each pair of the at least three channels, for selecting, in the first iteration step, a pair having a highest value or having a value above a threshold, and for processing the selected pair using a multi-channel processing operation to derive first multi-channel parameters for the selected pair and to derive first processed channels. Further, the iteration processor is configured to perform the calculating, the selecting and the processing in a second iteration step using at least one of the processed channels to derive second multi-channel parameters and second processed channels. The channel encoder is configured to encode channels resulting from an iteration processing performed by the iteration processor to obtain encoded channels. The output interface is configured to generate an encoded multi-channel signal having the encoded channels and the first and the second multi-channel parameters.
摘要:
An apparatus for encoding an audio signal having a plurality of channels is provided. The apparatus comprises a downmixer (1010) for down mixing the plurality of channels to obtain a downmix signal. Moreover, the apparatus comprises a residual signal calculator (1020) adapted for calculating a residual signal. Furthermore, the apparatus comprises a phase information calculator (1030) adapted for calculating information on a phase difference between the downmix and the residual signal to obtain phase information. Moreover, the apparatus comprises an output generator (1040) for outputting the phase information.