Abstract:
An apparatus for encoding an audio signal having a stream of audio samples 100 comprises: a windower 102 for applying a prediction coding analysis window 200 to the stream of audio samples to obtain windowed data for a prediction analysis and for applying a transform coding analysis window 204 to the stream of audio samples to obtain windowed data for a transform analysis, wherein the transform coding analysis window is associated with audio samples within a current frame of audio samples and with audio samples of a predefined portion of a future frame of audio samples being a transform coding look-ahead portion 206, wherein the prediction coding analysis window is associated with at least the portion of the audio samples of the current frame and with audio samples of a predefined portion of the future frame being a prediction coding look-ahead portion 208, wherein the transform coding look-ahead portion 206 and the prediction coding look-ahead portion 208 are identical to each other or differ from each other by less than 20% of the prediction coding look-ahead portion 208 or by less than 20% of the transform coding look-ahead portion 206; and an encoding processor 104 for generating prediction coded data for the current frame using the windowed data for the prediction analysis or for generating transform coded data for the current frame using the windowed data for the transform analysis.
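The look-ahead condition in this abstract can be read as a simple numeric test. The following is a minimal sketch, assuming both look-ahead portions are measured as sample counts; the function name `lookaheads_compatible` and the example lengths are hypothetical and do not come from the abstract.

```python
def lookaheads_compatible(transform_la: int, prediction_la: int) -> bool:
    """Check the look-ahead condition stated in the abstract.

    Both arguments are look-ahead lengths in samples (an assumption).
    The condition holds if the lengths are identical, or if they differ
    by less than 20% of the prediction look-ahead, or by less than 20%
    of the transform look-ahead.
    """
    diff = abs(transform_la - prediction_la)
    if diff == 0:
        return True
    # "diff < 0.2 * x" written as "5 * diff < x" to stay in integers.
    return 5 * diff < prediction_la or 5 * diff < transform_la


if __name__ == "__main__":
    # Hypothetical lengths, chosen only to exercise each branch.
    print(lookaheads_compatible(160, 160))
    print(lookaheads_compatible(160, 140))
    print(lookaheads_compatible(160, 100))
```

Integer arithmetic is used for the 20% comparison so that the strict "less than" in the abstract is not blurred by floating-point rounding.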
Abstract:
An audio encoder (100) for encoding segments of coefficients, the segments of coefficients representing different time or frequency resolutions of a sampled audio signal, the audio encoder (100) comprising a processor (110) for deriving a coding context for a currently encoded coefficient of a current segment based on a previously encoded coefficient of a previous segment, the previously encoded coefficient representing a different time or frequency resolution than the currently encoded coefficient. The audio encoder (100) further comprises an entropy encoder (120) for entropy encoding the current coefficient based on the coding context to obtain an encoded audio stream.
Abstract:
An audio decoder for decoding an encoded audio signal comprises: a prediction parameter decoder (180) for performing a decoding of data for a prediction coded frame from the encoded audio signal; a transform parameter decoder (183) for performing a decoding of data for a transform coded frame from the encoded audio signal, wherein the transform parameter decoder (183) is configured for performing a spectral-time transform and for applying a synthesis window to transformed data to obtain data for the current frame and a future frame, the synthesis window having a first overlap portion, an adjacent second non-overlapping portion and an adjacent third overlap portion (206), the third overlap portion being associated with audio samples for the future frame and the non-overlapping portion (208) being associated with data of the current frame; and an overlap-adder (184) for overlapping and adding synthesis windowed samples associated with the third overlap portion of a synthesis window for the current frame and synthesis windowed samples associated with the first overlap portion of a synthesis window for the future frame to obtain a first portion of audio samples for the future frame, wherein a rest of the audio samples for the future frame are synthesis windowed samples associated with the second non-overlapping portion of the synthesis window for the future frame obtained without overlap-adding, when the current frame and the future frame comprise transform-coded data.
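The overlap-add step described in this abstract can be illustrated with a small sample-bookkeeping sketch. It assumes a sine-shaped overlap slope and a window laid out as overlap, flat, overlap; it deliberately leaves out the spectral-time transform and any aliasing cancellation. All function names and the lengths used are hypothetical.

```python
import math


def sine_slope(n):
    """Rising slope of n samples (sine shape is an assumption)."""
    return [math.sin(math.pi * (i + 0.5) / (2 * n)) for i in range(n)]


def synthesis_window(overlap, flat):
    """First overlap portion, non-overlapping portion, third overlap portion."""
    rise = sine_slope(overlap)
    return rise + [1.0] * flat + rise[::-1]


def overlap_add(cur_block, fut_block, overlap, flat):
    """Combine two synthesis-windowed blocks into samples for the future frame.

    The first `overlap` output samples add the third overlap portion of the
    current block to the first overlap portion of the future block. The
    remaining `flat` samples are copied from the non-overlapping portion of
    the future block without any addition.
    """
    tail = cur_block[overlap + flat:]          # third overlap portion
    head = fut_block[:overlap]                 # first overlap portion
    first = [a + b for a, b in zip(tail, head)]
    rest = fut_block[overlap:overlap + flat]   # no overlap-add here
    return first + rest


if __name__ == "__main__":
    overlap, flat = 4, 8
    w = synthesis_window(overlap, flat)
    # Constant input of 1.0, windowed once at analysis and once at synthesis.
    block = [x * x for x in w]
    print(overlap_add(block, block, overlap, flat))
```

With a constant input, the squared rising and falling slopes sum to one, so the overlapped region reconstructs the same level as the non-overlapping region.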
Abstract:
An audio encoder 2" for encoding a multichannel signal 4 is shown. The audio encoder comprises a downmixer 12 for downmixing the multichannel signal 4 to obtain a downmix signal 14, a linear prediction domain core encoder 16 for encoding the downmix signal 14, wherein the downmix signal 14 has a low band and a high band, wherein the linear prediction domain core encoder 16 is configured to apply a bandwidth extension processing for parametrically encoding the high band, a filterbank 82 for generating a spectral representation of the multichannel signal 4, and a joint multichannel encoder 18 configured to process the spectral representation comprising the low band and the high band of the multichannel signal to generate multichannel information 20.
Abstract:
A processor for processing an audio signal (200) comprises: an analyzer (202) for deriving a window control signal (204) from the audio signal (200) indicating a change from a first asymmetric window (1400) to a second window (1402), or indicating a change from a third window (1450) to a fourth asymmetric window (1452), wherein the second window (1402) is shorter than the first window (1400), or wherein the third window (1450) is shorter than the fourth window (1452); a window constructor (206) for constructing the second window (1402) using a first overlap portion (800) of the first asymmetric window (1400), wherein the window constructor (206) is configured to determine a first overlap portion (1000) of the second window (1402) using a truncated first overlap portion of the first asymmetric window, or wherein the window constructor is configured to calculate a second overlap portion (1330) of the third window (1450) using a truncated second overlap portion (814) of the fourth asymmetric window (1452); and a windower (208) for applying the first and second windows or the third and fourth windows to obtain windowed audio signal portions (210).
Abstract:
An audio encoder for providing an output signal using an input audio signal comprises a patch generator, a comparator and an output interface. The patch generator generates at least one bandwidth extension high-frequency signal, wherein a bandwidth extension high-frequency signal comprises a high-frequency band. The high-frequency band of the bandwidth extension high-frequency signal is based on a low-frequency band of the input audio signal. The comparator calculates a plurality of comparison parameters. A comparison parameter is calculated based on a comparison of the input audio signal and a generated bandwidth extension high-frequency signal. Each comparison parameter of the plurality of comparison parameters is calculated based on a different offset frequency between the input audio signal and a generated bandwidth extension high-frequency signal. Further, the comparator determines a comparison parameter from the plurality of comparison parameters, wherein the determined comparison parameter fulfills a predefined criterion.
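The comparator described in this abstract can be sketched as a search over candidate offsets. The sketch below makes several assumptions the abstract does not state: the signal is given as a list of spectral magnitudes, the offset is the start bin of the low-band source region, the comparison parameter is a normalized cross-correlation, and the predefined criterion is "largest value". The names `correlation` and `best_patch_offset` are hypothetical.

```python
import math


def correlation(a, b):
    """Normalized cross-correlation of two equal-length sequences."""
    num = sum(x * y for x, y in zip(a, b))
    den = math.sqrt(sum(x * x for x in a) * sum(y * y for y in b))
    return num / den if den > 0 else 0.0


def best_patch_offset(spectrum, crossover, width, offsets):
    """Compute one comparison parameter per offset and pick the best one.

    spectrum:  magnitude bins of the input signal (assumed representation)
    crossover: first bin of the high band
    width:     number of high-band bins to compare
    offsets:   candidate start bins of the low-band source region
    """
    target = spectrum[crossover:crossover + width]
    params = {}
    for off in offsets:
        if off + width > crossover:
            continue  # source region must lie entirely in the low band
        patch = spectrum[off:off + width]  # patch copied from the low band
        params[off] = correlation(target, patch)
    # Assumed criterion: the offset with the highest correlation wins.
    best = max(params, key=params.get)
    return best, params


if __name__ == "__main__":
    # Toy spectrum: the high band repeats the low-band bins starting at bin 3.
    spectrum = [9, 0, 3, 1, 5, 2, 7, 4, 1, 5, 2, 7]
    best, params = best_patch_offset(spectrum, crossover=8, width=4,
                                     offsets=range(0, 6))
    print(best, params)
```

Other criteria, such as a smallest spectral distance, would fit the same loop by swapping the `correlation` function and the `max` selection.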
Abstract:
An apparatus for encoding comprises a first domain converter (510), a switchable bypass (50), a second domain converter (410), a first processor (420) and a second processor (520) to obtain an encoded audio signal having different signal portions represented by coded data in different domains, which have been coded by different coding algorithms. Corresponding decoding stages in the decoder together with a bypass for bypassing a domain converter allow the generation of a decoded audio signal with high quality and low bit rate.