摘要:
A method for processing an audio signal (100) includes receiving an encoded audio signal and generating a decoded audio signal by processing consecutive audio frames of the audio signal to avoid discontinuities. Processing consecutive audio frames of the audio signal to avoid discontinuities includes using linear predictive filtering for removing (S102, S104, S300-S308, S400-S402) a discontinuity (106a, 106b) between a filtered past frame and a filtered current frame of the audio signal. The method includes filtering the current frame of the audio signal and removing the discontinuity by modifying a beginning portion of the filtered current frame by a signal obtained by linear predictive filtering a predefined signal with initial states of the linear predictive filter defined on the basis of a last part of the unfiltered past frame filtered using the set of filter parameters for filtering the current frame.
摘要:
An audio encoder for encoding an audio signal, comprises: a first encoding processor (600) for encoding a first audio signal portion in a frequency domain, wherein the first encoding processor (600) comprises: a time frequency converter (602) for converting the first audio signal portion into a frequency domain representation having spectral lines up to a maximum frequency of the first audio signal portion; an analyzer (604) for analyzing the frequency domain representation up to the maximum frequency to determine first spectral portions to be encoded with a first spectral resolution and second spectral regions to be encoded with a second spectral resolution, the second spectral resolution being lower than the first spectral resolution; a spectral encoder (606) for encoding the first spectral portions with the first spectral resolution and for encoding the second spectral portions with the second spectral resolution; a second encoding processor (610) for encoding a second different audio signal portion in the time domain; a controller (620) configured for analyzing the audio signal and for determining, which portion of the audio signal is the first audio signal portion encoded in the frequency domain and which portion of the audio signal is the second audio signal portion encoded in the time domain; and an encoded signal former (630) for forming an encoded audio signal comprising a first encoded signal portion for the first audio signal portion and a second encoded signal portion for the second audio signal portion.
摘要:
An audio decoder (100;200;300) for providing a decoded audio information (112;212;312) on the basis of an encoded audio information (110;210;310), the audio decoder comprises a linear-prediction-domain decoder (120;220;320) configured to provide a first decoded audio information (122;222;322; S C (n)) on the basis of an audio frame encoded in a linear prediction domain, a frequency domain decoder (130;230;330) configured to provide a second decoded audio information (132;232;332; S M (n)) on the basis of an audio frame encoded in a frequency domain, and a transition processor (140;240;340).The transition processor is configured to obtain a zero-input-response (150; 256;348) of a linear predictive filtering(148; 254; 346), wherein an initial state (146;252;344) of the linear predictive filtering is defined in dependence on the first decoded audio information and the second decoded audio information. The transition processor is also configured to modify the second decoded audio information (132; 232;332;S M (n)), which is provided on the basis of an audio frame encoded in the frequency domain following an audio frame encoded in the linear prediction domain, in dependence on the zero-input-response, to obtain a smooth transition between the first decoded audio information (S C (n)) and the modified second decoded audio information ( S M ^ n ).
摘要:
An apparatus for encoding an audio signal having a stream of audio samples 100 comprises: a windower 102 for applying a prediction coding analysis window 200 to the stream of audio samples to obtain windowed data for a prediction analysis and for applying a transform coding analysis window 204 to the stream of audio samples to obtain windowed data for a transform analysis, wherein the transform coding analysis window is associated with audio samples within a current frame of audio samples and with audio samples of a predefined portion of a future frame of audio samples being a transform-coding look-ahead portion 206, wherein the prediction coding analysis window is associated with at least the portion of the audio samples of the current frame and with audio samples of a predefined portion of the future frame being a prediction coding look-ahead portion 208, wherein the transform coding look-ahead portion 206 and the prediction coding look-ahead portion 208 are identically to each other or are different from each other by less than 20% of the prediction coding look-ahead portion 208 or less than 20% of the transform coding look-ahead portion 206; and an encoding processor 104 for generating prediction coded data for the current frame using the windowed data for the prediction analysis or for generating transform coded data for the current frame using the windowed data for the transform analysis.
摘要:
An apparatus for encoding an audio signal having a stream of audio samples 100 comprises: a windower 102 for applying a prediction coding analysis window 200 to the stream of audio samples to obtain windowed data for a prediction analysis and for applying a transform coding analysis window 204 to the stream of audio samples to obtain windowed data for a transform analysis, wherein the transform coding analysis window is associated with audio samples within a current frame of audio samples and with audio samples of a predefined portion of a future frame of audio samples being a transform-coding look-ahead portion 206, wherein the prediction coding analysis window is associated with at least the portion of the audio samples of the current frame and with audio samples of a predefined portion of the future frame being a prediction coding look-ahead portion 208, wherein the transform coding look-ahead portion 206 and the prediction coding look-ahead portion 208 are identically to each other or are different from each other by less than 20% of the prediction coding look-ahead portion 208 or less than 20% of the transform coding look-ahead portion 206; and an encoding processor 104 for generating prediction coded data for the current frame using the windowed data for the prediction analysis or for generating transform coded data for the current frame using the windowed data for the transform analysis.
摘要:
An assignment of one of phase set of different loss concealment tools of an audio decoder to a portion of the audio signal to be decoded from a data stream, which portion is affected by loss, that is the selection out of the set of different loss concealment tools, may be made in a manner leading to a more pleasant loss concealment if the assignment/selection is done based on two measures: A first measure which is determined measures a spectral position of a spectral centroid of a spectrum of the audio signal and a second measure which is determined measures a temporal predictability of the audio signal. The assigned or selected loss concealment tool may then be used to recover the portion of the audio signal.
摘要:
A method for processing an audio signal is described, which uses linear predictive filtering for removing a discontinuity between a filtered past frame and a filtered current frame of the audio signal. The method comprises filtering the current frame of the audio signal and removing the discontinuity by modifying a beginning portion of the filtered current frame by a signal obtained by linear predictive filtering a predefined signal with initial states of the linear predictive filter defined on the basis of a last part of the unfiltered past frame filtered using the set of filter parameters for filtering the current frame. The linear predictive filter is defined as A z = 1 ∑ m = 0 M a m z − m with M the filter order and a m the filter coefficients, with α 0 = 1.
摘要:
An audio decoder for decoding an encoded audio signal, comprises: a prediction parameter decoder (180) for performing a decoding of data for a prediction coded frame from the encoded audio signal; a transform parameter decoder (183) for performing a decoding of data for a transform coded frame from the encoded audio signal, wherein the transform parameter decoder (183) is configured for performing a spectral-time transform and for applying a synthesis window to transformed data to obtain data for the current frame and a future frame, the synthesis window having a first overlap portion, an adjacent second overlap portion and an adjacent third overlap portion (206), the third overlap portion being associated with audio samples for the future frame and the non-overlap portion (208) being associated with data of the current frame; and an overlap-adder (184) for overlapping and adding synthesis windowed samples associated with the third overlap portion of a synthesis window for the current frame and synthesis windowed samples associated with the first overlap portion of a synthesis window for the future frame to obtain a first portion of audio samples for the future frame, wherein a rest of the audio samples for the future frame are synthesis windowed samples associated with the second non-overlapping portion of the synthesis window for the future frame obtained without overlap-adding, when the current frame and the future frame comprise transform-coded data.
摘要:
A method is described that processes an audio signal (100). A discontinuity between a filtered past frame and a filtered current frame of the audio signal is removed using linear predictive filtering (102, 110, 112).