摘要:
An apparatus for encoding an audio signal having a stream of audio samples 100 comprises: a windower 102 for applying a prediction coding analysis window 200 to the stream of audio samples to obtain windowed data for a prediction analysis and for applying a transform coding analysis window 204 to the stream of audio samples to obtain windowed data for a transform analysis, wherein the transform coding analysis window is associated with audio samples within a current frame of audio samples and with audio samples of a predefined portion of a future frame of audio samples being a transform-coding look-ahead portion 206, wherein the prediction coding analysis window is associated with at least the portion of the audio samples of the current frame and with audio samples of a predefined portion of the future frame being a prediction coding look-ahead portion 208, wherein the transform coding look-ahead portion 206 and the prediction coding look-ahead portion 208 are identically to each other or are different from each other by less than 20% of the prediction coding look-ahead portion 208 or less than 20% of the transform coding look-ahead portion 206; and an encoding processor 104 for generating prediction coded data for the current frame using the windowed data for the prediction analysis or for generating transform coded data for the current frame using the windowed data for the transform analysis.
摘要:
An apparatus for generating a decoded two-channel signal, comprises: an audio processor (802) for decoding an encoded two-channel signal to obtain a first set of first spectral portions; a parametric decoder (804) for providing parametric data for a second set of second spectral portions and a two-channel identification identifying either a first or a second different two-channel representation for the second spectral portions; and a frequency regenerator (806) for regenerating a second spectral portion depending on a first spectral portion of the first set of first spectral portions, the parametric data for the second portion and the two-channel identification for the second portion.
摘要:
An audio decoder for decoding an encoded audio signal, comprises: a prediction parameter decoder (180) for performing a decoding of data for a prediction coded frame from the encoded audio signal; a transform parameter decoder (183) for performing a decoding of data for a transform coded frame from the encoded audio signal, wherein the transform parameter decoder (183) is configured for performing a spectral-time transform and for applying a synthesis window to transformed data to obtain data for the current frame and a future frame, the synthesis window having a first overlap portion, an adjacent second overlap portion and an adjacent third overlap portion (206), the third overlap portion being associated with audio samples for the future frame and the non-overlap portion (208) being associated with data of the current frame; and an overlap-adder (184) for overlapping and adding synthesis windowed samples associated with the third overlap portion of a synthesis window for the current frame and synthesis windowed samples associated with the first overlap portion of a synthesis window for the future frame to obtain a first portion of audio samples for the future frame, wherein a rest of the audio samples for the future frame are synthesis windowed samples associated with the second non-overlapping portion of the synthesis window for the future frame obtained without overlap-adding, when the current frame and the future frame comprise transform-coded data.
摘要:
An apparatus for generating an enhanced signal from an input signal (600), wherein the enhanced signal has spectral values for an enhancement spectral region, the spectral values for the enhancement spectral regions not being contained in the input signal (600), comprises a mapper (602) for mapping a source spectral region of the input signal to a target region in the enhancement spectral region, the source spectral region comprising a noise-filling region (302); and a noise filler (604) configured for generating first noise values for the noise-filling region (302) in the source spectral region of the input signal and for generating second noise values for a noise region in the target region, wherein the second noise values are decorrelated from the first noise values or for generating second noise values for a noise region in the target region, wherein the second noise values are decorrelated from first noise values in the source region.
摘要:
An embodiment of an apparatus (100) for generating audio subband values in audio subband channels comprises an analysis windower (110) for windowing a frame (120) of time-domain audio input samples being in a time sequence extending from an early sample to a later sample using an analysis window function (190) comprising a sequence of window coefficients to obtain windowed samples. The analysis window function (190) comprises a first group (200) of window coefficients and a second group (210) of window coefficients. The first group (200) of window coefficients is used for windowing later time-domain samples and the second group (210) of window coefficients is used for windowing an earlier time-domain samples. The apparatus (100) further comprises a calculator (170) for calculating the audio subband values using the windowed samples.
摘要:
An apparatus for generating a plurality of spectral patterns is provided. The apparatus comprises a signal generator (165) for generating a plurality of signals in a first domain, a signal transformation unit (175) for transforming each signal of the plurality of signals from the first domain to a second domain to obtain a plurality of spectral patterns, each pattern of the plurality of transformed spectral patterns comprising a plurality of coefficients, a postprocessing unit (185) for truncating the transformed spectral patterns by removing one or more of the coefficients of the transformed spectral patterns to obtain a plurality of processed patterns, and a storage unit (195) comprising a database or a memory, wherein the storage unit (195) is configured to store each processed pattern of the plurality of processed patterns in the database or the memory.
摘要:
A signal processor for providing a processed version of an input signal in dependence on the input signal comprises a windower configured to window a portion of the input signal, or of a pre-processed version thereof, in dependence on a signal processing window described by signal processing window values for a plurality of window value index values, in order to obtain the processed version of the input signal. The signal processor also comprises a window provider for providing the signal processing window values for a plurality of window value index values in dependence on one or more window shape parameters.
摘要:
Audio encoder 2" for encoding a multichannel signal 4 is shown. The audio encoder comprises a downmixer 12 for downmixing the multichannel signal 4 to obtain a downmix signal 14, a linear prediction domain core encoder 16 for encoding the downmix signal 14, wherein the downmix signal 14 has a low band and a high band, wherein the linear prediction domain core encoder 16 is configured to apply a bandwidth extension processing for parametrically encoding the high band, a filterbank 82 for generating a spectral representation of the multichannel signal 4, and a joint multichannel encoder 18 configured to process the spectral representation comprising the low band and the high band of the multichannel signal to generate multichannel information 20.
摘要:
An apparatus for decoding an encoded audio signal comprising an encoded representation of a first set of first spectral portions and an encoded representation of parametric data indicating spectral energies for a second set of second spectral portions, comprises: an audio decoder (900) for decoding the encoded representation (901 b) of the first set of the first spectral portions to obtain a first set of first spectral portions (904) and for decoding the encoded representation of the parametric data to obtain a decoded parametric data (902) for the second set of second spectral portions indicating, for individual reconstruction bands, individual energies; a frequency regenerator (906) for reconstructing spectral values in a reconstruction band (920) comprising a second spectral portion (922, 923) using a first spectral portion of the first set of the first spectral portions and an individual energy for the reconstruction band, the reconstruction band comprising a first spectral portion (921) and the second spectral portion; wherein the frequency regenerator (906) is configured for determining (912) a survive energy information comprising an accumulated energy of the first spectral portion having frequency values in the reconstruction band, determining (918) a tile energy information of further spectral portions (922, 923) of the reconstruction band (920) for frequency values different from the first spectral portion (921) having frequencies in the reconstruction band (920), wherein the further spectral portions (922, 923) are to be generated by frequency regeneration using a first spectral portion (302) different from the first spectral portion (921, 306) in the reconstruction band; determining (914) a missing energy in the reconstruction band (920) using the individual energy for the reconstruction band and the survive energy information; and adjusting (916) the further spectral portions in the reconstruction band based on the missing energy information and the tile energy information.