摘要:
A decoder for decoding an encoded audio signal to obtain a phase-adjusted audio signal is provided. The decoder comprises a decoding unit (110) and a phase adjustment unit (120). The decoding unit (110) is adapted to decode the encoded audio signal to obtain a decoded audio signal. The phase adjustment unit (120) is adapted to adjust the decoded audio signal to obtain the phase-adjusted audio signal. The phase adjustment unit (120) is configured to receive control information depending on a vertical phase coherence of the encoded audio signal. Moreover, the phase adjustment unit (120) is adapted to adjust the decoded audio signal based on the control information.
摘要:
An audio decoder device for decoding a bitstream includes a bitstream receiver configured to receive the bitstream and to derive an encoded audio signal from the bitstream; a core decoder module configured for deriving a decoded audio signal in a time domain from the encoded audio signal; a temporal envelope generator configured to determine a temporal envelope of the decoded audio signal; a bandwidth extension module configured to produce a frequency domain bandwidth extension signal; a time-to-frequency converter configured to transform the decoded audio signal into a frequency domain decoded audio signal; a combiner configured to combine the frequency domain decoded audio signal and the frequency domain bandwidth extension signal in order to produce a bandwidth extended frequency domain audio signal; and a frequency-to-time converter configured to transform the bandwidth extended frequency domain audio signal into a bandwidth-extended time domain audio signal.
摘要:
An apparatus for synthesizing a parameterized representation of an audio signal comprising a time portion of an audio signal, band pass filter information for a plurality of band pass filters, the band pass filter information indicating time-varying band pass filter center frequencies of band pass filters having varying band widths, which depend on a band pass filter center frequency of the corresponding band pass filter, and having amplitude modulation or phase modulation or frequency modulation information for each band pass filter for the time portion of the audio signal, comprises: an amplitude modulation synthesizer (201) for synthesizing an amplitude modulation component based on the amplitude modulation information; a frequency modulation or phase modulation synthesizer for synthesizing instantaneous frequency of phase information based on the information on a carrier frequency and a frequency modulation information for a respective band width, wherein distances in frequency between adjacent carrier frequencies are different over a frequency spectrum, an oscillator (203) for generating an output signal representing an instantaneously amplitude modulated, frequency modulated or phase modulated oscillation signal (204) for each band pass filter channel; and a combiner (205) for combining signals from the band pass filter channels and for generating an audio output signal (206) based on the signals from the band pass filter channels.
摘要:
An apparatus for generating a bandwidth extended audio signal from an input signal, includes a patch generator for generating one or more patch signals from the input signal, wherein the patch generator is configured for performing a time stretching of subband signals from an analysis filterbank, and wherein the patch generator further includes a phase adjuster for adjusting phases of the subband signals using a filterbank-channel dependent phase correction.
摘要:
An audio decoder device for decoding a bitstream includes a bitstream receiver configured to receive the bitstream and to derive an encoded audio signal from the bitstream; a core decoder module configured for deriving a decoded audio signal in a time domain from the encoded audio signal; a temporal envelope generator configured to determine a temporal envelope of the decoded audio signal; a bandwidth extension module configured to produce a frequency domain bandwidth extension signal; a time-to-frequency converter configured to transform the decoded audio signal into a frequency domain decoded audio signal; a combiner configured to combine the frequency domain decoded audio signal and the frequency domain bandwidth extension signal in order to produce a bandwidth extended frequency domain audio signal; and a frequency-to-time converter configured to transform the bandwidth extended frequency domain audio signal into a bandwidth-extended time domain audio signal.
摘要:
An apparatus for processing an audio signal comprising a sequence of blocks (114) of spectral values comprises: a processor (100) for processing the sequence of blocks using at least one modification values (102) for a first block to obtain aliasing-reduced or aliasing-free first result signal in an overlap range (170) and using at least one second different modification value (106) for a second block of the sequence of blocks to obtain an aliasing-reduced or aliasing-free second result signal (108) in the overlap range (170); and a combiner (110) for combining the first result signal (104) and the second result signal (108) in the overlap range (170) to obtain a processed signal (112) for the overlap range (170).
摘要:
A multi-channel audio decoder for providing at least two output audio signals on the basis of an encoded representation is configured to render a plurality of decoded audio signals, which are obtained on the basis of the encoded representation, in dependence on one or more rendering parameters, to obtain a plurality of rendered audio signals. The multichannel audio decoder is configured to derive one or more decorrelated audio signals from the rendered audio signals, and to combine the rendered audio signals, or a scaled version thereof, with the one or more decorrelated audio signals, to obtain the output audio signals. A multi-channel audio encoder provides a decorrelation method parameter to control an audio decoder.
摘要:
An apparatus for decoding to obtain a reconstructed audio signal envelope includes a signal envelope reconstructor for generating the reconstructed audio signal envelope depending on one or more splitting points and an output interface for outputting the reconstructed audio signal envelope. The signal envelope reconstructor is configured to generate the reconstructed audio signal envelope such that the one or more splitting points divide the reconstructed audio signal envelope into two or more audio signal envelope portions, and to generate the reconstructed audio signal envelope such that, for each of the two or more signal envelope portions, an absolute value of its signal envelope portion value is greater than half of an absolute value of the signal envelope portion value of each of the other signal envelope portions.
摘要:
An audio decoder is proposed for decoding a multi-object audio signal consisting of a downmix signal X and side information PSI. The side information comprises object-specific side information PSI i for an audio object s i in a time/frequency region R(t R ,f R ), and object-specific time/frequency resolution information TFRI i indicative of an object-specific time/frequency resolution TFR h of the object-specific side information for the audio object s i in the time/frequency region R(t R ,f R ). The audio decoder comprises an object-specific time/frequency resolution determiner 110 configured to determine the object-specific time/frequency resolution information TFRI i from the side information PSI for the audio object s i . The audio decoder further comprises an object separator 120 configured to separate the audio object s i from the downmix signal X using the object-specific side information in accordance with the object-specific time/frequency resolution TFRI i . A corresponding encoder and corresponding methods for decoding or encoding are also described.
摘要:
A decoder for generating a frequency enhanced audio signal (120), comprises: a feature extractor (104) for extracting a feature from a core signal (100); a side information extractor (110) for extracting a selection side information associated with the core signal; a parameter generator (108) for generating a parametric representation for estimating a spectral range of the frequency enhanced audio signal (120) not defined by the core signal (100), wherein the parameter generator (108) is configured to provide a number of parametric representation alternatives (702, 704, 706, 708) in response to the feature (112), and wherein the parameter generator (108) is configured to select one of the parametric representation alternatives as the parametric representation in response to the selection side information (712 to 718); and a signal estimator (118) for estimating the frequency enhanced audio signal (120) using the parametric representation selected.