Abstract:
An audio encoder for encoding audio input data (101) to obtain audio output data (501) comprises an input interface (100) for receiving a plurality of audio channels, a plurality of audio objects and metadata related to one or more of the plurality of audio objects; a mixer (200) for mixing the plurality of objects and the plurality of channels to obtain a plurality of pre-mixed channels, each pre-mixed channel comprising audio data of a channel and audio data of at least one object; a core encoder (300) for core encoding core encoder input data; and a metadata compressor (400) for compressing the metadata related to the one or more of the plurality of audio objects, wherein the audio encoder is configured to operate in at least one mode of a group of two modes comprising a first mode, in which the core encoder (300) is configured to encode the plurality of audio channels and the plurality of audio objects received by the input interface (100) as core encoder input data, and a second mode, in which the core encoder (300) is configured for receiving, as the core encoder input data, the plurality of pre-mixed channels generated by the mixer (200).
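The two operating modes can be illustrated with a minimal sketch. It only shows which signals would reach the core encoder in each mode; the function names, the `gains` matrix and the plain weighted-sum mixing are assumptions made for illustration and are not taken from the abstract.

```python
def premix(channels, objects, gains):
    """Mix every object into every channel.

    gains[c][o] is a hypothetical weight of object o in channel c.
    """
    premixed = []
    for c, channel in enumerate(channels):
        mixed = list(channel)
        for o, obj in enumerate(objects):
            for i, sample in enumerate(obj):
                mixed[i] += gains[c][o] * sample
        premixed.append(mixed)
    return premixed


def core_encoder_input(channels, objects, gains, mode):
    """Select the signals handed to the core encoder."""
    if mode == 1:
        # First mode: channels and objects are encoded as separate signals.
        return [list(s) for s in channels] + [list(s) for s in objects]
    if mode == 2:
        # Second mode: only the pre-mixed channels are encoded.
        return premix(channels, objects, gains)
    raise ValueError("mode must be 1 or 2")
```

In this sketch the first mode yields one core encoder input per channel plus one per object, whereas the second mode yields only as many inputs as there are channels.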
Abstract:
A multi-channel audio decoder for providing at least two output audio signals on the basis of an encoded representation is configured to render a plurality of decoded audio signals, which are obtained on the basis of the encoded representation, in dependence on one or more rendering parameters, to obtain a plurality of rendered audio signals. The multi-channel audio decoder is configured to derive one or more decorrelated audio signals from the rendered audio signals, and to combine the rendered audio signals, or a scaled version thereof, with the one or more decorrelated audio signals, to obtain the output audio signals. A multi-channel audio encoder provides a decorrelation method parameter to control an audio decoder.
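The render, decorrelate and combine chain can be sketched as follows. The rendering matrix stands in for the rendering parameters, and the decorrelator is a plain sample delay chosen only because it is easy to follow; both are assumptions, as are the `scale` and `mix` weights.

```python
def render(decoded, rendering_matrix):
    """Rendered signal r = sum over k of rendering_matrix[r][k] * decoded[k]."""
    n = len(decoded[0])
    return [
        [sum(row[k] * decoded[k][i] for k in range(len(decoded))) for i in range(n)]
        for row in rendering_matrix
    ]


def decorrelate(signal, delay=1):
    """Toy decorrelator (assumption): delay the signal by `delay` samples."""
    delay = min(delay, len(signal))
    return [0.0] * delay + signal[:len(signal) - delay]


def combine(rendered, scale=0.5, mix=0.5):
    """Combine scaled rendered signals with their decorrelated versions."""
    outputs = []
    for signal in rendered:
        decorrelated = decorrelate(signal)
        outputs.append([scale * r + mix * d for r, d in zip(signal, decorrelated)])
    return outputs
```

Here the decorrelated signals are derived from the rendered signals, that is, after rendering, which is the order the abstract describes.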
Abstract:
An apparatus for encoding one or more audio objects to obtain an encoded signal is provided. The apparatus comprises a downmixer (110) for downmixing the one or more audio objects to obtain one or more unprocessed downmix signals. Moreover, the apparatus comprises a processing module (120) for processing the one or more unprocessed downmix signals to obtain one or more processed downmix signals. Furthermore, the apparatus comprises a signal calculator (130) for calculating one or more additional signals, wherein the signal calculator (130) is configured to calculate each of the one or more additional signals based on a difference between one of the one or more processed downmix signals and one of the one or more unprocessed downmix signals. Moreover, the apparatus comprises an object information generator (140) for generating parametric audio object information for the one or more audio objects and additional parametric information for the one or more additional signals. Furthermore, the apparatus comprises an output interface (150) for outputting the encoded signal, the encoded signal comprising the parametric audio object information for the one or more audio objects and the additional parametric information for the one or more additional signals. Moreover, a corresponding apparatus for decoding is provided.
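The role of the signal calculator can be illustrated with a small sketch. The downmix gains and the gain-only `process` function are hypothetical stand-ins for the downmixer and the processing module; only the difference computation follows the abstract directly.

```python
def downmix(objects, gains):
    """Weighted sum of the object signals: one unprocessed downmix signal."""
    n = len(objects[0])
    return [sum(g * obj[i] for g, obj in zip(gains, objects)) for i in range(n)]


def process(signal, gain=0.5):
    """Hypothetical processing module: a plain gain change."""
    return [gain * s for s in signal]


def additional_signal(processed, unprocessed):
    """Difference between the processed and the unprocessed downmix."""
    return [p - u for p, u in zip(processed, unprocessed)]
```

With this definition, adding the additional signal to the unprocessed downmix gives back the processed downmix, which is what makes the difference signal a compact description of the processing.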
Abstract:
A decoder is provided. The decoder comprises a parametric decoding unit (110) for generating a plurality of first estimated audio object signals by upmixing three or more downmix signals, wherein the three or more downmix signals encode a plurality of original audio object signals, wherein the parametric decoding unit (110) is configured to upmix the three or more downmix signals depending on parametric side information indicating information on the plurality of original audio object signals. Moreover, the decoder comprises a residual processing unit (120) for generating a plurality of second estimated audio object signals by modifying one or more of the first estimated audio object signals, wherein the residual processing unit (120) is configured to modify said one or more of the first estimated audio object signals depending on one or more residual signals.
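The two decoding stages can be sketched as a matrix upmix followed by a residual correction. The upmix matrix stands in for the parametric side information, and adding the residual to the first estimate is one possible form of modification; both are assumptions, not details given in the abstract.

```python
def parametric_upmix(downmix, upmix_matrix):
    """First estimates: object j = sum over k of upmix_matrix[j][k] * downmix[k]."""
    if len(downmix) < 3:
        raise ValueError("the abstract assumes three or more downmix signals")
    n = len(downmix[0])
    return [
        [sum(row[k] * downmix[k][i] for k in range(len(downmix))) for i in range(n)]
        for row in upmix_matrix
    ]


def apply_residuals(first_estimates, residuals):
    """Second estimates: add a residual signal to selected first estimates.

    residuals maps an object index to its residual signal (hypothetical layout).
    Objects without a residual pass through unchanged.
    """
    second = [list(est) for est in first_estimates]
    for j, residual in residuals.items():
        second[j] = [e + r for e, r in zip(second[j], residual)]
    return second
```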
Abstract:
A decoder for generating an audio output signal comprising one or more audio output channels from a downmix signal comprising three or more downmix channels, wherein the downmix signal encodes three or more audio object signals, is provided. The decoder comprises an input channel router (110) for receiving the three or more downmix channels and for receiving side information, and at least two channel processing units (121, 122) for generating at least two processed channels to obtain the one or more audio output channels. The input channel router (110) is configured to feed each of at least two of the three or more downmix channels into at least one of the at least two channel processing units (121, 122), so that each of the at least two channel processing units (121, 122) receives one or more of the three or more downmix channels, and so that each of the at least two channel processing units (121, 122) receives fewer than the total number of the three or more downmix channels. Each channel processing unit of the at least two channel processing units (121, 122) is configured to generate one or more of the at least two processed channels depending on the side information and depending on said one or more of the at least two of the three or more downmix channels received by said channel processing unit from the input channel router (110).
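The routing constraint, namely that every channel processing unit receives at least one downmix channel but never all of them, can be sketched as follows. The `routing` table and the weighted-sum processing with per-unit `gains` are hypothetical; the gains merely stand in for the side information.

```python
def route(downmix_channels, routing):
    """Feed a subset of the downmix channels to each processing unit.

    routing[u] lists the indices of the downmix channels fed to unit u.
    """
    total = len(downmix_channels)
    for indices in routing:
        if not 1 <= len(indices) < total:
            raise ValueError("each unit needs one or more, but not all, channels")
    return [[downmix_channels[i] for i in indices] for indices in routing]


def process_unit(inputs, gains):
    """Hypothetical processing unit: weighted sum of its routed inputs."""
    n = len(inputs[0])
    return [sum(g * ch[i] for g, ch in zip(gains, inputs)) for i in range(n)]


def decode(downmix_channels, routing, side_info_gains):
    """Run every unit on its own routed subset of downmix channels."""
    routed = route(downmix_channels, routing)
    return [process_unit(inp, g) for inp, g in zip(routed, side_info_gains)]
```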
Abstract:
An apparatus for improving a perceived quality of sound reproduction of an audio output signal is provided. The apparatus comprises an active noise cancellation unit (110) for generating a noise cancellation signal based on an environmental audio signal, wherein the environmental audio signal comprises noise signal portions, the noise signal portions resulting from recording environmental noise. Moreover, the apparatus comprises a residual noise characteristics estimator (120) for determining a residual noise characteristic depending on the environmental noise and the noise cancellation signal. Furthermore, the apparatus comprises a perceptual noise compensation unit (130) for generating a noise-compensated signal based on an audio target signal and based on the residual noise characteristic. Moreover, the apparatus comprises a combiner (140) for combining the noise cancellation signal and the noise-compensated signal to obtain the audio output signal.
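The signal flow through the four units can be sketched with a toy model. The `efficiency` factor of the cancellation, the RMS level used as the residual noise characteristic and the level-dependent gain are all assumptions chosen to keep the example short; an actual perceptual compensation would be considerably more elaborate.

```python
import math


def cancellation_signal(environment, efficiency=0.5):
    """Toy ANC unit (assumption): inverted, attenuated copy of the noise."""
    return [-efficiency * e for e in environment]


def residual_noise_level(environment, cancellation):
    """RMS of the noise that remains after cancellation."""
    residual = [e + c for e, c in zip(environment, cancellation)]
    return math.sqrt(sum(r * r for r in residual) / len(residual))


def noise_compensated(target, residual_level, sensitivity=1.0):
    """Hypothetical compensation: raise the target gain with the residual level."""
    gain = 1.0 + sensitivity * residual_level
    return [gain * t for t in target]


def output_signal(environment, target):
    """Combiner: cancellation signal plus noise-compensated signal."""
    cancel = cancellation_signal(environment)
    level = residual_noise_level(environment, cancel)
    compensated = noise_compensated(target, level)
    return [c + s for c, s in zip(cancel, compensated)]
```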
Abstract:
An apparatus for generating an audio output signal to simulate a recording of a virtual microphone at a configurable virtual position in an environment includes a sound events position estimator and an information computation module. The former is adapted to estimate a sound source position indicating a position of a sound source in the environment, wherein the sound events position estimator is adapted to estimate the sound source position based on first and second direction information provided by first and second real spatial microphones, respectively, located at first and second real microphone positions in the environment, respectively. The information computation module is adapted to generate the audio output signal based on a first recorded audio input signal, on the first real microphone position, on the virtual position of the virtual microphone, and on the sound source position.
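The geometry behind the two modules can be sketched in two dimensions. The sound source position is taken as the intersection of the two direction rays, and the virtual microphone signal is obtained by rescaling the first recorded signal under an assumed 1/r amplitude decay. The function names, the 2-D restriction, the decay model and the omission of propagation delay are assumptions made for illustration.

```python
import math


def estimate_source_position(p1, angle1, p2, angle2):
    """Intersect two rays starting at p1 and p2 with directions given in radians."""
    d1 = (math.cos(angle1), math.sin(angle1))
    d2 = (math.cos(angle2), math.sin(angle2))
    # Solve p1 + t1 * d1 = p2 + t2 * d2 for t1 with Cramer's rule.
    rx, ry = p2[0] - p1[0], p2[1] - p1[1]
    det = d2[0] * d1[1] - d1[0] * d2[1]
    if abs(det) < 1e-12:
        raise ValueError("directions are parallel; no unique intersection")
    t1 = (d2[0] * ry - rx * d2[1]) / det
    return (p1[0] + t1 * d1[0], p1[1] + t1 * d1[1])


def virtual_microphone_signal(recorded, real_pos, virtual_pos, source_pos):
    """Rescale the recorded signal for the virtual position (assumed 1/r decay)."""
    d_real = math.dist(real_pos, source_pos)
    d_virtual = math.dist(virtual_pos, source_pos)
    gain = d_real / d_virtual
    return [gain * s for s in recorded]
```

For example, microphones at (0, 0) and (2, 0) that report directions of 45 and 135 degrees place the source near (1, 1); a virtual microphone closer to that point than the first real microphone then receives a proportionally louder signal.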