摘要:
An apparatus for generating a plurality of audio channels for a first speaker setup is characterized by an imaginary speaker determiner, an energy distribution calculator, a processor and a renderer. The imaginary speaker determiner is configured to determine a position of an imaginary speaker not contained in the first speaker setup to obtain a second speaker setup containing the imaginary speaker. The energy distribution calculator is configured to calculate an energy distribution from the imaginary speaker to the other speakers in the second speaker setup. The processor is configured to repeat the energy distribution to obtain a downmix information for a downmix from the second speaker setup to the first speaker setup. The renderer is configured to generate the plurality of audio channels using the downmix information.
摘要:
An audio decoder device for decoding a compressed input audio signal comprising at least one core decoder (6, 24) having one or more processors (36, 36') for generating a processor output signal (37) based on a processor input signal (38, 38'), wherein a number of output channels (37.1, 37.2, 37.1', 37.2') of the processor output signal (37, 37') is higher than a number of input channels (38.1, 38.1') of the processor input signal (38, 38'), wherein each of the one or more processors (36, 36') comprises a decorrelator (39, 39') and a mixer (40, 40'), wherein a core decoder output signal (13) having a plurality of channels (13.1, 13.2, 13.3, 13,4) comprises the processor output signal (37, 37'), and wherein the core decoder output signal (13) is suitable for a reference loudspeaker setup (42); at least one format converter device (9, 10) configured to convert the core decoder output signal (13) into an output audio signal (31), which is suitable for a target loudspeaker setup (45); and a control device (46) configured to control at least one or more processors (36, 36') in such way that the decorrelator (39, 39') of the processor (36, 36') may be controlled independently from the mixer (40, 40') of the processor (36, 36'), wherein the control device (46) is configured to control at least one of the decorrelators (39, 39') of the one or more processors (36, 36') depending on the target loudspeaker setup (45).
摘要:
In multichannel audio coding, an improved coding efficiency is achieved by the following measure: the noise filling of zero-quantized scale factor bands is performed using noise filling sources other than artificially generated noise or spectral replica. In particular, the coding efficiency in multichannel audio coding may be rendered more efficient by performing the noise filling based on noise generated using spectral lines from a previous frame of, or a different channel of the current frame of, the multichannel audio signal.
摘要:
A multi-channel audio decoder for providing at least two output audio signals on the basis of an encoded representation is configured to perform a weighted combination of a downmix signal, a decorrelated signal and a residual signal, to obtain one of the output audio signals. The multi-channel audio decoder is configured to determine a weight describing a contribution of the decorrelated signal in the weighted combination in dependence on the residual signal. A multi-channel audio encoder for providing an encoded representation of a multi-channel audio signal is configured to obtain a downmix signal on the basis of the multi-channel audio signal, to provide parameters describing dependencies between the channels of the multi-channel audio signal, and to provide a residual signal. The multi-channel audio encoder is configured to vary an amount of residual signal included into the encoded representation in dependence on the multi-channel audio signal.
摘要:
An audio signal decoder for providing an upmix signal representation in dependence on a downmix signal representation and an object-related parametric information comprises an object separator configured to decompose the downmix signal representation, to provide a first audio information describing a first set of one or more audio objects of a first audio object type and a second audio information describing a second set of one or more audio objects of a second audio object type, in dependence on the downmix signal representation and using at least a part of the object-related parametric information. The audio signal decoder also comprises an audio signal processor configured to receive the second audio information and to process the second audio information in dependence on the object-related parametric information, to obtain a processed version of the second audio information. The audio signal decoder also comprises an audio signal combiner configured to combine the first audio information with the processed version of the second audio information, to obtain the upmix signal representation.
摘要:
A device for generating a binaural signal based on a multi-channel signal representing a plurality of channels and intended for reproduction by a speaker configuration having a virtual sound source position associated to each channel, is described. It comprises a correlation reducer for differently processing, and thereby reducing a correlation between, at least one of a left and a right channel of the plurality of channels, a front and a rear channel of the plurality of channels, and a center and a non-center channel of the plurality of channels, in order to obtain an inter-similarity reduced set of channels; a plurality of directional filters, a first mixer for mixing outputs of the directional filters modeling the acoustic transmission to the first ear canal of the listener, and a second mixer for mixing outputs of the directional filters modeling the acoustic transmission to the second ear canal of the listener. According to another aspect, a center level reduction for forming the downmix for a room processor is performed. According to even another aspect, an inter-similarity decreasing set of head-related transfer functions is formed.
摘要:
An apparatus for encoding comprises a first domain converter (510), a switchable bypass (50), a second domain converter (410), a first processor (420) and a second processor (520) to obtain an encoded audio signal having different signal portions represented by coded data in different domains, which have been coded by different coding algorithms. Corresponding decoding stages in the decoder together with a bypass for bypassing a domain converter allow the generation of a decoded audio signal with high quality and low bit rate.
摘要:
Zum Ermitteln eines Schätzwerts für einen Bedarf an Informationseinheiten zum Codieren eines Signals wird neben der erlaubten Störung für ein Frequenzband und einer Energie des Frequenzbands zusätzlich ein Maß (nl(b)) für die Verteilung der Energie in dem Frequenzband berücksichtigt (102, 104, 106). Damit wird ein besserer Schätzwert für den Bedarf an Informationseinheiten erhalten, so dass effizienter und genauer codiert werden kann.
摘要:
In order to carry out a flexible signaling of a synchronous mode or of an asynchronous mode during multichannel parameter reconstruction, a parameter configuration instruction is inserted into the data flow and is used by a configuration device on the part of a multichannel decoder for configuring a multichannel reconstruction device. If the parameter configuration instruction has a first meaning, the configuration device searches within its input data for additional items of configuration information whereas, when the parameter configuration instruction has another meaning, the configuration device, based on items of information concerning an encoding algorithm with which transmission channel data have been encoded, carry out a configuration setting of the multichannel reconstruction device whereby efficiently and flexibly ensuring that a correct allocation between parameter data and decoded transmission channel data is always maintained.
摘要:
Bei dem Übergang in den logarithmischen Bereich muss nicht die gesamte Bitbreite des von dem Quadrat des Wertes linear abhängigen Ergebnisses berücksichtigt werden. Vielmehr ist es möglich, bei einem Wert mit x Bits eine Skalierung des Ergebnisses derart vorzunehmen, dass eine Darstellung mit weniger als x Bits des Ergebnisses ausreichend ist, um basierend auf demselben die logarithmische Darstellung zu gewinnen. Die Auswirkung des Skalierungsfaktors auf die sich ergebende logarithmische Darstellung kann durch Addieren oder Subtrahieren eines durch die Logarithmusfunktion angewendet auf den Skalierungsfaktor gewonnenen Korrekturwertes zu bzw. von der skalierten logarithmischen Darstellung ohne Dynamikverlust aufgehoben werden. Auf diese Weise werden ein Verfahren und eine Vorrichtung zum Bilden einer Darstellung eines von einem Quadrat eines Wertes linear abhängigen Ergebnisses geschaffen, so dass die Berechnung einfach bzw. mit wenig Hardwareaufwand möglich ist.