摘要:
A method and a device for processing a stereo signal obtained from an encoder, which codes an N-channel audio signal into spatial parameters (P) and a stereo down-mix comprising first and second stereo signals (L 0 , R 0 ). A first signal and a third signal are added in order to obtain a first output signal (L 0w ), wherein the first signal QL 0wL ) comprises the first stereo signal (L 0 ) modified by a first complex function (g 1 ), and the third signal (L 0wR ) comprises the second stereo signal (R 0 ) modified by a third complex function (g 3 ). A second signal and a fourth signal are added to obtain a second output signal (R 0w ). The fourth signal (R 0wR ) comprises the second stereo signal (R 0 ) modified by a fourth complex function (g 4 ), and the second signal (R 0wL ) comprises the first stereo signal (L 0 ) modified by a second complex function (g 2 ). The complex functions (g 1 ,g 2 ,g 3 ,g 4 ) are functions of the spatial parameters (P) and are chosen such that an energy value of the difference (L 0wL -P 0wL ) between the first signal and the second signal is larger than or equal to the energy value of the sum (L 0wL +R 0wL ) of the first and the second signal and the energy value of the difference (R 0wR -L 0wR ) between the fourth signal and the third signal is larger than or equal to the energy value of the sum (R 0wR +L 0wR ) of the fourth signal and the third signal.
摘要:
A parameter transformer generates level parameters, indicating an energy relation between a first and a second audio channel of a multi-channel audio signal associated to a multi-channel loudspeake configuration. The level parameter are generated based on object parameters for a plurality of audio objects associated to a down-mix channel, which is generated using object audio signals associated to the audio objects. The object parameters comprise an energy parameter indicating an energy of the object audio signal. To derive the coherence and the level parameters, a parameter generator is used, which combines the energy parameter and object rendering parameters, which depend on a desired rendering configuration.
摘要:
An audio encoder (109) has a hierarchical encoding structure and generates a data stream comprising one or more audio channels as well as parametric audio encoding data. The encoder (109) comprises an encoding structure processor (305) which inserts decoder tree structure data into the data stream. The decoder tree structure data comprises at least one data value indicative of a channel split characteristic for an audio channel at a hierarchical layer of the hierarchical decoder structure and may specifically specify the decoder tree structures to be applied by a decoder. A decoder (115) comprises a receiver (401) which receives the data stream and a decoder structure processor (405) for generating the hierarchical decoder structure in response to the decoder tree structure data. A decode processor (403) then generates output audio channels from the data stream using the hierarchical decoder structure.
摘要:
A device (1) for converting a first number (M) of input audio channels into a second, larger number (N) of output audio channels comprises: decorrelation units (3) for decomposing the input audio channels into a set of decorrelated auxiliary channels, at least one upmix unit (4) for combining the decorrelated auxiliary channels into the output audio channels, and at least one pre-processing unit (2) for pre-processing the input audio channels and feeding the pre-processed input audio channels to the decorrelation units (3). The pre-processing unit (2) and the upmix unit (4) are preferably controlled by audio parameters.
摘要:
An audio object coder for generating an encoded object signal using a plurality of audio objects includes a downmix information generator for generating downmix information indicating a distribution of the plurality of audio objects into at least two downmix channels, an audio object parameter generator for generating object parameters for the audio objects, and an output interface for generating the imported audio output signal using the downmix information and the object parameters. An audio synthesizer uses the downmix information for generating output data usable for creating a plurality of output channels of the predefined audio output configuration.
摘要:
An audio object coder for generating an encoded object signal using a plurality of audio objects includes a downmix information generator for generating downmix information indicating a distribution of the plurality of audio objects into at least two downmix channels, an audio object parameter generator for generating object parameters for the audio objects, and an output interface for generating the imported audio output signal using the downmix information and the object parameters. An audio synthesizer uses the downmix information for generating output data usable for creating a plurality of output channels of the predefined audio output configuration.
摘要:
An apparatus for synthesizing a rendered output signal having a first audio channel and a second audio channel includes a decorrelator stage (356) for generating a decorrelator signal based on a downmix signal, and a combiner (364) for performing a weighted combination of the downmix signal and a decorrelated signal based on parametric audio object information (362), downmix information (354) and target rendering information (360). The combiner solves the problem of optimally combining matrixing with decorrelation for a high quality stereo scene reproduction of a number of individual audio objects using a multichannel downmix.
摘要:
For a multi-channel reconstruction of audio signals based on at least one base channel, an energy measure is used for compensating energy losses due to an predictive upmix. The energy measure can be applied in the encoder or the decoder. Furthermore, a decorrelated signal is added to output channels generated by an energy-loss introducing upmix procedure. The energy of the decorrelated signal is smaller than or equal to an energy error introduced by the predictive upmix. Thus, problems occurring for prediction based up-mix methods such as up-mixing signals that are coded with High Frequency Reconstruction techniques are solved, so that the correct correlation between the up-mixed channels is obtained or the up-mix is adapted to arbitrary down-mixes.
摘要:
A multi-channel audio signal having at least three channels can be reconstructed such, that the reconstructed channels are at least partly de-correlated from each other using a downmixed signal derived from an original multi-channel signal and a set of decorrelated signals provided by a de-correlator (101) that derives the set of de-correlated signals from the downmix signal, wherein the de-correlated signals within the set of de-correlated signals are mutually mostly orthogonal to each other, i.e. an orthogonality relation between channel pairs is satisfied within an orthogonality tolerance range.
摘要:
Encoding an audio signal is provided wherein the audio signal includes a first audio channel and a second audio channel, the encoding comprising subband filtering each of the first audio channel and the second audio channel in a complex modulated filterbank to provide a first plurality of subband signals for the first audio channel and a second plurality of subband signals for the second audio channel, downsampling each of the subband signals to provide a first plurality of downsampled subband signals and a second plurality of downsampled subband signals, further subband filtering at least one of the downsampled subband signals in a further filterbank in order to provide a plurality of sub-subband signals, deriving spatial parameters from the sub-subband signals and from those downsampled subband signals that are not further subband filtered, and deriving a single channel audio signal comprising derived subband signals derived from the first plurality of downsampled subband signals and the second plurality of downsampled subband signals. Further, decoding is provided wherein an encoded audio signal comprising an encoded single channel audio signal and a set of spatial parameters is decoded by decoding the encoded single channel audio channel to obtain a plurality of downsampled subband signals, further subband filtering at least one of the downsampled subband signals in a further filterbank in order to provide a plurality of sub-subband signals, and deriving two audio channels from the spatial parameters, the sub-subband signals and those downsampled subband signals that are not further subband filtered.