摘要:
An apparatus for providing an upmix signal representation on the basis of a downmix signal representation and an object-related parametric information, which are included in a bitstream representation of an audio content, and in dependence on a rendering information, has a distortion limiter configured to adjust upmix parameters using a distortion control scheme to avoid or limit audible distortions which are caused by an inappropriate choice of rendering parameters. The distortion limiter is configured to obtain a distortion limitation control parameter, which is included in the bitstream representation of the audio content, and to adjust a distortion control scheme in dependence on the distortion limitation control parameter.
摘要:
Parameters being a measure for a characteristic of a channel or of a pair of channels, wherein the parameter is a measure for a characteristic of the channel or of the pair of channels with respect to another channel of a multi-channel signal can be quantized more efficiently using a quantization rule that is generated based on a relation of an energy measure of the channel or the pair of channels and an energy measure of the multi-channel signal. With generation of the quantization rule taking into account a psycho acoustic approach, the size of an encoded representation of the multi-channel signal can be decreased by coarser quantization without significantly disturbing the perceptual quality of the multi-channel signal when reconstructed from the encoded representation.
摘要:
A method and a device for processing a stereo signal obtained from an encoder, which codes an N-channel audio signal into spatial parameters (P) and a stereo down-mix comprising first and second stereo signals (L 0 , R 0 ). A first signal and a third signal are added in order to obtain a first output signal (L 0w ), wherein the first signal QL 0wL ) comprises the first stereo signal (L 0 ) modified by a first complex function (g 1 ), and the third signal (L 0wR ) comprises the second stereo signal (R 0 ) modified by a third complex function (g 3 ). A second signal and a fourth signal are added to obtain a second output signal (R 0w ). The fourth signal (R 0wR ) comprises the second stereo signal (R 0 ) modified by a fourth complex function (g 4 ), and the second signal (R 0wL ) comprises the first stereo signal (L 0 ) modified by a second complex function (g 2 ). The complex functions (g 1 ,g 2 ,g 3 ,g 4 ) are functions of the spatial parameters (P) and are chosen such that an energy value of the difference (L 0wL -P 0wL ) between the first signal and the second signal is larger than or equal to the energy value of the sum (L 0wL +R 0wL ) of the first and the second signal and the energy value of the difference (R 0wR -L 0wR ) between the fourth signal and the third signal is larger than or equal to the energy value of the sum (R 0wR +L 0wR ) of the fourth signal and the third signal.
摘要:
An audio object coder for generating an encoded object signal using a plurality of audio objects includes a downmix information generator for generating downmix information indicating a distribution of the plurality of audio objects into at least two downmix channels, an audio object parameter generator for generating object parameters for the audio objects, and an output interface for generating the imported audio output signal using the downmix information and the object parameters. An audio synthesizer uses the downmix information for generating output data usable for creating a plurality of output channels of the predefined audio output configuration.
摘要:
An audio object coder for generating an encoded object signal using a plurality of audio objects includes a downmix information generator for generating downmix information indicating a distribution of the plurality of audio objects into at least two downmix channels, an audio object parameter generator for generating object parameters for the audio objects, and an output interface for generating the imported audio output signal using the downmix information and the object parameters. An audio synthesizer uses the downmix information for generating output data usable for creating a plurality of output channels of the predefined audio output configuration.
摘要:
An audio encoder (109) has a hierarchical encoding structure and generates a data stream comprising one or more audio channels as well as parametric audio encoding data. The encoder (109) comprises an encoding structure processor (305) which inserts decoder tree structure data into the data stream. The decoder tree structure data comprises at least one data value indicative of a channel split characteristic for an audio channel at a hierarchical layer of the hierarchical decoder structure and may specifically specify the decoder tree structures to be applied by a decoder. A decoder (115) comprises a receiver (401) which receives the data stream and a decoder structure processor (405) for generating the hierarchical decoder structure in response to the decoder tree structure data. A decode processor (403) then generates output audio channels from the data stream using the hierarchical decoder structure.
摘要:
A device (1) for converting a first number (M) of input audio channels into a second, larger number (N) of output audio channels comprises: decorrelation units (3) for decomposing the input audio channels into a set of decorrelated auxiliary channels, at least one upmix unit (4) for combining the decorrelated auxiliary channels into the output audio channels, and at least one pre-processing unit (2) for pre-processing the input audio channels and feeding the pre-processed input audio channels to the decorrelation units (3). The pre-processing unit (2) and the upmix unit (4) are preferably controlled by audio parameters.