摘要:
Parameters being a measure for a characteristic of a channel or of a pair of channels, wherein the parameter is a measure for a characteristic of the channel or of the pair of channels with respect to another channel of a multi-channel signal can be quantized more efficiently using a quantization rule that is generated based on a relation of an energy measure of the channel or the pair of channels and an energy measure of the multi-channel signal. With generation of the quantization rule taking into account a psycho acoustic approach, the size of an encoded representation of the multi-channel signal can be decreased by coarser quantization without significantly disturbing the perceptual quality of the multi-channel signal when reconstructed from the encoded representation.
摘要:
On an encoder-side, a multi-channel input signal is analyzed for obtaining smoothing control information, which is to be used by a decoder-side multi-channel synthesis for smoothing quantized transmitted parameters or values derived from the quantized transmitted parameters for providing an improved subjective audio quality in particular for slowly moving point sources and rapidly moving point sources having tonal material such as fast moving sinusoids.
摘要:
A parameter transformer generates level parameters, indicating an energy relation between a first and a second audio channel of a multi-channel audio signal associated to a multi-channel loudspeake configuration. The level parameter are generated based on object parameters for a plurality of audio objects associated to a down-mix channel, which is generated using object audio signals associated to the audio objects. The object parameters comprise an energy parameter indicating an energy of the object audio signal. To derive the coherence and the level parameters, a parameter generator is used, which combines the energy parameter and object rendering parameters, which depend on a desired rendering configuration.
摘要:
A parameter transformer generates level parameters, indicating an energy relation between a first and a second audio channel of a multi-channel audio signal associated to a multi-channel loudspeake configuration. The level parameter are generated based on object parameters for a plurality of audio objects associated to a down-mix channel, which is generated using object audio signals associated to the audio objects. The object parameters comprise an energy parameter indicating an energy of the object audio signal. To derive the coherence and the level parameters, a parameter generator is used, which combines the energy parameter and object rendering parameters, which depend on a desired rendering configuration.
摘要:
A parametric representation of a multi-channel audio signal describes the spatial properties of the audio signal well with compact side information when a coherence information, describing the coherence between a first and a second channel, is derived within a hierarchical encoding process only for channel pairs including a first channel having only information of a left side with respect to a listening position and including a second channel having only information from a right side with respect to a listening position. As within the hierarchical process the multiple audio channels of the audio signal are downmixed iteratively into monophonic channels, one can pick the relevant parameters from an encoding step involving only channel pairs carrying the information needed to describe the spatial properties of the multi-channel audio signal.
摘要:
A better compromise between a too high bitstream and decoding overhead on the one hand and flexibility of frame element positioning on the other hand is achieved by arranging that each of the sequence of frames of the bitstream comprises a sequence of N frame elements and, on the other hand, the bitstream comprises a configuration block comprising a field indicating the number of elements N and a type indication syntax portion indicating, for each element position of the sequence of N element positions, an element type out of a plurality of element types with, in the sequences of N frame elements of the frames, each frame element being of the element type indicated, by the type indication portion, for the respective element position at which the respective frame element is positioned within the sequence of N frame elements of the respective frame in the bitstream. Thus, the frames are equally structured in that each frame comprises the same sequence of N frame elements of the frame element type indicated by the type indication syntax portion, positioned within the bitstream in the same sequential order. This sequential order is commonly adjustable for the sequence of frames by use of the type indication syntax portion which indicates, for each element position of the sequence of N element positions, an element type out of a plurality of element types.
摘要:
Frame elements which shall be made available for skipping may are transmitted more efficiently by arranging that a default payload length information is transmitted separately within a configuration block, with the length information within the frame elements, in turn, being subdivided into a default payload length flag followed, if the default payload length flag is not set, by a payload length value explicitly coding the payload length of the respective frame element. However, if the default payload length flag is set, an explicit transmission of the payload length may be avoided. Rather, any frame element, the default extension payload length flag of which is set, has the default payload length and any frame element, the default extension payload length flag of which is not set, has a payload length corresponding to the payload length value. By this measure, transmission effectiveness is increased.
摘要:
On an encoder-side, a multi-channel input signal is analyzed for obtaining smoothing control information, which is to be used by a decoder-side multi-channel synthesis for smoothing quantized transmitted parameters or values derived from the quantized transmitted parameters for providing an improved subjective audio quality in particular for slowly moving point sources and rapidly moving point sources having tonal material such as fast moving sinusoids.
摘要:
An apparatus for providing an upmix signal representation on the basis of a downmix signal representation and an object-related parametric information, which are included in a bitstream representation of an audio content, in independence on a user-specified rendering matrix, the apparatus has a distortion limiter configured to obtain a modified rendering matrix using a linear combination of a user-specified rendering matrix in a target rendering matrix in dependence on a linear combination parameter. The apparatus also has a signal processor configured to obtain the upmix signal representation on the basis of the downmix signal representation and the object-related parametric information using the modified rendering matrix. The apparatus is also configured to evaluate a bitstream element representing the linear combination parameter in order to obtain the linear combination parameter.