摘要:
An audio signal encoder (600) for providing a downmix signal representation (614) and an object-related parametric information (616) on the basis of a plurality of object signals (x 1 to x N ) comprises a downmixer (620) configured to provide one or more downmix signals in dependence on downmix coefficients (d 1 to d N ) associated with the object signals (x 1 to x N ), such that the one or more downmix signals comprise a superposition of a plurality of object signals, and a side information provider (630) configured to provide an inter-object-relationship side information (OLD, IOC) describing level differences and correlation characteristics of object signals (x 1 to x N ) and an individual-object side information describing one or more individual properties of the individual object signals (x 1 to x N ).
摘要:
An apparatus for providing an upmix signal representation on the basis of a downmix signal representation and an object-related parametric information, which are included in a bitstream representation of an audio content, in independence on a user-specified rendering matrix, the apparatus has a distortion limiter configured to obtain a modified rendering matrix using a linear combination of a user-specified rendering matrix in a target rendering matrix in dependence on a linear combination parameter. The apparatus also has a signal processor configured to obtain the upmix signal representation on the basis of the downmix signal representation and the object-related parametric information using the modified rendering matrix. The apparatus is also configured to evaluate a bitstream element representing the linear combination parameter in order to obtain the linear combination parameter.
摘要:
A parameter transformer generates level parameters, indicating an energy relation between a first and a second audio channel of a multi-channel audio signal associated to a multi-channel loudspeake configuration. The level parameter are generated based on object parameters for a plurality of audio objects associated to a down-mix channel, which is generated using object audio signals associated to the audio objects. The object parameters comprise an energy parameter indicating an energy of the object audio signal. To derive the coherence and the level parameters, a parameter generator is used, which combines the energy parameter and object rendering parameters, which depend on a desired rendering configuration.
摘要:
An apparatus for providing an upmix signal representation on the basis of a downmix signal representation and an object-related parametric information, which are included in a bitstream representation of an audio content, in independence on a user-specified rendering matrix, the apparatus has a distortion limiter configured to obtain a modified rendering matrix using a linear combination of a user-specified rendering matrix in a target rendering matrix in dependence on a linear combination parameter. The apparatus also has a signal processor configured to obtain the upmix signal representation on the basis of the downmix signal representation and the object-related parametric information using the modified rendering matrix. The apparatus is also configured to evaluate a bitstream element representing the linear combination parameter in order to obtain the linear combination parameter.
摘要:
Frame elements which shall be made available for skipping may are transmitted more efficiently by arranging that a default payload length information is transmitted separately within a configuration block, with the length information within the frame elements, in turn, being subdivided into a default payload length flag followed, if the default payload length flag is not set, by a payload length value explicitly coding the payload length of the respective frame element. However, if the default payload length flag is set, an explicit transmission of the payload length may be avoided. Rather, any frame element, the default extension payload length flag of which is set, has the default payload length and any frame element, the default extension payload length flag of which is not set, has a payload length corresponding to the payload length value. By this measure, transmission effectiveness is increased.
摘要:
On an encoder-side, a multi-channel input signal is analyzed for obtaining smoothing control information, which is to be used by a decoder-side multi-channel synthesis for smoothing quantized transmitted parameters or values derived from the quantized transmitted parameters for providing an improved subjective audio quality in particular for slowly moving point sources and rapidly moving point sources having tonal material such as fast moving sinusoids.
摘要:
An audio encoder and an audio decoder are based on a combination of two audio channels (201, 202) to obtain a first combination signal (204) as a mid signal and a residual signal (205) which can be derived using a predicted side signal derived from the mid signal. The first combination signal and the prediction residual signal are encoded (209) and written (212) into a data stream (213) together with the prediction information (206) derived by an optimizer (207) based on an optimization target (208). A decoder uses the prediction residual signal, the first combination signal and the prediction information to derive a decoded first channel signal and a decoded second channel signal. In an encoder example or in a decoder example, a real-to-imaginary transform can be applied for estimating the imaginary part of the spectrum of the first combination signal. For calculating the prediction signal used in the derivation of the prediction residual signal, the real-valued first combination signal is multiplied by a real portion of the complex prediction information and the estimated imaginary part of the first combination signal is multiplied by an imaginary portion of the complex prediction information.
摘要:
A better compromise between a too high bitstream and decoding overhead on the one hand and flexibility of frame element positioning on the other hand is achieved by arranging that each of the sequence of frames of the bitstream comprises a sequence of N frame elements and, on the other hand, the bitstream comprises a configuration block comprising a field indicating the number of elements N and a type indication syntax portion indicating, for each element position of the sequence of N element positions, an element type out of a plurality of element types with, in the sequences of N frame elements of the frames, each frame element being of the element type indicated, by the type indication portion, for the respective element position at which the respective frame element is positioned within the sequence of N frame elements of the respective frame in the bitstream. Thus, the frames are equally structured in that each frame comprises the same sequence of N frame elements of the frame element type indicated by the type indication syntax portion, positioned within the bitstream in the same sequential order. This sequential order is commonly adjustable for the sequence of frames by use of the type indication syntax portion which indicates, for each element position of the sequence of N element positions, an element type out of a plurality of element types.
摘要:
An apparatus for providing an upmix signal representation on the basis of a downmix signal representation and an object-related parametric information, which are included in a bitstream representation of an audio content, and in dependence on a rendering information, has a distortion limiter configured to adjust upmix parameters using a distortion control scheme to avoid or limit audible distortions which are caused by an inappropriate choice of rendering parameters. The distortion limiter is configured to obtain a distortion limitation control parameter, which is included in the bitstream representation of the audio content, and to adjust a distortion control scheme in dependence on the distortion limitation control parameter.