摘要:
Parameters being a measure for a characteristic of a channel or of a pair of channels, wherein the parameter is a measure for a characteristic of the channel or of the pair of channels with respect to another channel of a multi-channel signal can be quantized more efficiently using a quantization rule that is generated based on a relation of an energy measure of the channel or the pair of channels and an energy measure of the multi-channel signal. With generation of the quantization rule taking into account a psycho acoustic approach, the size of an encoded representation of the multi-channel signal can be decreased by coarser quantization without significantly disturbing the perceptual quality of the multi-channel signal when reconstructed from the encoded representation.
摘要:
A selected channel of a multi-channel signal represented by frames composed from sampling values having a high time resolution is provided that can be encoded with higher quality when a wave form parameter representation representing a wave form of an intermediate resolution representation of the selected channel is derived. The wave form parameter representation with the intermediate resolution can be used to shape a reconstructed channel to retrieve a channel having a signal envelope close to a selected original channel. The time scale on which the shaping is performed is shorter than the time scale of a framewise processing, thus enhancing the quality of the reconstructed channel. On the other hand, the shaping time scale is larger than the time scale of the sampling values, significantly reducing the amount of data needed by the wave form parameter representation.
摘要:
A parameter transformer generates level parameters, indicating an energy relation between a first and a second audio channel of a multi-channel audio signal associated to a multi-channel loudspeake configuration. The level parameter are generated based on object parameters for a plurality of audio objects associated to a down-mix channel, which is generated using object audio signals associated to the audio objects. The object parameters comprise an energy parameter indicating an energy of the object audio signal. To derive the coherence and the level parameters, a parameter generator is used, which combines the energy parameter and object rendering parameters, which depend on a desired rendering configuration.
摘要:
A parameter transformer generates level parameters, indicating an energy relation between a first and a second audio channel of a multi-channel audio signal associated to a multi-channel loudspeake configuration. The level parameter are generated based on object parameters for a plurality of audio objects associated to a down-mix channel, which is generated using object audio signals associated to the audio objects. The object parameters comprise an energy parameter indicating an energy of the object audio signal. To derive the coherence and the level parameters, a parameter generator is used, which combines the energy parameter and object rendering parameters, which depend on a desired rendering configuration.
摘要:
There is disclosed audio synthesizer (300) for generating a synthesis signal (336) from a downmix signal (324, x) having a number of downmix channels, the synthesis signal (336) having a number of synthesis channels, the downmix signal (324, x) being a downmixed version of an original signal (212) having a number of original channels, the audio synthesizer (300) comprising: a first path (610c') including: a first mixing matrix block (600c) configured for synthesizing a first component (336M') of the synthesis signal according to a first mixing matrix (MM) calculated from: a covariance matrix (CYR) associated to the synthesis signal (212); and a covariance matrix (Cx) associated to the downmix signal (324),
a second path (610c) for synthesizing a second component (336R') of the synthesis signal, wherein the second component (336R') is a residual component, the second path (610c) including: a prototype signal block (612c) configured for upmixing the downmix signal (324) from the number of downmix channels to the number of synthesis channels; a decorrelator (614c) configured for decorrelating the upmixed prototype signal (613c); a second mixing matrix block (618c) configured for synthesizing the second component (336R') of the synthesis signal according to a second mixing matrix (MR) from the decorrelated version (615c) of the downmix signal (324), the second mixing matrix (MR) being a residual mixing matrix,
wherein the audio synthesizer (300) is configured to calculate (618c) the second mixing matrix (MR) from: the residual covariance matrix (Cr) provided by the first mixing matrix block(600c); and an estimate of the covariance matrix of the decorrelated prototype signals (Cy ) obtained from the covariance matrix (Cx) associated to the downmix signal (324),
wherein the audio synthesizer (300) further comprises an adder block (620c) for summing the first component (336M') of the synthesis signal with the second component (336R') of the synthesis signal.
摘要:
A bandwidth extension decoder (500), (600) for providing a bandwidth extended audio signal (532) based on an input audio signal (502) and a parameter signal (504), wherein the parameter signal (504) comprises an indication of an offset frequency and an indication of a power density parameter, comprises: a patch generator (510) configured to generate a bandwidth extension high-frequency signal (512) comprising a high-frequency band, wherein the high-frequency band of the bandwidth extension high-frequency signal (512) is generated based on a frequency shift of a frequency band of the input audio signal (502), wherein the frequency shift is based on the offset frequency, and wherein the patch generator (510) is configured to amplify or attenuate the high-frequency band of the bandwidth extension high-frequency signal (512) by a factor equal to the value of the power density parameter or equal to the reciprocal value of the power density parameter, respectively; a combiner (529) configured to combine the bandwidth extension high-frequency signal (512) and the input audio signal (502) to obtain the bandwidth extended audio signal (532); and an output interface (530) configured to provide the bandwidth extended audio signal (532) .