摘要:
An apparatus for upmixing a downmix audio signal describing one or more downmix audio channels into an upmixed audio signal describing a plurality of upmixed audio channels includes an upmixer and a parameter determinator. The upmixer is configured to apply temporally variable upmix parameters to upmix the downmix audio signal in order to obtain the upmixed audio signal, wherein the temporally variable upmix parameters include temporally variable smoothened phase values. The parameter determinator is configured to obtain one or more temporally smoothened upmix parameters for usage by the upmixer on the basis of a quantized upmix parameter input information. The parameter determinator is configured to combine a scaled version of a previous smoothened phase value with a scaled version of an input phase information using a phase change limitation algorithm, to determine a current smoothened phase value on the basis of the previous smoothened phase value and the phase input information.
摘要:
An apparatus for upmixing a downmix audio signal describing one or more downmix audio channels into an upmixed audio signal describing a plurality of upmixed audio channels includes an upmixer configured to apply temporally variable upmixing parameters to upmix the downmix audio signal in order to obtain the upmixed audio signal. The apparatus also includes a parameter interpolator, wherein the parameter interpolator is configured to obtain one or more temporally interpolated upmix parameters to be used by the upmixer on the basis of a first complex-valued upmix parameter and a subsequent second complex-valued upmix parameter. The parameter interpolator is configured to separately interpolate between a magnitude value of the first complex-valued upmix parameter and a magnitude value of the second complex-valued upmix parameter, and between a phase value of the first complex-valued upmix parameter and a phase value of the second complex-valued upmix parameter, to obtain the one or more temporally interpolated upmix parameters. A respective method can be implemented, for example, as a computer program.
摘要:
An audio decoder for decoding a multi-audio-object signal having an audio signal of a first type and an audio signal of a second type encoded therein is described, the multi-audio-object signal having a downmix signal and side information, the side information having level information of the audio signals of the first and second types in a first predetermined time/frequency resolution, and a residual signal specifying residual level values in a second predetermined time/frequency resolution, the audio decoder having a processor for computing prediction coefficients based on the level information; and an up-mixer for up-mixing the downmix signal based on the prediction coefficients and the residual signal to obtain a first up-mix audio signal approximating the audio signal of the first type and/or a second up-mix audio signal approximating the audio signal of the second type.
摘要:
Binaural rendering a multi-channel audio signal into a binaural output signal is described. The multi-channel audio signal has a stereo downmix signal into which a plurality of audio signals are downmixed, and side information having a downmix information, as well as object level information of the plurality of audio signals and inter-object cross correlation information. Based on a first rendering prescription, a preliminary binaural signal is computed from the first and second channels of the stereo downmix signal. A decorrelated signal is generated as an perceptual equivalent to a mono downmix of the first and second channels of the stereo downmix signal being, however, decorrelated to the mono downmix. Depending on a second rendering prescription, a corrective binaural signal is computed from the decorrelated signal and the preliminary binaural signal is mixed with the corrective binaural signal to obtain the binaural output signal.
摘要:
An audio decoder for decoding a multi-audio-object signal having an audio signal of a first type and an audio signal of a second type encoded therein is described, the multi-audio-object signal having a downmix signal and side information, the side information having level information of the audio signals of the first and second types in a first predetermined time/frequency resolution, and a residual signal specifying residual level values in a second predetermined time/frequency resolution, the audio decoder having a processor for computing prediction coefficients based on the level information; and an up-mixer for up-mixing the downmix signal based on the prediction coefficients and the residual signal to obtain a first up-mix audio signal approximating the audio signal of the first type and/or a second up-mix audio signal approximating the audio signal of the second type.
摘要:
A method for decoding a multi-audio-object signal having audio signals of first and second types encoded therein, the multi-audio-object signal having a downmix signal and side information having level information of the audio signals of the first and second types in a first predetermined time/frequency resolution, the method including computing a prediction coefficient matrix C based on the level information; and up-mixing the downmix signal based on the prediction coefficients to obtain a first and/or a second up-mix audio signal approximating the audio signals of the first and second types, respectively, wherein up-mixing yields the first and/or second up-mix signals S1 and S2 from the downmix signal d according to a computation representable by ( S 1 S 2 ) = D - 1 { ( 1 C ) d + H } , with “1” denoting—depending on the number of channels of d—a scalar, or an identity matrix, and D−1 being a matrix uniquely determined by a downmix prescription according to which the audio signals of the first and second types are downmixed into the downmix signal, and which is also included by the side information, and H being a term independent from d.