摘要:
A parameter transformer generates level parameters, indicating an energy relation between a first and a second audio channel of a multi-channel audio signal associated to a multi-channel loudspeake configuration. The level parameter are generated based on object parameters for a plurality of audio objects associated to a down-mix channel, which is generated using object audio signals associated to the audio objects. The object parameters comprise an energy parameter indicating an energy of the object audio signal. To derive the coherence and the level parameters, a parameter generator is used, which combines the energy parameter and object rendering parameters, which depend on a desired rendering configuration.
摘要:
A headphone down mix signal (314) can be efficiently derived from a parametric down mix of a multi-channel signal (312), when modified HRTFs (310) (head related transfer functions) are derived from HRTFs (308) of a multi-channel signal using a level parameter (306) having information on a level relation between two channels of the multi-channel signals such that a modified HRTF (310) is stronger influenced by the HRTF (308) of a channel having a higher level than by the HRTF (308) of a channel having a lower level. Modified HRTFs (310) are derived within the decoding process taking into account the relative strength of the channels associated to the HRTFs (308). The HRTFs (308) are thus modified such that a down mix signal (314) of a parametric representation of a multi-channel signal can directly be used to synthesize the headphone down mix signal (314) without the need of an intermediate full parametric multi-channel reconstruction of the parametric down mix.
摘要:
Binaural rendering a multi-channel audio signal into a binaural output signal (24) is described. The multi-channel audio signal comprises a stereo downmix signal (18) into which a plurality of audio signals are downmixed, and side information comprising a downmix information (DMG, DCLD) indicating, for each audio signal, to what extent the respective audio signal has been mixed into a first channel and a second channel of the stereo downmix signal (18), respectively, as well as object level information of the plurality of audio signals and inter-object cross correlation information describing similarities between pairs of audio signals of the plurality of audio signals. Based on a first rendering prescription, a preliminary binaural signal (54) is computed from the first and second channels of the stereo downmix signal (18). A decorrelated signal (X n,k d) is generated as an perceptual equivalent to a mono downmix (58) of the first and second channels of the stereo downmix signal (18) being, however, decorrelated to the mono downmix (58). Depending on a second rendering prescription (P2 1,m ), a corrective binaural signal (64) is computed from the decorrelated signal (62) and the preliminary binaural signal (54) is mixed with the corrective binaural signal (64) to obtain the binaural output signal (24).
摘要:
An intermediate channel representation of a multi-channel signal can be reconstructed highly efficient and with high fidelity, when upmix parameters for upmixing a transmitted downmix signal to the intermediate channel representation are derived that allow for an upmix using the same upmixing algorithms as within the multi-channel reconstruction. This can be achieved when a parameter re-calculator is used to derive the upmix parameters that takes into account also parameters having information on channels that are not included in the intermediate channel representation.
摘要:
A headphone down mix signal (314) can be efficiently derived from a parametric down mix of a multi-channel signal (312), when modified HRTFs (310) (head related transfer functions) are derived from HRTFs (308) of a multi-channel signal using a level parameter (306) having information on a level relation between two channels of the multi-channel signals such that a modified HRTF (310) is stronger influenced by the HRTF (308) of a channel having a higher level than by the HRTF (308) of a channel having a lower level. Modified HRTFs (310) are derived within the decoding process taking into account the relative strength of the channels associated to the HRTFs (308). The HRTFs (308) are thus modified such that a down mix signal (314) of a parametric representation of a multi-channel signal can directly be used to synthesize the headphone down mix signal (314) without the need of an intermediate full parametric multi-channel reconstruction of the parametric down mix.
摘要:
An audio object coder for generating an encoded object signal using a plurality of audio objects includes a downmix information generator for generating downmix information indicating a distribution of the plurality of audio objects into at least two downmix channels, an audio object parameter generator for generating object parameters for the audio objects, and an output interface for generating the imported audio output signal using the downmix information and the object parameters. An audio synthesizer uses the downmix information for generating output data usable for creating a plurality of output channels of the predefined audio output configuration.
摘要:
An audio object coder for generating an encoded object signal using a plurality of audio objects includes a downmix information generator for generating downmix information indicating a distribution of the plurality of audio objects into at least two downmix channels, an audio object parameter generator for generating object parameters for the audio objects, and an output interface for generating the imported audio output signal using the downmix information and the object parameters. An audio synthesizer uses the downmix information for generating output data usable for creating a plurality of output channels of the predefined audio output configuration.
摘要:
A filter unit (102) for generating new subband filter impulse responses from input subband filter impulse responses comprises a processor (820) for examining the input filter impulse responses from at least two input subband filter input responses to find input filter impulse responses having higher values, and at least one filter impulse response having a value being lower than the higher values, and a filter calculator (305) for generating said new subband filter impulse responses using the filter impulse response values having the higher values, wherein said new subband filter impulse responses do not include the input filter impulse responses having the lower value or comprise zero-valued filter impulse responses corresponding to filter impulse responses having the lower value.
摘要:
A synthesizer for generating a decorrelation signal using an input signal is operative on a plurality of subband signals, wherein a subband signal includes a sequence of at least two subband samples, the sequence of the subband samples representing a bandwidth of the subband signal, which is smaller than a bandwidth of the input signal. The synthesizer includes a filter stage (201) for filtering each subband signal using a reverberation filter to obtain a plurality of reverberated subband signals, wherein a plurality of reverberated subband signals together represent the decorrelation signal. This decorrelation signal is used for reconstructing a signal based on a parametrically encoded stereo signal consisting of a mono signal and a coherence measure.