Abstract:
Binaural rendering of a multi-channel audio signal into a binaural output signal is described. The multi-channel audio signal comprises a stereo downmix signal, into which a plurality of audio signals are downmixed, and side information comprising downmix information, object level information of the plurality of audio signals, and inter-object cross-correlation information. Based on a first rendering prescription, a preliminary binaural signal is computed from the first and second channels of the stereo downmix signal. A decorrelated signal is generated as a perceptual equivalent to a mono downmix of the first and second channels of the stereo downmix signal that is, however, decorrelated from the mono downmix. Depending on a second rendering prescription, a corrective binaural signal is computed from the decorrelated signal, and the preliminary binaural signal is mixed with the corrective binaural signal to obtain the binaural output signal.
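The signal flow in this abstract can be sketched roughly as follows. This is a minimal illustration, not the patented method: the names `render_binaural`, `first_matrix`, `second_gains`, and `delay_decorrelator` are hypothetical, the rendering prescriptions are passed in as fixed numbers (the abstract derives them from the side information, which is not modeled here), and a plain delay stands in for a real decorrelator.

```python
def delay_decorrelator(mono, delay=2):
    """Stand-in decorrelator (assumption): a plain delay, zero-padded at the start."""
    return ([0.0] * delay + list(mono))[:len(mono)]


def render_binaural(left, right, first_matrix, second_gains,
                    decorrelate=delay_decorrelator):
    """Mix a preliminary and a corrective binaural signal (illustrative only).

    first_matrix: hypothetical 2x2 gains standing in for the first rendering
                  prescription.
    second_gains: hypothetical per-ear gains standing in for the second
                  rendering prescription.
    """
    # Preliminary binaural signal: 2x2 matrix applied to the stereo downmix.
    pre_l = [first_matrix[0][0] * l + first_matrix[0][1] * r
             for l, r in zip(left, right)]
    pre_r = [first_matrix[1][0] * l + first_matrix[1][1] * r
             for l, r in zip(left, right)]

    # Mono downmix of both channels, then its decorrelated counterpart.
    mono = [l + r for l, r in zip(left, right)]
    decorr = decorrelate(mono)

    # Corrective binaural signal: per-ear gains on the decorrelated signal.
    corr_l = [second_gains[0] * d for d in decorr]
    corr_r = [second_gains[1] * d for d in decorr]

    # Binaural output: preliminary plus corrective signal, per ear.
    out_l = [p + c for p, c in zip(pre_l, corr_l)]
    out_r = [p + c for p, c in zip(pre_r, corr_r)]
    return out_l, out_r
```

With an identity `first_matrix` and zero `second_gains`, the output equals the stereo downmix, which makes the corrective path easy to check in isolation.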
Abstract:
A device is described for generating a binaural signal based on a multi-channel signal that represents a plurality of channels and is intended for reproduction by a speaker configuration having a virtual sound source position associated with each channel. It includes a correlation reducer for differently processing, and thereby reducing a correlation between, at least one of a left and a right channel of the plurality of channels, a front and a rear channel of the plurality of channels, and a center and a non-center channel of the plurality of channels, in order to obtain an inter-similarity reduced set of channels; a plurality of directional filters; a first mixer for mixing outputs of the directional filters modeling the acoustic transmission to the first ear canal of the listener; and a second mixer for mixing outputs of the directional filters modeling the acoustic transmission to the second ear canal of the listener. According to another aspect, a center level reduction is performed when forming the downmix for a room processor. According to yet another aspect, an inter-similarity decreasing set of head-related transfer functions is formed.
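The chain of correlation reducer, directional filters, and two mixers might look like the sketch below. Everything here is an assumption made for illustration: the function names, the use of per-channel delays as the "different processing", and the short FIR impulse responses standing in for directional filters. The abstract does not specify any of these.

```python
def fir(signal, taps):
    """Direct-form FIR filter; output has the same length as the input."""
    out = [0.0] * len(signal)
    for n in range(len(signal)):
        for k, tap in enumerate(taps):
            if n - k >= 0:
                out[n] += tap * signal[n - k]
    return out


def delay(signal, samples):
    """Delay a signal by a whole number of samples, zero-padded at the start."""
    return ([0.0] * samples + list(signal))[:len(signal)]


def binauralize(channels, delays, ear_filters):
    """channels:    {name: samples}, all of equal length.
    delays:      {name: delay in samples}; a hypothetical stand-in for the
                 correlation reducer, which processes channels differently.
    ear_filters: {name: (taps_first_ear, taps_second_ear)}; hypothetical
                 directional filters, one pair per channel.
    """
    # Correlation reducer: treat the channels differently from one another.
    reduced = {name: delay(sig, delays.get(name, 0))
               for name, sig in channels.items()}

    length = len(next(iter(channels.values())))
    first_ear = [0.0] * length
    second_ear = [0.0] * length
    for name, sig in reduced.items():
        taps_first, taps_second = ear_filters[name]
        # First mixer: sum of all filters modeling the path to the first ear.
        for i, v in enumerate(fir(sig, taps_first)):
            first_ear[i] += v
        # Second mixer: sum of all filters modeling the path to the second ear.
        for i, v in enumerate(fir(sig, taps_second)):
            second_ear[i] += v
    return first_ear, second_ear
```

A pure delay only lowers the zero-lag correlation between two otherwise identical channels; it is used here because it keeps the example short and easy to follow.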
Abstract:
For transient audio input signals in a multi-channel audio reconstruction, uncorrelated output signals are generated from an audio input signal by mixing the audio input signal with a representation of the audio input signal delayed by a delay time. In a first time interval, a first output signal corresponds to the audio input signal and a second output signal corresponds to the delayed representation of the audio input signal; in a second time interval, the first output signal corresponds to the delayed representation of the audio input signal and the second output signal corresponds to the audio input signal.
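The interval-wise swapping described here can be illustrated with a small sketch. The function name `swap_decorrelate` and the choice of fixed-length, strictly alternating intervals are assumptions; the abstract only states that the roles of the direct and the delayed signal are exchanged between a first and a second time interval.

```python
def swap_decorrelate(signal, delay, interval):
    """Produce two outputs that alternate between the input and a delayed copy.

    delay:    delay time in samples (assumed to be a whole number of samples).
    interval: length in samples of each time interval (hypothetical parameter).
    """
    # Delayed representation of the input, zero-padded at the start.
    delayed = ([0.0] * delay + list(signal))[:len(signal)]

    first_out, second_out = [], []
    for n, (direct, late) in enumerate(zip(signal, delayed)):
        if (n // interval) % 2 == 0:
            # First interval: output 1 is the input, output 2 the delayed copy.
            first_out.append(direct)
            second_out.append(late)
        else:
            # Second interval: the roles are swapped.
            first_out.append(late)
            second_out.append(direct)
    return first_out, second_out
```

For example, `swap_decorrelate([1, 2, 3, 4, 5, 6], delay=1, interval=2)` returns `([1, 2, 2, 3, 5, 6], [0.0, 1, 3, 4, 4, 5])`. A practical system would likely cross-fade at the interval boundaries rather than switch abruptly, but the abstract does not address that.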
Abstract:
A device for generating an encoded stereo signal from a multi-channel representation includes a multi-channel decoder that generates three or more multi-channels from at least one basic channel and parametric information. The three or more multi-channels are subjected to headphone signal processing to generate an uncoded first stereo channel and an uncoded second stereo channel, which are then supplied to a stereo encoder to generate an encoded stereo file on the output side. The encoded stereo file may be supplied to any suitable player, such as a CD player or a hardware player, so that a user of the player gets not only a normal stereo impression but also a multi-channel impression.
Abstract:
The present document relates to the processing of multimedia data, notably the encoding, transmission, decoding, and rendering of multimedia data, e.g. audio files or bitstreams. In particular, it relates to the implementation of loudness control in multimedia players. A method for providing loudness related data to a media player is described. The method comprises the steps of: providing a first loudness related value associated with an audio signal, wherein the first loudness related value has been determined according to a first procedure; converting the first loudness related value into a second loudness related value using a reversible relation, wherein the second loudness related value is associated with a second procedure for determining loudness related values; storing the second loudness related value in metadata associated with the audio signal; and providing the metadata to the media player.
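The four method steps can be sketched as follows. The reversible relation used here, a constant offset in dB, is purely an assumption chosen because it is trivially invertible; the abstract does not say which relation or which loudness procedures are meant. The names `OFFSET_DB`, `to_second_value`, `to_first_value`, `store_loudness`, and the metadata key are likewise hypothetical.

```python
# Hypothetical offset between the two loudness procedures, in dB (assumption).
OFFSET_DB = -5.0


def to_second_value(first_value_db, offset_db=OFFSET_DB):
    """Convert a value from the first procedure into the second procedure's scale."""
    return first_value_db + offset_db


def to_first_value(second_value_db, offset_db=OFFSET_DB):
    """Inverse of to_second_value; this is what makes the relation reversible."""
    return second_value_db - offset_db


def store_loudness(metadata, first_value_db):
    """Return a copy of the metadata with the converted loudness value stored."""
    updated = dict(metadata)
    updated["loudness_second_procedure_db"] = to_second_value(first_value_db)
    return updated


# Steps of the method: provide the first value, convert it, store it in
# metadata, and hand the metadata to the media player.
track_metadata = store_loudness({"title": "example"}, first_value_db=-23.0)
```

Because the relation is reversible, a player that prefers the first procedure can recover the original value with `to_first_value(track_metadata["loudness_second_procedure_db"])`.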