Abstract:
A decoder (115) generates a multi-channel audio signal, such as a surround sound signal, from a received first signal. The multi-channel signal comprises a second set of audio channels and the first signal comprises a first set of audio channels. The decoder (115) comprises a receiver (401) which receives the first signal. The receiver (401) is coupled to an estimate processor (405) which generates estimated parametric data for the second set of audio channels in response to characteristics of the first set of audio channels. The estimated parametric data relates characteristics of the second set of audio channels to characteristics of the first set of audio channels. The decoder (115) furthermore comprises a spatial audio decoder (403) which decodes the first signal in response to the estimated parametric data to generate the multi-channel signal comprising the second set of channels. The invention allows spatial audio decoding to be used with signals that were not encoded by a spatial audio encoder.
Abstract:
A multi-channel audio encoder (10) for encoding a multi-channel audio signal (101), e.g. a 5.1 channel audio signal, into a spatial down-mix (102), e.g. a stereo signal, and associated parameters (104, 105). The encoder (10) comprises first and second units (110, 120). The first unit (110) encodes the multi-channel audio signal (101) into the spatial down-mix (102) and parameters (104). These parameters (104) enable a multi-channel decoder (20) to reconstruct the multi-channel audio signal (203) from the spatial down-mix (102). The second unit (120) generates, from the spatial down-mix (102), parameters (105) that enable the decoder to reconstruct the spatial down-mix (202) from an alternative down-mix (103), e.g. a so-called artistic down-mix that has been manually mixed in a sound studio. In this way, the decoder (20) can efficiently deal with a situation in which an alternative down-mix (103) is received instead of the regular spatial down-mix (102). In the decoder (20), the spatial down-mix (202) is first reconstructed from the alternative down-mix (103) and the parameters (105). Next, the spatial down-mix (202) is decoded into the multi-channel audio signal (203).
Abstract:
An apparatus for generating a binaural audio signal includes a de-multiplexer and decoder which receive audio data comprising an M-channel audio signal, which is a downmix of an N-channel audio signal, and spatial parameter data for upmixing the M-channel audio signal to the N-channel audio signal. A conversion processor converts spatial parameters of the spatial parameter data into first binaural parameters in response to at least one binaural perceptual transfer function. A matrix processor converts the M-channel audio signal into a first stereo signal in response to the first binaural parameters. A stereo filter generates the binaural audio signal by filtering the first stereo signal. The filter coefficients for the stereo filter are determined by a coefficient processor in response to the at least one binaural perceptual transfer function. The combination of parameter conversion/processing and filtering allows a high quality binaural signal to be generated with low complexity.
Abstract:
A method and a device are described for processing a stereo signal obtained from an encoder, which encodes an N-channel audio signal into spatial parameters (P) and a stereo down-mix comprising first and second stereo signals (L0, R0). A first signal and a third signal are added in order to obtain a first output signal (L0w), wherein the first signal (L0wL) comprises the first stereo signal (L0) modified by a first complex function (g1), and the third signal (L0wR) comprises the second stereo signal (R0) modified by a third complex function (g3). A second signal and a fourth signal are added to obtain a second output signal (R0w). The fourth signal (R0wR) comprises the second stereo signal (R0) modified by a fourth complex function (g4), and the second signal (R0wL) comprises the first stereo signal (L0) modified by a second complex function (g2). The complex functions (g1, g2, g3, g4) are functions of the spatial parameters (P) and are chosen such that the energy value of the difference (L0wL−R0wL) between the first signal and the second signal is larger than or equal to the energy value of the sum (L0wL+R0wL) of the first and the second signal, and the energy value of the difference (R0wR−L0wR) between the fourth signal and the third signal is larger than or equal to the energy value of the sum (R0wR+L0wR) of the fourth signal and the third signal.
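The matrixing step and its energy condition can be sketched as follows. This is a minimal illustration, not the patented implementation: it assumes the complex functions g1..g4 are constant complex gains applied to complex sub-band samples, and the function names and example values are hypothetical. Since |a−b|² − |a+b|² = −4·Re(a·conj(b)), the condition on the first pair reduces to Re(g1·conj(g2)) ≤ 0 for constant gains, and likewise Re(g4·conj(g3)) ≤ 0 for the second pair.

```python
def energy(samples):
    """Sum of squared magnitudes of a list of complex samples."""
    return sum(abs(s) ** 2 for s in samples)


def matrix_stereo(l0, r0, g1, g2, g3, g4):
    """Form L0w = g1*L0 + g3*R0 and R0w = g2*L0 + g4*R0 (assumed constant gains)."""
    l0w = [g1 * a + g3 * b for a, b in zip(l0, r0)]
    r0w = [g2 * a + g4 * b for a, b in zip(l0, r0)]
    return l0w, r0w


def satisfies_energy_condition(l0, r0, g1, g2, g3, g4):
    """Check both 'difference energy >= sum energy' conditions from the abstract."""
    l0wl = [g1 * a for a in l0]  # first signal
    r0wl = [g2 * a for a in l0]  # second signal
    l0wr = [g3 * b for b in r0]  # third signal
    r0wr = [g4 * b for b in r0]  # fourth signal
    first_pair_ok = (
        energy([x - y for x, y in zip(l0wl, r0wl)])
        >= energy([x + y for x, y in zip(l0wl, r0wl)])
    )
    second_pair_ok = (
        energy([x - y for x, y in zip(r0wr, l0wr)])
        >= energy([x + y for x, y in zip(r0wr, l0wr)])
    )
    return first_pair_ok and second_pair_ok


# Hypothetical example values (not taken from the source).
l0 = [1 + 0j, 0.5 - 0.5j]
r0 = [0.2 + 0.1j, -0.4 + 0j]

# Cross gains with a negative real part relative to the direct gains: condition holds.
ok = satisfies_energy_condition(l0, r0, 1 + 0j, -0.3 + 0.4j, -0.3 - 0.4j, 1 + 0j)

# In-phase cross gains: the sum carries more energy than the difference, so it fails.
bad = satisfies_energy_condition(l0, r0, 1 + 0j, 0.5 + 0j, 0.5 + 0j, 1 + 0j)

l0w, r0w = matrix_stereo(l0, r0, 1 + 0j, -0.3 + 0.4j, -0.3 - 0.4j, 1 + 0j)
```

With constant gains the outcome does not depend on the signal content; in the method described above the functions depend on the spatial parameters (P), so the check would be applied per parameter set.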
Abstract:
A parameter transformer generates level parameters, indicating an energy relation between a first and a second audio channel of a multi-channel audio signal associated with a multi-channel loudspeaker configuration. The level parameters are generated based on object parameters for a plurality of audio objects associated with a down-mix channel, which is generated using object audio signals associated with the audio objects. The object parameters include an energy parameter indicating an energy of the object audio signal. To derive the coherence and the level parameters, a parameter generator is used, which combines the energy parameter and object rendering parameters, which depend on a desired rendering configuration.
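One way to picture how an energy parameter and rendering parameters could be combined into a level parameter is sketched below. The formula is an assumption for illustration only, not taken from the abstract: objects are treated as mutually uncorrelated, rendering parameters are real gains per object and channel, each channel energy is the gain-weighted sum of object energies, and the level parameter is the energy ratio in dB. All names are hypothetical.

```python
import math


def channel_energy(object_energies, gains):
    """Energy of one rendered channel, assuming uncorrelated objects (assumption)."""
    return sum(g * g * e for g, e in zip(gains, object_energies))


def level_parameter_db(object_energies, gains_ch1, gains_ch2, eps=1e-12):
    """Energy ratio between two rendered channels in dB (assumed definition)."""
    e1 = channel_energy(object_energies, gains_ch1)
    e2 = channel_energy(object_energies, gains_ch2)
    # eps guards against division by zero when a channel receives no object.
    return 10.0 * math.log10((e1 + eps) / (e2 + eps))


# Hypothetical example: two objects rendered to two channels.
object_energies = [1.0, 4.0]
gains_ch1 = [1.0, 0.5]  # object 1 fully in channel 1, object 2 split
gains_ch2 = [0.0, 0.5]

level_db = level_parameter_db(object_energies, gains_ch1, gains_ch2)
```

Here channel 1 has energy 2.0 and channel 2 has energy 1.0, so the level parameter is about 3 dB in favor of channel 1.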
Abstract:
A device (1) for converting a first number (M) of input audio channels into a second, larger number (N) of output audio channels comprises: decorrelation units (3) for decomposing the input audio channels into a set of decorrelated auxiliary channels, at least one upmix unit (4) for combining the decorrelated auxiliary channels into the output audio channels, and at least one pre-processing unit (2) for pre-processing the input audio channels and feeding the pre-processed input audio channels to the decorrelation units (3). The pre-processing unit (2) and the upmix unit (4) are preferably controlled by audio parameters.
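The signal flow (pre-processing, decorrelation, upmix) can be sketched for M = 1 and N = 2. Everything specific here is an assumption for illustration: the pre-processing is a plain gain, the decorrelator is a crude one-sample delay standing in for a real decorrelation filter, and the upmix is a fixed 2×2 matrix. In the device described above, the pre-processing and upmix would be controlled by audio parameters.

```python
def preprocess(channels, gain):
    """Pre-processing stand-in: scale every input channel by a gain (assumption)."""
    return [[gain * s for s in ch] for ch in channels]


def decorrelate(channel, delay):
    """Crude decorrelator stand-in: return the channel and a delayed copy.

    Assumes 0 <= delay <= len(channel).
    """
    delayed = [0.0] * delay + channel[: len(channel) - delay]
    return [channel, delayed]


def upmix(aux_channels, matrix):
    """Combine auxiliary channels into output channels with an upmix matrix."""
    n_samples = len(aux_channels[0])
    return [
        [sum(row[k] * aux_channels[k][t] for k in range(len(aux_channels)))
         for t in range(n_samples)]
        for row in matrix
    ]


# Hypothetical example: one input channel, two output channels.
mono = [[1.0, 2.0, 3.0]]
pre = preprocess(mono, gain=0.5)
aux = decorrelate(pre[0], delay=1)
outputs = upmix(aux, matrix=[[1.0, 1.0], [1.0, -1.0]])
# outputs == [[0.5, 1.5, 2.5], [0.5, 0.5, 0.5]]
```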
Abstract:
Techniques are described for combining parametric multi-channel audio coding with matrixing, so that a full-quality multi-channel signal can be reconstructed independent of the decoder. A stereo signal, obtained by encoding an N-channel audio signal into spatial parameters and a stereo down-mix signal having first and second stereo signals, is processed by adding a first signal and a third signal to obtain a first output signal, the first signal having the first stereo signal modified by a first complex function and the third signal having the second stereo signal modified by a third complex function. A second signal and a fourth signal are similarly added to obtain a second output signal. The complex functions are chosen such that the energy value of the difference between the first signal and the second signal (and likewise between the fourth signal and the third signal) is larger than or equal to the energy value of the sum of the first and the second signal (respectively, of the fourth and the third signal).