Abstract:
A decoder for generating an audio output signal comprising one or more audio output channels from a downmix signal is provided. The downmix signal encodes one or more audio object signals. The decoder comprises a control unit (181) for setting an activation indication to an activation state depending on a signal property of at least one of the one or more audio object signals. Moreover, the decoder comprises a first analysis module (182) for transforming the downmix signal to obtain a first transformed downmix comprising a plurality of first subband channels. Furthermore, the decoder comprises a second analysis module (183) for generating, when the activation indication is set to the activation state, a second transformed downmix by transforming at least one of the first subband channels to obtain a plurality of second subband channels, wherein the second transformed downmix comprises the first subband channels which have not been transformed by the second analysis module and the second subband channels. Moreover, the decoder comprises an un-mixing unit (184), wherein the un-mixing unit (184) is configured to un-mix the second transformed downmix, when the activation indication is set to the activation state, based on parametric side information on the one or more audio object signals to obtain the audio output signal, and to un-mix the first transformed downmix, when the activation indication is not set to the activation state, based on the parametric side information on the one or more audio object signals to obtain the audio output signal. Furthermore, an encoder is provided.
Abstract:
A decoder for generating an audio output signal having one or more audio output channels from a downmix signal having one or more downmix channels is provided. The downmix signal encodes one or more audio object signals. The decoder has a threshold determiner for determining a threshold value depending on a signal energy and/or a noise energy of at least one of the one or more audio object signals and/or depending on a signal energy and/or a noise energy of at least one of the one or more downmix channels. Moreover, the decoder has a processing unit for generating the one or more audio output channels from the one or more downmix channels depending on the threshold value.
Abstract:
An audio signal encoder (600) for providing a downmix signal representation (614) and an object-related parametric information (616) on the basis of a plurality of object signals (x_1 to x_N) comprises a downmixer (620) configured to provide one or more downmix signals in dependence on downmix coefficients (d_1 to d_N) associated with the object signals (x_1 to x_N), such that the one or more downmix signals comprise a superposition of a plurality of object signals, and a side information provider (630) configured to provide an inter-object-relationship side information (OLD, IOC) describing level differences and correlation characteristics of object signals (x_1 to x_N) and an individual-object side information describing one or more individual properties of the individual object signals (x_1 to x_N).
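The downmixer and the inter-object side information described in this abstract can be illustrated with a minimal sketch. It assumes a single mono downmix channel, full-band (not per-subband) parameters, an OLD defined as each object's energy relative to the strongest object, and an IOC defined as a normalized cross-correlation. The function names are hypothetical, and the abstract does not confirm these exact definitions.

```python
import math


def downmix(objects, coeffs):
    """Mono downmix: sample-wise sum of the object signals, each weighted
    by its downmix coefficient (assumed: one coefficient per object)."""
    if len(objects) != len(coeffs):
        raise ValueError("need exactly one downmix coefficient per object")
    n = len(objects[0])
    return [sum(d * x[t] for d, x in zip(coeffs, objects)) for t in range(n)]


def object_level_differences(objects):
    """Assumed OLD definition: energy of each object divided by the
    energy of the strongest object, so values lie in [0, 1]."""
    energies = [sum(s * s for s in x) for x in objects]
    peak = max(energies)
    if peak == 0:
        return [0.0] * len(objects)
    return [e / peak for e in energies]


def inter_object_correlation(x, y):
    """Assumed IOC definition: normalized cross-correlation of two objects."""
    num = sum(a * b for a, b in zip(x, y))
    den = math.sqrt(sum(a * a for a in x) * sum(b * b for b in y))
    return num / den if den > 0 else 0.0


# Usage: two short object signals and their downmix coefficients.
objects = [[1.0, 0.0, -1.0], [0.5, 0.5, 0.5]]
coeffs = [0.5, 1.0]
mix = downmix(objects, coeffs)
old = object_level_differences(objects)
ioc = inter_object_correlation(objects[0], objects[1])
```

A practical encoder would compute such parameters per time/frequency tile and quantize them. The sketch only shows how the downmix is a superposition of the objects and how relative levels and correlations can be derived from the same signals.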
Abstract:
According to the present invention, multiple parametrically encoded audio signals can be efficiently combined using an audio signal generator (100), which generates an audio output signal (120) by combining the down-mix channels (110a, 112a) and the associated parameters (110b, 112b) of the audio signals directly within the parameter domain, i.e. without reconstructing or decoding the individual input audio signals prior to the generation of the audio output signal (120). This is achieved by direct mixing of the associated down-mix channels (110a, 112a) of the individual input signals. It is one key feature of the present invention that the combination of the down-mix channels (110a, 112a) is achieved by simple, computationally inexpensive arithmetic operations.
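The idea of combining streams without decoding them can be sketched as follows. The representation is an assumption made only for illustration: each stream is a pair of one mono downmix channel and a list of per-object power values, and the mixing gains are hypothetical parameters. The abstract does not specify the actual parameter format or combination rule.

```python
def combine_streams(stream_a, stream_b, gain_a=1.0, gain_b=1.0):
    """Combine two parametric streams directly in the parameter domain.

    Each stream is (downmix_samples, object_powers); this layout is a
    hypothetical simplification. No object signal is reconstructed:
    the downmixes are added sample by sample, and the power parameters
    are rescaled by the squared gains and concatenated.
    """
    downmix_a, powers_a = stream_a
    downmix_b, powers_b = stream_b
    if len(downmix_a) != len(downmix_b):
        raise ValueError("downmix channels must have equal length")
    mixed = [gain_a * a + gain_b * b for a, b in zip(downmix_a, downmix_b)]
    powers = ([gain_a ** 2 * p for p in powers_a]
              + [gain_b ** 2 * p for p in powers_b])
    return mixed, powers


# Usage: stream A carries one object, stream B carries two.
stream_a = ([1.0, 2.0], [4.0])
stream_b = ([0.5, -1.0], [1.0, 2.0])
mixed, powers = combine_streams(stream_a, stream_b, gain_a=0.5, gain_b=2.0)
```

Only additions and multiplications are involved, which matches the abstract's emphasis on simple, computationally inexpensive arithmetic operations.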
Abstract:
A decoder for generating an audio output signal comprising one or more audio output channels from a downmix signal comprising a plurality of time-domain downmix samples is provided. The downmix signal encodes two or more audio object signals. The decoder comprises a window-sequence generator (134) for determining a plurality of analysis windows, wherein each of the analysis windows comprises a plurality of time-domain downmix samples of the downmix signal. Each analysis window of the plurality of analysis windows has a window length indicating the number of the time-domain downmix samples of said analysis window. The window-sequence generator (134) is configured to determine the plurality of analysis windows so that the window length of each of the analysis windows depends on a signal property of at least one of the two or more audio object signals. Moreover, the decoder comprises a t/f-analysis module (135) for transforming the plurality of time-domain downmix samples of each analysis window of the plurality of analysis windows from a time-domain to a time-frequency domain depending on the window length of said analysis window, to obtain a transformed downmix. Furthermore, the decoder comprises an un-mixing unit (136) for un-mixing the transformed downmix based on parametric side information on the two or more audio object signals to obtain the audio output signal. Moreover, an encoder is provided.
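A signal-adaptive window sequence of the kind described here can be sketched under strong simplifying assumptions: the signal property is taken to be a transient, detected as a jump in block energy; windows do not overlap; and the lengths and the energy ratio are hypothetical parameters. A real decoder would more likely derive the decision from transmitted side information and use overlapping windows.

```python
def window_sequence(samples, long_len=8, short_len=2, ratio=4.0):
    """Return (start, length) pairs that cover `samples`.

    A block whose energy exceeds `ratio` times the energy of the
    previous block is treated as a transient and split into short
    windows (better time resolution); any other block is covered by
    one long window (better frequency resolution).
    """
    windows = []
    prev_energy = None
    for start in range(0, len(samples), long_len):
        block = samples[start:start + long_len]
        energy = sum(s * s for s in block)
        transient = (prev_energy is not None
                     and energy > ratio * max(prev_energy, 1e-12))
        if transient:
            end = start + len(block)
            for s in range(start, end, short_len):
                windows.append((s, min(short_len, end - s)))
        else:
            windows.append((start, len(block)))
        prev_energy = energy
    return windows


# Usage: a quiet block followed by a sudden loud block.
samples = [0.1] * 8 + [1.0] * 8
windows = window_sequence(samples)
```

Each window length would then determine the transform size used by the t/f-analysis step for that window.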
Abstract:
An apparatus for adapting input audio information, encoding one or more audio objects, to obtain adapted audio information is provided. The input audio information comprises two or more input audio downmix channels and further comprises input parametric side information. The adapted audio information comprises one or more adapted audio downmix channels and further comprises adapted parametric side information. The apparatus comprises a downmix signal modifier (110) for adapting, depending on adaptation information, the two or more input audio downmix channels to obtain the one or more adapted audio downmix channels. Moreover, the apparatus comprises a parametric side information adapter (120) for adapting, depending on the adaptation information, the input parametric side information to obtain the adapted parametric side information.