Abstract:
Audio encoder for encoding audio input data (101) to obtain audio output data (501) comprises an input interface (100) for receiving a plurality of audio channels, a plurality of audio objects and metadata related to one or more of the plurality of audio objects; a mixer (200) for mixing the plurality of objects and the plurality of channels to obtain a plurality of pre-mixed channels, each pre-mixed channel comprising audio data of a channel and audio data of at least one object; a core encoder (300) for core encoding core encoder input data; and a metadata compressor (400) for compressing the metadata related to the one or more of the plurality of audio objects, wherein the audio encoder is configured to operate in at least one mode of the group of two modes comprising a first mode, in which the core encoder is configured to encode the plurality of audio channels and the plurality of audio objects received by the input interface as core encoder input data, and a second mode, in which the core encoder (300) is configured for receiving, as the core encoder input data, the plurality of pre-mixed channels generated by the mixer (200).
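The two operating modes differ only in what is handed to the core encoder. The following is a minimal sketch of that selection, not the patented implementation: the function names, the per-object gain matrix `gains`, and the plain additive mixing rule are all assumptions made for illustration, and all signals are assumed to have equal length.

```python
from typing import List, Sequence

Signal = List[float]


def premix(channels: Sequence[Signal], objects: Sequence[Signal],
           gains: Sequence[Sequence[float]]) -> List[Signal]:
    """Add every object into every channel (hypothetical mixing rule).

    gains[o][c] is the assumed gain of object o in channel c.
    """
    mixed = [list(ch) for ch in channels]
    for o, obj in enumerate(objects):
        for c, ch in enumerate(mixed):
            for n, sample in enumerate(obj):
                ch[n] += gains[o][c] * sample
    return mixed


def core_encoder_input(channels: Sequence[Signal], objects: Sequence[Signal],
                       gains: Sequence[Sequence[float]], mode: int) -> List[Signal]:
    """Select the core encoder input data for the given mode."""
    if mode == 1:
        # First mode: channels and objects are passed on separately.
        return [list(s) for s in channels] + [list(s) for s in objects]
    if mode == 2:
        # Second mode: only the pre-mixed channels are passed on.
        return premix(channels, objects, gains)
    raise ValueError("mode must be 1 or 2")
```

With two channels and one object, the first mode yields three core encoder input signals, while the second mode yields two pre-mixed channels.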
Abstract:
An apparatus (100) for generating one or more audio channels is provided. The apparatus comprises a metadata decoder (110) for generating one or more reconstructed metadata signals (x_1', ..., x_N') from one or more processed metadata signals (z_1, ..., z_N) depending on a control signal (b), wherein each of the one or more reconstructed metadata signals (x_1', ..., x_N') indicates information associated with an audio object signal of one or more audio object signals, wherein the metadata decoder (110) is configured to generate the one or more reconstructed metadata signals (x_1', ..., x_N') by determining a plurality of reconstructed metadata samples (x_1'(n), ..., x_N'(n)) for each of the one or more reconstructed metadata signals (x_1', ..., x_N'). Moreover, the apparatus comprises an audio channel generator (120) for generating the one or more audio channels depending on the one or more audio object signals and depending on the one or more reconstructed metadata signals (x_1', ..., x_N'). The metadata decoder (110) is configured to receive a plurality of processed metadata samples (z_1(n), ..., z_N(n)) of each of the one or more processed metadata signals (z_1, ..., z_N). Moreover, the metadata decoder (110) is configured to receive the control signal (b). Furthermore, the metadata decoder (110) is configured to determine each reconstructed metadata sample (x_i'(n)) of the plurality of reconstructed metadata samples (x_i'(1), ..., x_i'(n-1), x_i'(n)) of each reconstructed metadata signal (x_i') of the one or more reconstructed metadata signals (x_1', ..., x_N'), so that, when the control signal (b) indicates a first state (b(n)=0), said reconstructed metadata sample (x_i'(n)) is a sum of one of the processed metadata samples (z_i(n)) of one of the one or more processed metadata signals (z_i) and of another already generated reconstructed metadata sample (x_i'(n-1)) of said reconstructed metadata signal (x_i'), and so that, when the control signal indicates a second state (b(n)=1) being different from the first state, said reconstructed metadata sample (x_i'(n)) is said one (z_i(n)) of the processed metadata samples (z_i(1), ..., z_i(n)) of said one (z_i) of the one or more processed metadata signals (z_1, ..., z_N). Moreover, an apparatus (250) for generating encoded audio information is provided.
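The reconstruction rule in this abstract amounts to differential decoding with a reset flag: when b(n)=0 the processed sample is added to the previous reconstructed sample, and when b(n)=1 the processed sample is taken as is. A minimal sketch for a single metadata signal follows; the function name and the `initial` value used when the very first sample arrives with b(n)=0 are assumptions, since the abstract does not cover that case.

```python
from typing import List, Sequence


def reconstruct_metadata(z: Sequence[int], b: Sequence[int],
                         initial: int = 0) -> List[int]:
    """Rebuild one metadata signal x' from processed samples z and control bits b.

    b[n] == 0: x'[n] = z[n] + x'[n-1]  (z[n] is a difference)
    b[n] == 1: x'[n] = z[n]            (z[n] is an absolute value)
    `initial` stands in for x'[-1]; it is an assumption of this sketch.
    """
    if len(z) != len(b):
        raise ValueError("z and b must have the same length")
    reconstructed: List[int] = []
    previous = initial
    for z_n, b_n in zip(z, b):
        current = z_n if b_n == 1 else z_n + previous
        reconstructed.append(current)
        previous = current
    return reconstructed
```

For several metadata signals, the same control signal b would be applied to each z_i in turn, for example `[reconstruct_metadata(z_i, b) for z_i in signals]`.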
Abstract:
An audio decoder for providing at least four bandwidth-extended channel signals on the basis of an encoded representation is configured to provide a first downmix signal and a second downmix signal on the basis of a jointly encoded representation of the first downmix signal and the second downmix signal using a multi-channel decoding. The audio decoder is configured to provide at least a first audio channel signal and a second audio channel signal on the basis of the first downmix signal using a multi-channel decoding. The audio decoder is configured to provide at least a third audio channel signal and a fourth audio channel signal on the basis of the second downmix signal using a multi-channel decoding. The audio decoder is configured to perform a multi-channel bandwidth extension on the basis of the first audio channel signal and the third audio channel signal, to obtain a first bandwidth-extended channel signal and a third bandwidth-extended channel signal. The audio decoder is configured to perform a multi-channel bandwidth extension on the basis of the second audio channel signal and the fourth audio channel signal, to obtain a second bandwidth extended channel signal and a fourth bandwidth extended channel signal. An audio encoder uses a related concept.
Abstract:
An apparatus (100) for generating one or more audio channels is provided. The apparatus (100) comprises a metadata decoder (110) for receiving one or more compressed metadata signals. Each of the one or more compressed metadata signals comprises a plurality of first metadata samples. The first metadata samples of each of the one or more compressed metadata signals indicate information associated with an audio object signal of one or more audio object signals. The metadata decoder (110) is configured to generate one or more reconstructed metadata signals, so that each of the one or more reconstructed metadata signals comprises the first metadata samples of one of the one or more compressed metadata signals and further comprises a plurality of second metadata samples. Moreover, the metadata decoder (110) is configured to generate each of the second metadata samples of each reconstructed metadata signal of the one or more reconstructed metadata signals depending on at least two of the first metadata samples of said reconstructed metadata signal. Moreover, the apparatus (100) comprises an audio channel generator (120) for generating the one or more audio channels depending on the one or more audio object signals and depending on the one or more reconstructed metadata signals. Furthermore, an apparatus for generating encoded audio information comprising one or more encoded audio signals and one or more compressed metadata signals is provided.
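The abstract only requires that each second metadata sample depend on at least two first metadata samples. One possible reading, assumed here purely for illustration, is linear interpolation between neighbouring first samples; the function name and the `factor` parameter are hypothetical.

```python
from typing import List, Sequence


def upsample_metadata(first_samples: Sequence[float], factor: int) -> List[float]:
    """Keep the first samples and insert factor-1 second samples between each pair.

    Linear interpolation is an assumption of this sketch, not something
    stated in the abstract.
    """
    if factor < 1:
        raise ValueError("factor must be at least 1")
    if not first_samples:
        return []
    out: List[float] = []
    for left, right in zip(first_samples, first_samples[1:]):
        out.append(left)  # transmitted first sample, copied unchanged
        for k in range(1, factor):
            # second sample derived from the two surrounding first samples
            out.append(left + (right - left) * k / factor)
    out.append(first_samples[-1])
    return out
```

Calling `upsample_metadata([0.0, 4.0, 2.0], 4)` returns the three transmitted samples unchanged, with three interpolated samples between each neighbouring pair.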
Abstract:
For flexibly signaling a synchronous mode or an asynchronous mode in the multi-channel parameter reconstruction, a parameter configuration cue is inserted in the data stream, which is used by a configurator on the side of a multi-channel decoder to configure a multi-channel reconstructor. If the parameter configuration cue has a first meaning, the configurator will look for further configuration information in its input data, while, when the parameter configuration cue has another meaning, the configurator performs a configuration setting of the multi-channel reconstructor based on information on a coding algorithm with which transmission channel data have been coded, so that it is ensured efficiently on the one hand and flexibly on the other hand that there will always be obtained a correct association between parameter data and decoded transmission channel data.
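The configurator's decision can be sketched as a two-way branch on the cue. Everything concrete below is an assumption for illustration: the cue values 1 and 0, the use of a frame length as the configuration setting, and the codec table (1024 samples per AAC frame and 1152 per MP3 frame are the usual figures, used here only as example entries).

```python
from typing import Optional

# Illustrative mapping from coding algorithm to frame length in samples.
CODEC_FRAME_LENGTH = {"aac": 1024, "mp3": 1152}


def configure_reconstructor(cue: int, explicit_frame_length: Optional[int],
                            codec: str) -> int:
    """Return the frame length the multi-channel reconstructor should use."""
    if cue == 1:
        # Assumed "first meaning": further configuration information
        # is carried in the input data itself.
        if explicit_frame_length is None:
            raise ValueError("cue announces explicit configuration, but none was found")
        return explicit_frame_length
    # Assumed "other meaning": derive the setting from the coding algorithm
    # with which the transmission channel data were coded.
    return CODEC_FRAME_LENGTH[codec]
```

Deriving the setting from the codec keeps the parameter data aligned with the decoded transmission channel frames without spending extra bits on explicit configuration.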
Abstract:
An audio decoder for providing at least four audio channel signals on the basis of an encoded representation is configured to provide a first residual signal and a second residual signal on the basis of a jointly encoded representation of the first residual signal and of the second residual signal using a multi-channel decoding. The audio decoder is configured to provide a first audio channel signal and a second audio channel signal on the basis of a first downmix signal and the first residual signal using a residual-signal-assisted multi-channel decoding. The audio decoder is configured to provide a third audio channel signal and a fourth audio channel signal on the basis of a second downmix signal and the second residual signal using a residual-signal-assisted multi-channel decoding. An audio encoder is based on corresponding considerations.
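The decoding structure described here is a two-level tree: the two residual signals are first recovered from their joint representation, and each residual then assists the upmix of one downmix signal into two channels. The sketch below shows only that wiring. The sum/difference rule used at both levels is an assumption standing in for the actual multi-channel decoding, and the names are hypothetical.

```python
from typing import List, Sequence, Tuple

Signal = List[float]


def sum_diff(a: Sequence[float], b: Sequence[float]) -> Tuple[Signal, Signal]:
    """Stand-in two-channel decoding step: returns (a + b, a - b) per sample."""
    return ([x + y for x, y in zip(a, b)],
            [x - y for x, y in zip(a, b)])


def decode_four_channels(downmix1: Sequence[float], downmix2: Sequence[float],
                         joint_res_a: Sequence[float], joint_res_b: Sequence[float]
                         ) -> Tuple[Signal, Signal, Signal, Signal]:
    # Level 1: recover both residuals from their joint representation.
    residual1, residual2 = sum_diff(joint_res_a, joint_res_b)
    # Level 2: residual-assisted decoding of each downmix into two channels.
    ch1, ch2 = sum_diff(downmix1, residual1)
    ch3, ch4 = sum_diff(downmix2, residual2)
    return ch1, ch2, ch3, ch4
```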
Abstract:
An audio decoder device for decoding a compressed input audio signal comprising at least one core decoder (6, 24) having one or more processors (36, 36') for generating a processor output signal (37) based on a processor input signal (38, 38'), wherein a number of output channels (37.1, 37.2, 37.1', 37.2') of the processor output signal (37, 37') is higher than a number of input channels (38.1, 38.1') of the processor input signal (38, 38'), wherein each of the one or more processors (36, 36') comprises a decorrelator (39, 39') and a mixer (40, 40'), wherein a core decoder output signal (13) having a plurality of channels (13.1, 13.2, 13.3, 13.4) comprises the processor output signal (37, 37'), and wherein the core decoder output signal (13) is suitable for a reference loudspeaker setup (42); at least one format converter device (9, 10) configured to convert the core decoder output signal (13) into an output audio signal (31), which is suitable for a target loudspeaker setup (45); and a control device (46) configured to control at least one of the one or more processors (36, 36') in such a way that the decorrelator (39, 39') of the processor (36, 36') may be controlled independently from the mixer (40, 40') of the processor (36, 36'), wherein the control device (46) is configured to control at least one of the decorrelators (39, 39') of the one or more processors (36, 36') depending on the target loudspeaker setup (45).
Abstract:
Audio decoder for decoding encoded audio data, comprising: an input interface (1100) for receiving the encoded audio data, the encoded audio data comprising a plurality of encoded channels or a plurality of encoded objects or compressed metadata related to the plurality of objects; a core decoder (1300) for decoding the plurality of encoded channels and the plurality of encoded objects; a metadata decompressor (1400) for decompressing the compressed metadata; an object processor (1200) for processing the plurality of decoded objects using the decompressed metadata to obtain a number of output channels (1205) comprising audio data from the objects and the decoded channels; and a post-processor (1700) for converting the number of output channels (1205) into an output format, wherein the audio decoder is configured to bypass the object processor and to feed a plurality of decoded channels into the post-processor (1700), when the encoded audio data does not contain any audio objects and to feed the plurality of decoded objects and the plurality of decoded channels into the object processor (1200), when the encoded audio data comprises encoded channels and encoded objects.
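The bypass behaviour can be sketched as a routing function placed after the core decoder. The function name and the idea of passing the object processor and post-processor in as callables are assumptions of this sketch; the abstract specifies only which stage receives which signals.

```python
from typing import Any, Callable, Sequence


def route_decoded_audio(decoded_channels: Sequence[Any],
                        decoded_objects: Sequence[Any],
                        metadata: Any,
                        object_processor: Callable[[Sequence[Any], Sequence[Any], Any], Sequence[Any]],
                        post_processor: Callable[[Sequence[Any]], Any]) -> Any:
    """Feed decoded signals to the right stage, bypassing the object processor if possible."""
    if not decoded_objects:
        # No audio objects: the decoded channels go straight to the post-processor.
        return post_processor(decoded_channels)
    # Channels and objects present: the object processor produces the output
    # channels, which the post-processor then converts into the output format.
    output_channels = object_processor(decoded_channels, decoded_objects, metadata)
    return post_processor(output_channels)
```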