摘要:
An audio decoder for providing at least four bandwidth-extended channel signals on the basis of an encoded representation is configured to provide a first downmix signal and a second downmix signal on the basis of a jointly encoded representation of the first downmix signal and the second downmix signal using a multi-channel decoding. The audio decoder is configured to provide at least a first audio channel signal and a second audio channel signal on the basis of the first downmix signal using a multi-channel decoding. The audio decoder is configured to provide at least a third audio channel signal and a fourth audio channel signal on the basis of the second downmix signal using a multi-channel decoding. The audio decoder is configured to perform a multi-channel bandwidth extension on the basis of the first audio channel signal and the third audio channel signal, to obtain a first bandwidth-extended channel signal and a third bandwidth-extended channel signal. The audio decoder is configured to perform a multi-channel bandwidth extension on the basis of the second audio channel signal and the fourth audio channel signal, to obtain a second bandwidth extended channel signal and a fourth bandwidth extended channel signal. An audio encoder uses a related concept.
摘要:
An apparatus for generating loudspeaker signals is provided. The apparatus comprises an object metadata processor (110) and an object renderer (120). The object renderer (120) is configured to receive an audio object. The object metadata processor (110) is configured to receive metadata, comprising an indication on whether the audio object is screen-related, and further comprising a first position of the audio object. The object metadata processor (110) is configured to calculate a second position of the audio object depending on the first position of the audio object and depending on a size of a screen, if the audio object is indicated in the metadata as being screen-related. The object renderer (120) is configured to generate the loudspeaker signals depending on the audio object and depending on position information. The object metadata processor (110) is configured to feed the first position of the audio object as the position information into the object renderer (120), if the audio object is indicated in the metadata as being not screen-related. The object metadata processor (110) is configured to feed the second position of the audio object as the position information into the object renderer (120), if the audio object is indicated in the metadata as being screen-related.
摘要:
An apparatus for encoding an audio signal having a plurality of channels is provided. The apparatus comprises a downmixer (1010) for downmixing the plurality of channels to obtain a downmix signal. Moreover, the apparatus comprises a residual signal calculator (1020) adapted for calculating a residual signal. Furthermore, the apparatus comprises a phase information calculator (1030) adapted for calculating information on a phase difference between the downmix and the residual signal to obtain phase information. Moreover, the apparatus comprises an output generator (1040) for outputting the phase information.
摘要:
Apparatus (100) for adapting a spatial audio signal (2) for an original loudspeaker setup to a playback loudspeaker setup that differs from the original loudspeaker setup. The apparatus comprises a direct-ambience decomposer (130) that is configured to decomposing channel signals in a segment of the original loudspeaker setup into direct sound (D) and ambience components (A), and to determine a direction of arrival of the direct sound components. A direct sound renderer (150) receives a playback loudspeaker setup information and adjusts the direct sound components (D) using the playback loudspeaker setup information so that a perceived direction of arrival of the direct sound components in the playback loudspeaker setup is substantially identical to the direction of arrival of the direct sound components. A combiner (180) combines adjusted direct sound components and possibly modified ambience components to obtain loudspeaker signals for loudspeakers of the playback loudspeaker setup.
摘要:
An apparatus for encoding an audio signal having a plurality of channels is provided. The apparatus comprises a downmixer (1010) for downmixing the plurality of channels to obtain a downmix signal. Moreover, the apparatus comprises a residual signal calculator (1020) adapted for calculating a residual signal. Furthermore, the apparatus comprises a phase information calculator (1030) adapted for calculating information on a phase difference between the downmix and the residual signal to obtain phase information. Moreover, the apparatus comprises an output generator (1040) for outputting the phase information.
摘要:
An apparatus for encoding an audio signal having a plurality of channels is provided. The apparatus comprises a downmixer (1010) for downmixing the plurality of channels to obtain a downmix signal. Moreover, the apparatus comprises a residual signal calculator (1020) adapted for calculating a residual signal. Furthermore, the apparatus comprises a phase information calculator (1030) adapted for calculating information on a phase difference between the downmix and the residual signal to obtain phase information. Moreover, the apparatus comprises an output generator (1040) for outputting the phase information.
摘要:
Audio decoder for decoding encoded audio data, comprising: an input interface (1100) for receiving the encoded audio data, the encoded audio data comprising a plurality of encoded channels or a plurality of encoded objects or compress metadata related to the plurality of objects; a core decoder (1300) for decoding the plurality of encoded channels and the plurality of encoded objects; a metadata decompressor (1400) for decompressing the compressed metadata; an object processor (1200) for processing the plurality of decoded objects using the decompressed metadata to obtain a number of output channels (1205) comprising audio data from the objects and the decoded channels; and a post-processor (1700) for converting the number of output channels (1205) into an output format, wherein the audio decoder is configured to bypass the object processor and to feed a plurality of decoded channels into the post-processor (1700), when the encoded audio data does not contain any audio objects and to feed the plurality of decoded objects and the plurality of decoded channels into the object processor (1200), when the encoded audio data comprises encoded channels and encoded objects..
摘要:
An apparatus for generating a plurality of audio channels for a first speaker setup is characterized by an imaginary speaker determiner, an energy distribution calculator, a processor and a renderer. The imaginary speaker determiner is configured to determine a position of an imaginary speaker not contained in the first speaker setup to obtain a second speaker setup containing the imaginary speaker. The energy distribution calculator is configured to calculate an energy distribution from the imaginary speaker to the other speakers in the second speaker setup. The processor is configured to repeat the energy distribution to obtain a downmix information for a downmix from the second speaker setup to the first speaker setup. The renderer is configured to generate the plurality of audio channels using the downmix information.