Abstract:
An audio signal processing decoder (2) has at least one frequency band (36) and is configured to process an input audio signal (37) having a plurality of input channels (38) in the at least one frequency band (36). The decoder (2) is configured to analyze the input audio signal (37) to identify inter-channel dependencies (39) between the input channels (38); to align the phases of the input channels (38) based on the identified inter-channel dependencies (39), such that the higher the inter-channel dependency (39) of input channels (38), the more closely their phases are aligned with each other; and to downmix the aligned input audio signal to an output audio signal (40) having fewer output channels (41) than there are input channels (38).
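The dependency-weighted alignment can be illustrated with a minimal two-channel sketch for a single frequency band. It is not the decoder's actual method: the dependency measure (a normalized cross-correlation magnitude), the rule that rotates the second channel by the dependency times the measured phase difference, and the name `downmix_band` are all assumptions made for illustration.

```python
import cmath

def downmix_band(ch_a, ch_b):
    """Downmix two channels of one frequency band to a single channel.

    ch_a, ch_b: lists of complex spectral values over time frames.
    Returns (downmixed values, dependency in [0, 1]).
    """
    # Cross-correlation over frames; its phase is arg(a) - arg(b).
    cross = sum(a * b.conjugate() for a, b in zip(ch_a, ch_b))
    energy = (sum(abs(a) ** 2 for a in ch_a) * sum(abs(b) ** 2 for b in ch_b)) ** 0.5
    # Assumed dependency measure: normalized cross-correlation magnitude.
    dependency = abs(cross) / energy if energy > 0 else 0.0
    # Assumed rule: rotate channel b toward channel a in proportion to the
    # dependency, so fully dependent channels end up fully aligned.
    rotation = cmath.exp(1j * dependency * cmath.phase(cross))
    mixed = [a + b * rotation for a, b in zip(ch_a, ch_b)]
    return mixed, dependency
```

With two fully dependent channels in opposite phase, a plain sum would cancel to zero, whereas this sketch aligns the second channel first and so preserves the signal energy.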
Abstract:
An apparatus for generating loudspeaker signals is provided. The apparatus comprises an object metadata processor (110) and an object renderer (120). The object renderer (120) is configured to receive an audio object. The object metadata processor (110) is configured to receive metadata, comprising an indication on whether the audio object is screen-related, and further comprising a first position of the audio object. The object metadata processor (110) is configured to calculate a second position of the audio object depending on the first position of the audio object and depending on a size of a screen, if the audio object is indicated in the metadata as being screen-related. The object renderer (120) is configured to generate the loudspeaker signals depending on the audio object and depending on position information. The object metadata processor (110) is configured to feed the first position of the audio object as the position information into the object renderer (120), if the audio object is indicated in the metadata as being not screen-related. The object metadata processor (110) is configured to feed the second position of the audio object as the position information into the object renderer (120), if the audio object is indicated in the metadata as being screen-related.
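The routing of position information described above can be sketched as follows. The abstract does not state how the second position is calculated, so the linear azimuth scaling between a reference screen and the local screen is purely an assumption, as are the names `ObjectMetadata`, `remap_azimuth`, and `position_for_renderer`.

```python
from dataclasses import dataclass

@dataclass
class ObjectMetadata:
    azimuth_deg: float      # first position: azimuth
    elevation_deg: float    # first position: elevation
    screen_related: bool    # indication carried in the metadata

def remap_azimuth(azimuth_deg, ref_half_width_deg, local_half_width_deg):
    # Hypothetical mapping: scale the azimuth by the ratio of screen sizes.
    return azimuth_deg * local_half_width_deg / ref_half_width_deg

def position_for_renderer(meta, ref_half_width_deg, local_half_width_deg):
    """Return the (azimuth, elevation) fed into the object renderer."""
    if meta.screen_related:
        # Second position: depends on the first position and the screen size.
        azimuth = remap_azimuth(meta.azimuth_deg, ref_half_width_deg, local_half_width_deg)
        return (azimuth, meta.elevation_deg)
    # Not screen-related: pass the first position through unchanged.
    return (meta.azimuth_deg, meta.elevation_deg)
```

For example, an object at 20 degrees azimuth on a reference screen spanning plus or minus 30 degrees would, under this assumed mapping, move to 10 degrees on a local screen spanning plus or minus 15 degrees, while an object not marked as screen-related stays at 20 degrees.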
Abstract:
An audio encoder for encoding audio input data (101) to obtain audio output data (501) comprises an input interface (100) for receiving a plurality of audio channels, a plurality of audio objects and metadata related to one or more of the plurality of audio objects; a mixer (200) for mixing the plurality of objects and the plurality of channels to obtain a plurality of pre-mixed channels, each pre-mixed channel comprising audio data of a channel and audio data of at least one object; a core encoder (300) for core encoding core encoder input data; and a metadata compressor (400) for compressing the metadata related to the one or more of the plurality of audio objects. The audio encoder is configured to operate in at least one of two modes: a first mode, in which the core encoder (300) is configured to encode, as the core encoder input data, the plurality of audio channels and the plurality of audio objects received by the input interface (100), and a second mode, in which the core encoder (300) is configured to receive, as the core encoder input data, the plurality of pre-mixed channels generated by the mixer (200).
Abstract:
An apparatus (100) for playing back an audio object associated with a position is provided. The apparatus (100) comprises a distance calculator (110) for calculating distances of the position to speakers or for reading the distances of the position to the speakers. The distance calculator (110) is configured to take a solution with a smallest distance. The apparatus (100) is configured to play back the audio object using the speaker corresponding to the solution.
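A minimal sketch of the smallest-distance selection, assuming Cartesian coordinates and Euclidean distance (the abstract does not specify the distance measure). The function name `nearest_speaker` and the speaker layout in the usage note are hypothetical.

```python
import math

def nearest_speaker(position, speakers):
    """Return the name of the speaker closest to the object position.

    position: (x, y, z) of the audio object.
    speakers: dict mapping speaker name to its (x, y, z) position.
    """
    # Calculate the distance to every speaker and take the smallest one.
    return min(speakers, key=lambda name: math.dist(position, speakers[name]))
```

Given speakers at `(-1, 1, 0)`, `(0, 1, 0)` and `(1, 1, 0)`, an object at `(0.8, 1, 0)` would be played back from the speaker at `(1, 1, 0)`.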
Abstract:
A method for processing an audio signal (400) in accordance with a room impulse response (434) is described. The audio signal (400) is separately processed (422, 424) with an early part and a late reverberation of the room impulse response (434), and the processed early part (428) of the audio signal and the reverberated signal (430) are combined (432). A transition from the early part to the late reverberation in the room impulse response is reached when a correlation measure reaches a threshold, the threshold being set dependent on the correlation measure for a selected one of the early reflections in the early part of the room impulse response.
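The transition rule can be sketched as below, under several assumptions that the abstract leaves open: the correlation measure is already available as one value per time frame, it decreases over time, "reaches" means falling to or below the threshold, and the threshold is a fixed fraction (the hypothetical `factor`) of the measure at the selected early reflection.

```python
def find_transition(correlation, reflection_index, factor=0.5):
    """Return the frame index where late reverberation is taken to begin.

    correlation: correlation measure per time frame (assumed precomputed).
    reflection_index: frame of the selected early reflection.
    factor: hypothetical scaling that derives the threshold from the
            correlation measure at the selected reflection.
    """
    # The threshold depends on the measure at the selected early reflection.
    threshold = factor * correlation[reflection_index]
    for i in range(reflection_index, len(correlation)):
        if correlation[i] <= threshold:
            return i
    return None  # the threshold is never reached in this response
```

Everything before the returned index would then be treated as the early part, and everything from that index onward as late reverberation.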
Abstract:
A method for processing an audio signal (504) in accordance with a room impulse response is described. The audio signal (504) is processed (502) with an early part of the room impulse response separate from a late reverberation of the room impulse response, wherein the processing (514) of the late reverberation comprises generating a scaled reverberated signal, the scaling (526) being dependent on the audio signal (504). The processed early part (506) of the audio signal (504) and the scaled reverberated signal are combined.
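The separate processing and signal-dependent scaling can be sketched for a single channel as follows. The abstract does not say how the scaling is derived from the audio signal, so the gain rule is passed in as a caller-supplied function (`gain_fn`, hypothetical), and plain convolution stands in for both the early-part processing and the reverberator.

```python
def convolve(x, h):
    """Direct-form linear convolution of two sequences."""
    y = [0.0] * (len(x) + len(h) - 1)
    for i, xi in enumerate(x):
        for j, hj in enumerate(h):
            y[i + j] += xi * hj
    return y

def process(signal, early_ir, late_ir, gain_fn):
    """Process early part and late reverberation separately, then combine.

    gain_fn: hypothetical rule mapping the input signal to a scaling factor.
    """
    early = convolve(signal, early_ir)
    gain = gain_fn(signal)  # the scaling depends on the audio signal
    late = [gain * v for v in convolve(signal, late_ir)]
    # Combine the processed early part with the scaled reverberated signal.
    out = [0.0] * max(len(early), len(late))
    for i, v in enumerate(early):
        out[i] += v
    for i, v in enumerate(late):
        out[i] += v
    return out
```

Keeping the gain rule as a parameter makes it explicit that only the structure (separate paths, scaled reverberation, final sum) comes from the abstract.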