摘要:
A method is described which decodes a downmix matrix (306) for mapping a plurality of input channels (300) of audio content to a plurality of output channels (302), the input and output channels (300, 302) being associated with respective speakers at predetermined positions relative to a listener position, wherein the downmix matrix (306) is encoded by exploiting the symmetry of speaker pairs (S 1 -S 9 ) of the plurality of input channels (300) and the symmetry of speaker pairs (S 10 -S 11 ) of the plurality of output channels (302). Encoded information representing the encoded downmix matrix (306) is received and decoded for obtaining the decoded downmix matrix (306).
摘要:
An apparatus for generating an audio output signal to simulate a recording of a virtual microphone at a configurable virtual position in an environment includes a sound events position estimator and an information computation module. The former is adapted to estimate a sound source position indicating a position of a sound source in the environment, wherein the sound events position estimator is adapted to estimate the sound source position based on first and second direction information provided by first and second real spatial microphones, respectively, located at first and second real microphone positions in the environment, respectively. The information computation module is adapted to generate the audio output signal based on a first recorded audio input signal, on the first real microphone position, on the virtual position of the virtual microphone, and on the sound source position.
摘要:
A method for mapping a plurality of input channels of an input channel configuration to output channels of an output channel configuration comprises providing a set of rules associated with each input channel of the plurality of input channels, wherein the rules define different mappings between the associated input channel and a set of output channels. For each input channel of the plurality of input channels, a rule associated with the input channel is accessed, determination is made whether the set of output channels defined in the accessed rule is present in the output channel configuration, and the accessed rule is selected if the set of output channels defined in the accessed rule is present in the output channel configuration. The input channels are mapped to the output channels according to the selected rule.
摘要:
An audio decoder for providing at least four bandwidth-extended channel signals on the basis of an encoded representation is configured to provide a first downmix signal and a second downmix signal on the basis of a jointly encoded representation of the first downmix signal and the second downmix signal using a multi-channel decoding. The audio decoder is configured to provide at least a first audio channel signal and a second audio channel signal on the basis of the first downmix signal using a multi-channel decoding. The audio decoder is configured to provide at least a third audio channel signal and a fourth audio channel signal on the basis of the second downmix signal using a multi-channel decoding. The audio decoder is configured to perform a multi-channel bandwidth extension on the basis of the first audio channel signal and the third audio channel signal, to obtain a first bandwidth-extended channel signal and a third bandwidth-extended channel signal. The audio decoder is configured to perform a multi-channel bandwidth extension on the basis of the second audio channel signal and the fourth audio channel signal, to obtain a second bandwidth extended channel signal and a fourth bandwidth extended channel signal. An audio encoder uses a related concept.
摘要:
An apparatus (100) for generating a plurality of parametric audio streams ( 125) (θ i , Ψ i , W i ) from an input spatial audio signal (105) obtained from a recording in a recording space comprises a segmentor (110) and a generator (120). The segmentor (110) is configured for providing at least two input segmental audio signals (115) (W i , X i , Y i , Z i ) from the input spatial audio signal (105), wherein the at least two input segmental audio signals (1 15) (W i , X i , Y i , Z i ) are associated with corresponding segments (Seg i ) of the recording space. The generator (120) is configured for generating a parametric audio stream for each of the at least two input segmental audio signals (115) (W i , X i , Y i , Z i ) to obtain the plurality of parametric audio streams (125) (θ i , Ψ i , W i ).
摘要:
An apparatus for generating an audio output signal to simulate a recording of a virtual microphone at a configurable virtual position in an environment includes a sound events position estimator and an information computation module. The former is adapted to estimate a sound source position indicating a position of a sound source in the environment, wherein the sound events position estimator is adapted to estimate the sound source position based on first and second direction information provided by first and second real spatial microphones, respectively, located at first and second real microphone positions in the environment, respectively. The information computation module is adapted to generate the audio output signal based on a first recorded audio input signal, on the first real microphone position, on the virtual position of the virtual microphone, and on the sound source position.
摘要:
Audio splicing is rendered more effective by the use of one or more truncation unit packets inserted into the audio data stream so as to indicate to an audio decoder, for a predetermined access unit, an end portion of an audio frame with which the predetermined access unit is associated, as to be discarded in playout.
摘要:
An apparatus for generating a plurality of audio channels for a first speaker setup is characterized by an imaginary speaker determiner, an energy distribution calculator, a processor and a renderer. The imaginary speaker determiner is configured to determine a position of an imaginary speaker not contained in the first speaker setup to obtain a second speaker setup containing the imaginary speaker. The energy distribution calculator is configured to calculate an energy distribution from the imaginary speaker to the other speakers in the second speaker setup. The processor is configured to repeat the energy distribution to obtain a downmix information for a downmix from the second speaker setup to the first speaker setup. The renderer is configured to generate the plurality of audio channels using the downmix information.