摘要:
An apparatus for generating an audio output signal to simulate a recording of a virtual microphone at a configurable virtual position in an environment includes a sound events position estimator and an information computation module. The former is adapted to estimate a sound source position indicating a position of a sound source in the environment, wherein the sound events position estimator is adapted to estimate the sound source position based on first and second direction information provided by first and second real spatial microphones, respectively, located at first and second real microphone positions in the environment, respectively. The information computation module is adapted to generate the audio output signal based on a first recorded audio input signal, on the first real microphone position, on the virtual position of the virtual microphone, and on the sound source position.
摘要:
An apparatus (100) for playing back an audio object associated with a position is provided. The apparatus (100) comprises a distance calculator (110) for calculating distances of the position to speakers or for reading the distances of the position to the speakers. The distance calculator (110) is configured to take a solution with a smallest distance. The apparatus (100) is configured to play back the audio object using the speaker corresponding to the solution.
摘要:
An audio signal processing device (1) for downmixing of a first input signal (X 1 ) and a second input signal (X 2 ) to a downmix signal (X D ) comprising: a dissimilarity extractor (2) configured to receive the first input signal (X 1 ) and the second input (X 2 ) signal as well as to output an extracted signal (Û 2 ), which is lesser correlated with respect to the first input signal (X 1 ) than the second input signal (X 2 ) and a combiner (3) configured to combine the first input signal (X 1 ) and the extracted signal (Û 2 ) in order to obtain the downmix signal (X D ).
摘要:
A method for mapping a plurality of input channels of an input channel configuration to output channels of an output channel configuration comprises providing a set of rules associated with each input channel of the plurality of input channels, wherein the rules define different mappings between the associated input channel and a set of output channels. For each input channel of the plurality of input channels, a rule associated with the input channel is accessed, determination is made whether the set of output channels defined in the accessed rule is present in the output channel configuration, and the accessed rule is selected if the set of output channels defined in the accessed rule is present in the output channel configuration. The input channels are mapped to the output channels according to the selected rule.
摘要:
A decoder for generating an audio output signal comprising one or more audio output channels from a downmix signal is provided. The downmix signal encodes one or more audio object signals. The decoder comprises a control unit (181) for setting an activation indication to an activation state depending on a signal property of at least one of the one or more audio object signals. Moreover, the decoder comprises a first analysis module (182) for transforming the downmix signal to obtain a first transformed downmix comprising a plurality of first subband channels. Furthermore, the decoder comprises a second analysis module (183) for generating, when the activation indication is set to the activation state, a second transformed downmix by transforming at least one of the first subband channels to obtain a plurality of second subband channels, wherein the second transformed downmix comprises the first subband channels which have not been transformed by the second analysis module and the second subband channels. Moreover, the decoder comprises an un-mixing unit (184), wherein the un-mixing unit (184) is configured to un-mix the second transformed downmix, when the activation indication is set to the activation state, based on parametric side information on the one or more audio object signals to obtain the audio output signal, and to un-mix the first transformed downmix, when the activation indication is not set to the activation state, based on the parametric side information on the one or more audio object signals to obtain the audio output signal. Furthermore, an encoder is provided.
摘要:
An apparatus for generating an audio output signal to simulate a recording of a virtual microphone at a configurable virtual position in an environment includes a sound events position estimator and an information computation module. The former is adapted to estimate a sound source position indicating a position of a sound source in the environment, wherein the sound events position estimator is adapted to estimate the sound source position based on first and second direction information provided by first and second real spatial microphones, respectively, located at first and second real microphone positions in the environment, respectively. The information computation module is adapted to generate the audio output signal based on a first recorded audio input signal, on the first real microphone position, on the virtual position of the virtual microphone, and on the sound source position.
摘要:
A multi-mode audio signal decoder has a spectral value determinator to obtain sets of decoded spectral coefficients for a plurality of portions of an audio content and a spectrum processor configured to apply a spectral shaping to a set of spectral coefficients in dependence on a set of linear-prediction-domain parameters for a portion of the audio content encoded in a linear-prediction mode, and in dependence on a set of scale factor parameters for a portion of the audio content encoded in a frequency-domain mode. The audio signal decoder has a frequency-domain-to-time-domain converter configured to obtain a time-domain audio representation on the basis of a spectrally-shaped set of decoded spectral coefficients for a portion of the audio content encoded in the linear-prediction mode and for a portion of the audio content encoded in the frequency domain mode. An audio signal encoder is also described.
摘要:
An apparatus for generating an enhanced downmix signal on the basis of a multi-channel microphone signal has a spatial analyzer configured to compute a set of spatial cue parameters having a direction information describing a direction-of-arrival of a direct sound, a direct sound power information and a diffuse sound power information on the basis of the multi-channel microphone signal. The apparatus also has a filter calculator for calculating enhancement filter parameters in dependence on the direction information describing the direction-of-arrival of the direct sound, in dependence on the direct sound power information and in dependence on the diffuse sound power information. The apparatus also has a filter for filtering the microphone signal, or a signal derived therefrom, using the enhancement filter parameters, to obtain the enhanced downmix signal.