Abstract:
An apparatus for encoding a first channel and a second channel of an audio input signal including two or more channels to obtain an encoded audio signal according to an embodiment includes a normalizer configured to determine a normalization value for the audio input signal depending on the first channel of the audio input signal and depending on the second channel of the audio input signal. Moreover, the apparatus includes an encoding unit configured to generate a processed audio signal having a first channel and a second channel. The encoding unit is configured to encode the processed audio signal to obtain the encoded audio signal.
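The abstract does not say how the normalization value is computed or how the encoding unit uses it. As a rough sketch only, the following assumes the value is the ratio of the two channels' RMS levels and that the second channel is scaled by it so both channels reach a similar level. The function names `rms`, `normalization_value`, and `apply_normalization` are hypothetical and not taken from the source.

```python
import math


def rms(channel):
    """Root-mean-square level of one channel (assumes a non-empty list)."""
    return math.sqrt(sum(x * x for x in channel) / len(channel))


def normalization_value(first, second, eps=1e-12):
    """Assumed form: a single value that depends on BOTH channels,
    here the ratio of their RMS levels. eps guards against a silent
    second channel."""
    return rms(first) / max(rms(second), eps)


def apply_normalization(first, second):
    """Return a processed two-channel signal plus the value used.
    Scaling only the second channel is an illustrative choice."""
    g = normalization_value(first, second)
    return list(first), [g * x for x in second], g


first = [1.0, -1.0, 1.0, -1.0]
second = [0.5, -0.5, 0.5, -0.5]
proc_first, proc_second, g = apply_normalization(first, second)
print(g)            # ratio of RMS levels
print(proc_second)  # second channel scaled to the first channel's level
```

A decoder would need the same value to undo the scaling, which is one reason a single value derived from both channels is convenient to transmit.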
Abstract:
An apparatus for decomposing an audio signal (100) into a background component signal (140) and a foreground component signal (150), comprises: a block generator (110) for generating a time sequence of blocks of audio signal values; an audio signal analyzer (120) for determining a block characteristic of a current block of the audio signal and for determining an average characteristic for a group of blocks, the group of blocks comprising at least two blocks; and a separator (130) for separating the current block into a background portion and a foreground portion in response to a ratio of the block characteristic of the current block and the average characteristic of the group of blocks, wherein the background component signal (140) comprises the background portion of the current block and the foreground component signal (150) comprises the foreground portion of the current block.
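The ratio-driven separation can be illustrated with a minimal sketch. It assumes the block characteristic is the block energy, the average characteristic is the mean energy of the group, and a block whose ratio exceeds a hypothetical `threshold` is split by a gain, with the remainder assigned to the foreground. None of these specifics come from the abstract.

```python
def block_energy(block):
    """Assumed block characteristic: sum of squared sample values."""
    return sum(x * x for x in block)


def separate_block(block, group, threshold=2.0):
    """Split `block` into (background, foreground) portions based on the
    ratio of its energy to the average energy of `group`."""
    energy = block_energy(block)
    average = sum(block_energy(b) for b in group) / len(group)
    # Simplification: silent blocks or groups are treated as pure background.
    if energy == 0.0 or average == 0.0:
        return list(block), [0.0] * len(block)
    ratio = energy / average
    if ratio <= threshold:
        return list(block), [0.0] * len(block)
    # Keep only as much energy in the background as the threshold allows.
    bg_gain = (threshold / ratio) ** 0.5
    background = [bg_gain * x for x in block]
    foreground = [x - b for x, b in zip(block, background)]
    return background, foreground


quiet = [0.1] * 4
loud = [1.0] * 4
group = [quiet, quiet, quiet, loud]
print(separate_block(quiet, group))  # everything stays in the background
print(separate_block(loud, group))   # part of the block moves to the foreground
```

In this sketch the two portions always sum back to the original block, so the background and foreground component signals together reconstruct the input.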
Abstract:
An apparatus for decomposing an audio signal into a background component signal and a foreground component signal, comprises: a block generator (110) for generating a time sequence of blocks of audio signal values; an audio signal analyzer (120) for determining a characteristic of a current block of the audio signal and for determining a variability of the characteristic within a group of blocks comprising at least two blocks of the sequence of blocks; and a separator (130) for separating the current block into a background portion (140) and a foreground portion (150) wherein the separator (130) is configured to determine (182) a separation threshold based on the variability and to separate the current block into the background component signal (140) and the foreground component signal (150), when the characteristic of the current block is in a predetermined relation to the separation threshold.
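One way to picture a variability-based separation threshold is sketched below. It assumes the characteristic is a per-block scalar (for example energy), that variability is the population standard deviation over the group, and that the threshold is the group mean plus `k` standard deviations. The threshold formula, the factor `k`, and the "greater than" relation are illustrative assumptions, not details from the abstract.

```python
from statistics import mean, pstdev


def separation_threshold(characteristics, k=1.0):
    """Assumed form: mean of the group plus k times its variability."""
    return mean(characteristics) + k * pstdev(characteristics)


def classify_blocks(characteristics, k=1.0):
    """Label a block 'foreground' when its characteristic exceeds the
    threshold (the assumed 'predetermined relation'), else 'background'."""
    t = separation_threshold(characteristics, k)
    return ["foreground" if c > t else "background" for c in characteristics]


energies = [1.0, 1.0, 1.0, 1.0, 6.0]
print(separation_threshold(energies))
print(classify_blocks(energies))
```

Because the threshold follows the variability of the group, a steady signal yields a tight threshold while a fluctuating one tolerates larger excursions before a block counts as foreground.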
Abstract:
An apparatus for mapping a first input channel and a second input channel of an input channel configuration to at least one output channel of an output channel configuration, wherein each input channel and each output channel has a direction in which an associated loudspeaker is located relative to a central listener position, wherein the apparatus is configured to map the first input channel to a first output channel of the output channel configuration. The apparatus is further configured to at least one of a) map the second input channel to the first output channel, comprising processing the second input channel by applying at least one of an equalization filter and a decorrelation filter to the second input channel, and b) despite the fact that an angle deviation between a direction of the second input channel and a direction of the first output channel is less than an angle deviation between the direction of the second input channel and a direction of a second output channel and/or is less than an angle deviation between the direction of the second input channel and a direction of a third output channel, map the second input channel to the second and third output channels by panning between the second and third output channels.
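Panning an input channel between two output channels can be sketched as follows. The abstract does not name a panning law, so this example assumes a constant-power sine/cosine law driven by azimuth angles in degrees. The function name `pan_gains` and its parameters are hypothetical.

```python
import math


def pan_gains(source_az, out_a_az, out_b_az):
    """Constant-power gains for rendering a source at `source_az` over two
    loudspeakers at `out_a_az` and `out_b_az` (all azimuths in degrees).
    Returns (gain_a, gain_b) with gain_a**2 + gain_b**2 == 1."""
    # Relative position of the source between the two loudspeakers,
    # clamped to [0, 1] so sources outside the pair stick to the nearer one.
    p = (source_az - out_a_az) / (out_b_az - out_a_az)
    p = min(1.0, max(0.0, p))
    theta = p * math.pi / 2
    return math.cos(theta), math.sin(theta)


def pan_channel(samples, source_az, out_a_az, out_b_az):
    """Distribute one input channel over two output channels."""
    ga, gb = pan_gains(source_az, out_a_az, out_b_az)
    return [ga * x for x in samples], [gb * x for x in samples]


# An input channel at 0 degrees rendered over outputs at -30 and +30 degrees.
print(pan_gains(0.0, -30.0, 30.0))
```

A constant-power law keeps the summed energy of the two output channels equal to that of the input channel, which is a common design choice when a phantom source replaces a missing loudspeaker.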
Abstract:
Embodiments provide a digital processor including an ambient portion extractor and a spatial effect processing stage. The ambient portion extractor is configured to extract an ambient portion from a multi-channel signal. The spatial effect processing stage is configured to generate a spatial effect signal based on the ambient portion of the multi-channel signal. The digital processor is configured to combine the multi-channel signal or a processed version thereof with the spatial effect signal.
Abstract:
An audio signal decoder for providing an upmix signal representation on the basis of a downmix signal representation and an object-related parametric information and in dependence on a rendering information comprises an object parameter determinator. The object parameter determinator is configured to obtain inter-object-correlation values for a plurality of pairs of audio objects. The object parameter determinator is configured to evaluate a bitstream signaling parameter in order to decide whether to evaluate individual inter-object-correlation bitstream parameter values to obtain inter-object-correlation values for a plurality of pairs of related audio objects, or to obtain inter-object-correlation values for a plurality of pairs of related audio objects using a common inter-object-correlation bitstream parameter value. The audio signal decoder also comprises a signal processor configured to obtain the upmix signal representation on the basis of the downmix signal representation and using the inter-object-correlation values for a plurality of pairs of related objects and the rendering information.
Abstract:
An apparatus for extracting a direct and/or ambience signal from a downmix signal and spatial parametric information, the downmix signal and the spatial parametric information representing a multi-channel audio signal having more channels than the downmix signal, wherein the spatial parametric information comprises inter-channel relations of the multi-channel audio signal, is described. The apparatus comprises a direct/ambience estimator and a direct/ambience extractor. The direct/ambience estimator is configured for estimating a level information of a direct portion and/or an ambient portion of the multi-channel audio signal based on the spatial parametric information. The direct/ambience extractor is configured for extracting a direct signal portion and/or an ambient signal portion from the downmix signal based on the estimated level information of the direct portion or the ambient portion.
Abstract:
An apparatus for generating one or more audio output channels is provided. The apparatus comprises a parameter processor (110) for calculating mixing information and a downmix processor (120) for generating the one or more audio output channels. The downmix processor (120) is configured to receive an audio transport signal comprising one or more audio transport channels. One or more audio channel signals and one or more audio object signals are mixed within the audio transport signal, and the number of the one or more audio transport channels is smaller than the number of the one or more audio channel signals plus the number of the one or more audio object signals. The parameter processor (110) is configured to receive downmix information indicating how the one or more audio channel signals and the one or more audio object signals are mixed within the one or more audio transport channels, and is further configured to receive covariance information. Moreover, the parameter processor (110) is configured to calculate the mixing information depending on the downmix information and depending on the covariance information. The downmix processor (120) is configured to generate the one or more audio output channels from the audio transport signal depending on the mixing information. The covariance information indicates level difference information for at least one of the one or more audio channel signals and further indicates level difference information for at least one of the one or more audio object signals. However, the covariance information does not indicate correlation information for any pair of one of the one or more audio channel signals and one of the one or more audio object signals.
Abstract:
A decoder for generating an audio output signal having one or more audio output channels from a downmix signal having one or more downmix channels is provided. The downmix signal encodes one or more audio object signals. The decoder has a threshold determiner for determining a threshold value depending on a signal energy and/or a noise energy of at least one of the one or more audio object signals and/or depending on a signal energy and/or a noise energy of at least one of the one or more downmix channels. Moreover, the decoder has a processing unit for generating the one or more audio output channels from the one or more downmix channels depending on the threshold value.