摘要:
A multi-channel decorrelator for providing a plurality of decorrelated signals on the basis of a plurality of decorrelator input signals is configured to premix a first set of N decorrelator input signals into a second set of K decorrelator input signals, wherein K K'. The multi-channel decorrelator can be used in a multi-channel audio decoder. A multi-channel audio encoder provides complexity control information for the multi-channel decorrelator.
摘要:
An apparatus for processing an audio signal to obtain control information for a speech enhancement filter comprises a feature extractor for extracting at least one feature per frequency band of a plurality of frequency bands of a short-time spectral representation of a plurality of short-time spectral representations, where the at least one feature represents a spectral shape of the short-time spectral representation in the frequency band. The apparatus additionally comprises a feature combiner for combining the at least one feature for each frequency band using combination parameters to obtain the control information for the speech enhancement filter for a time portion of the audio signal. The feature combiner can use a neural network regression method, which is based on combination parameters determined in a training phase for the neural network.
摘要:
A multi-channel audio decoder for providing at least two output audio signals on the basis of an encoded representation is configured to render a plurality of decoded audio signals, which are obtained on the basis of the encoded representation, in dependence on one or more rendering parameters, to obtain a plurality of rendered audio signals. The multichannel audio decoder is configured to derive one or more decorrelated audio signals from the rendered audio signals, and to combine the rendered audio signals, or a scaled version thereof, with the one or more decorrelated audio signals, to obtain the output audio signals. A multi-channel audio encoder provides a decorrelation method parameter to control an audio decoder.
摘要:
An audio signal decoder for providing an upmix signal representation in dependence on a downmix signal representation and an object-related parametric information comprises an object separator configured to decompose the downmix signal representation, to provide a first audio information describing a first set of one or more audio objects of a first audio object type and a second audio information describing a second set of one or more audio objects of a second audio object type, in dependence on the downmix signal representation and using at least a part of the object-related parametric information. The audio signal decoder also comprises an audio signal processor configured to receive the second audio information and to process the second audio information in dependence on the object-related parametric information, to obtain a processed version of the second audio information. The audio signal decoder also comprises an audio signal combiner configured to combine the first audio information with the processed version of the second audio information, to obtain the upmix signal representation.
摘要:
A multi-channel decorrelator for providing a plurality of decorrelated signals on the basis of a plurality of decorrelator input signals is configured to premix a first set of N decorrelator input signals into a second set of K decorrelator input signals, wherein K K'. The multi-channel decorrelator can be used in a multi-channel audio decoder. A multi-channel audio encoder provides complexity control information for the multi-channel decorrelator.
摘要:
An audio signal encoder (600) for providing a downmix signal representation (614) and an object-related parametric information (616) on the basis of a plurality of object signals (x 1 to x N ) comprises a downmixer (620) configured to provide one or more downmix signals in dependence on downmix coefficients (d 1 to d N ) associated with the object signals (x 1 to x N ), such that the one or more downmix signals comprise a superposition of a plurality of object signals, and a side information provider (630) configured to provide an inter-object-relationship side information (OLD, IOC) describing level differences and correlation characteristics of object signals (x 1 to x N ) and an individual-object side information describing one or more individual properties of the individual object signals (x 1 to x N ).
摘要:
An audio signal decoder for providing an upmix signal representation in dependence on a downmix signal representation and an object-related parametric information comprises an object separator configured to decompose the downmix signal representation, to provide a first audio information describing a first set of one or more audio objects of a first audio object type and a second audio information describing a second set of one or more audio objects of a second audio object type, in dependence on the downmix signal representation and using at least a part of the object-related parametric information. The audio signal decoder also comprises an audio signal processor configured to receive the second audio information and to process the second audio information in dependence on the object-related parametric information, to obtain a processed version of the second audio information. The audio signal decoder also comprises an audio signal combiner configured to combine the first audio information with the processed version of the second audio information, to obtain the upmix signal representation.
摘要:
An apparatus for processing an audio signal to obtain control information for a speech enhancement filter comprises a feature extractor for extracting at least one feature per frequency band of a plurality of frequency bands of a short-time spectral representation of a plurality of short-time spectral representations, where the at least one feature represents a spectral shape of the short-time spectral representation in the frequency band. The apparatus additionally comprises a feature combiner for combining the at least one feature for each frequency band using combination parameters to obtain the control information for the speech enhancement filter for a time portion of the audio signal. The feature combiner can use a neural network regression method, which is based on combination parameters determined in a training phase for the neural network.
摘要:
A multi-channel decorrelator for providing a plurality of decorrelated signals on the basis of a plurality of decorrelator input signals is configured to premix a first set of N decorrelator input signals into a second set of K decorrelator input signals, wherein K K'. The multi-channel decorrelator can be used in a multi-channel audio decoder. A multi-channel audio encoder provides complexity control information for the multi-channel decorrelator.
摘要:
An apparatus for processing an audio signal to obtain control information for a speech enhancement filter comprises a feature extractor for extracting at least one feature per frequency band of a plurality of frequency bands of a short-time spectral representation of a plurality of short-time spectral representations, where the at least one feature represents a spectral shape of the short-time spectral representation in the frequency band. The apparatus additionally comprises a feature combiner for combining the at least one feature for each frequency band using combination parameters to obtain the control information for the speech enhancement filter for a time portion of the audio signal. The feature combiner can use a neural network regression method, which is based on combination parameters determined in a training phase for the neural network.