摘要:
An apparatus for processing an audio signal to obtain control information for a speech enhancement filter has a feature extractor for extracting at least one feature per frequency band of a plurality of frequency bands of a short-time spectral representation of a plurality of short-time spectral representations, where the at least one feature represents a spectral shape of the short-time spectral representation in the frequency band. The apparatus additionally has a feature combiner for combining the at least one feature for each frequency band using combination parameters to obtain the control information for the speech enhancement filter for a time portion of the audio signal. The feature combiner can use a neural network regression method, which is based on combination parameters determined in a training phase for the neural network.
摘要:
An apparatus for processing an audio signal to obtain control information for a speech enhancement filter has a feature extractor for extracting at least one feature per frequency band of a plurality of frequency bands of a short-time spectral representation of a plurality of short-time spectral representations, where the at least one feature represents a spectral shape of the short-time spectral representation in the frequency band. The apparatus additionally has a feature combiner for combining the at least one feature for each frequency band using combination parameters to obtain the control information for the speech enhancement filter for a time portion of the audio signal. The feature combiner can use a neural network regression method, which is based on combination parameters determined in a training phase for the neural network.
摘要:
An apparatus for extracting an ambient signal from an input audio signal comprises a gain-value determinator configured to determine a sequence of time-varying ambient signal gain values for a given frequency band of the time-frequency distribution of the input audio signal in dependence on the input audio signal. The apparatus comprises a weighter configured to weight one of the sub-band signals representing the given frequency band of the time-frequency-domain representation with the time-varying gain values, to obtain a weighted sub-band signal. The gain-value determinator is configured to obtain one or more quantitative feature-values describing one or more features of the input audio signal and to provide the gain-value as a function of the one or more quantitative feature values such that the gain values are quantitatively dependent on the quantitative values. The gain value determinator is configured to determine the gain values such that ambience components are emphasized over non-ambience components in the weighted sub-band signal.
摘要:
An apparatus for extracting an ambient signal from an input audio signal comprises a gain-value determinator configured to determine a sequence of time-varying ambient signal gain values for a given frequency band of the time-frequency distribution of the input audio signal in dependence on the input audio signal. The apparatus comprises a weighter configured to weight one of the sub-band signals representing the given frequency band of the time-frequency-domain representation with the time-varying gain values, to obtain a weighted sub-band signal. The gain-value determinator is configured to obtain one or more quantitative feature-values describing one or more features of the input audio signal and to provide the gain-value as a function of the one or more quantitative feature values such that the gain values are quantitatively dependent on the quantitative values. The gain value determinator is configured to determine the gain values such that ambience components are emphasized over non-ambience components in the weighted sub-band signal.
摘要:
An audio signal decoder for providing an upmix signal representation in dependence on a downmix signal representation and an object-related parametric information includes an object separator configured to decompose the downmix signal representation, to provide a first audio information describing a first set of one or more audio objects of a first audio object type and a second audio information describing a second set of one or more audio objects of a second audio object type, in dependence on the downmix signal representation and using at least a part of the object-related parametric information.
摘要:
An audio signal decoder for providing an upmix signal representation in dependence on a downmix signal representation and an object-related parametric information includes an object separator configured to decompose the downmix signal representation, to provide a first audio information describing a first set of one or more audio objects of a first audio object type and a second audio information describing a second set of one or more audio objects of a second audio object type, in dependence on the downmix signal representation and using at least a part of the object-related parametric information.
摘要:
An apparatus for generating a multi-channel audio signal based on an input audio signal comprises a main signal upmixer, a section selector, a section signal upmixer and a combiner. The main signal upmixer is configured to provide a main multi-channel audio signal based on the input audio signal. The section selector is configured to select or not select a section of the input audio signal based on an analysis of the input audio signal. The selected section of the input audio signal, a processed selected section of the input audio signal or a reference signal associated with the selected section of the input audio signal is provided as section signal. The section signal upmixer is configured to provide a section upmix signal based on the section signal, and the combiner is configured to overlay the main multi-channel audio signal and the section upmix signal to obtain the multi-channel audio signal.
摘要:
An apparatus for providing one or more adjusted parameters for a provision of an upmix signal representation on the basis of a downmix signal representation and an object-related parametric information includes a parameter adjuster. The parameter adjuster is configured to receive one or more input parameters and to provide, on the basis thereof, one or more adjusted parameters. The parameter adjuster is configured to provide the one or more adjusted parameters in dependence on the one or more input parameters and the object-related parametric information, such that a distortion of the upmix signal representation caused by the use of non-optimal parameters is reduced at least for input parameters deviating from optimal parameters by more than a predetermined deviation.
摘要:
An apparatus for providing one or more adjusted parameters for a provision of an upmix signal representation on the basis of a downmix signal representation and an object-related parametric information includes a parameter adjuster. The parameter adjuster is configured to receive one or more input parameters and to provide, on the basis thereof, one or more adjusted parameters. The parameter adjuster is configured to provide the one or more adjusted parameters in dependence on the one or more input parameters and the object-related parametric information, such that a distortion of the upmix signal representation caused by the use of non-optimal parameters is reduced at least for input parameters deviating from optimal parameters by more than a predetermined deviation.