摘要:
An audio signal decoder for providing an upmix signal representation on the basis of a downmix signal representation and an object-related parametric information and in dependence on a rendering information has an object parameter determinator. The object parameter determinator is configured to obtain inter-object-correlation values for a plurality of pairs of audio objects. The object parameter determinator is configured to evaluate a bitstream signaling parameter in order to decide whether to evaluate individual inter-object-correlation bitstream parameter values to obtain inter-object-correlation values for a plurality of pairs of related audio objects, or to obtain inter-object-correlation values for a plurality of pairs of related audio objects using a common inter-object-correlation bitstream parameter value. The audio signal decoder also has a signal processor configured to obtain the upmix signal representation on the basis of the downmix signal representation and using the inter-object-correlation values for a plurality of pairs of related objects and the rendering information.
摘要:
A transient detector is provided for generating an ambience signal suitable for being emitted via loudspeakers for which there is no special loudspeaker signal to detect a transient period. A synthesis signal generator produces a synthesis signal which fulfills the transient condition on the one hand and the continuity condition for the synthesis signal on the other hand. A signal substituter will then substitute a portion of the examination signal by the synthesis signal to obtain an ambience signal for the surround channels.
摘要:
An apparatus for providing an upmix signal representation on the basis of a downmix signal representation and an object-related parametric information, which are included in a bitstream representation of an audio content, and in dependence on a rendering information, has a distortion limiter configured to adjust upmix parameters using a distortion control scheme to avoid or limit audible distortions which are caused by an inappropriate choice of rendering parameters. The distortion limiter is configured to obtain a distortion limitation control parameter, which is included in the bitstream representation of the audio content, and to adjust a distortion control scheme in dependence on the distortion limitation control parameter.
摘要:
A multi-mode audio signal decoder has a spectral value determinator to obtain sets of decoded spectral coefficients for a plurality of portions of an audio content and a spectrum processor configured to apply a spectral shaping to a set of spectral coefficients in dependence on a set of linear-prediction-domain parameters for a portion of the audio content encoded in a linear-prediction mode, and in dependence on a set of scale factor parameters for a portion of the audio content encoded in a frequency-domain mode. The audio signal decoder has a frequency-domain-to-time-domain converter configured to obtain a time-domain audio representation on the basis of a spectrally-shaped set of decoded spectral coefficients for a portion of the audio content encoded in the linear-prediction mode and for a portion of the audio content encoded in the frequency domain mode. An audio signal encoder is also described.
摘要:
In order to generate a multi-channel signal having a number of output channels greater than a number of input channels, a mixer is used for upmixing the input signal to form at least a direct channel signal and at least an ambience channel signal. A speech detector is provided for detecting a section of the input signal, the direct channel signal or the ambience channel signal in which speech portions occur. Based on this detection, a signal modifier modifies the input signal or the ambience channel signal in order to attenuate speech portions in the ambience channel signal, whereas such speech portions in the direct channel signal are attenuated to a lesser extent or not at all. A loudspeaker signal outputter then maps the direct channel signals and the ambience channel signals to loudspeaker signals which are associated to a defined reproduction scheme, such as, for example, a 5.1 scheme.
摘要:
A selected channel of a multi-channel signal which is represented by frames composed from sampling values having a high time resolution can be encoded with higher quality when a wave form parameter representation representing a wave form of an intermediate resolution representation of the selected channel is derived, the wave form parameter representation including a sequence of intermediate wave form parameters having a time resolution lower than the high time resolution of the sampling values and higher than a time resolution defined by a frame repetition rate. The wave form parameter representation with the intermediate resolution can be used to shape a reconstructed channel to retrieve a channel having a signal envelope close to that one of the selected original channel. The time scale on which the shaping is performed is shorter than the time scale of a framewise processing, thus enhancing the quality of the reconstructed channel. On the other hand, the shaping time scale is larger than the time scale of the sampling values, significantly reducing the amount of data needed by the wave form parameter representation.
摘要:
An apparatus and a method for generating a multi-channel synthesizer control signal, a multi-channel synthesizer, a method of generating an output signal from an input signal and a machine-readable storage medium are provided. On an encoder-side, a multi-channel input signal is analyzed for obtaining smoothing control information, which is to be used by a decoder-side multi-channel synthesis for smoothing quantized transmitted parameters or values derived from the quantized transmitted parameters for providing an improved subjective audio quality in particular for slowly moving point sources and rapidly moving point sources having tonal material such as fast moving sinusoids.
摘要:
An apparatus for generating an enhanced downmix signal on the basis of a multi-channel microphone signal has a spatial analyzer configured to compute a set of spatial cue parameters having a direction information describing a direction-of-arrival of a direct sound, a direct sound power information and a diffuse sound power information on the basis of the multi-channel microphone signal. The apparatus also has a filter calculator for calculating enhancement filter parameters in dependence on the direction information describing the direction-of-arrival of the direct sound, in dependence on the direct sound power information and in dependence on the diffuse sound power information. The apparatus also has a filter for filtering the microphone signal, or a signal derived therefrom, using the enhancement filter parameters, to obtain the enhanced downmix signal.
摘要:
An apparatus for generating an ambient signal from an audio signal includes a compressor for lossy compression of a representation of the audio signal so as to obtain a compressed representation of the audio signal describing a compressed audio signal. The apparatus for generating the ambient signal further includes a calculator for calculating a difference between the compressed representation of the audio signal and the representation of the audio signal so as to obtain a discrimination representation. The apparatus further includes a provider for providing the ambient signal using the discrimination representation. An apparatus for deriving a multi-channel audio signal from an audio signal includes an apparatus for generating an ambient signal from an audio signal, an apparatus for providing the audio signal as a front-loudspeaker signal and an apparatus for providing the ambient signal as a back-loudspeaker signal.
摘要:
An apparatus for extracting a direct and/or ambience signal from a downmix signal and spatial parametric information, the downmix signal and the spatial parametric information representing a multi-channel audio signal having more channels than the downmix signal, wherein the spatial parametric information has inter-channel relations of the multi-channel audio signal, is described. The apparatus has a direct/ambience estimator and a direct/ambience extractor. The direct/ambience estimator is configured for estimating a level information of a direct portion and/or an ambient portion of the multi-channel audio signal based on the spatial parametric information. The direct/ambience extractor is configured for extracting a direct signal portion and/or an ambient signal portion from the downmix signal based on the estimated level information of the direct portion or the ambient portion.