Abstract:
A parametric representation of a multi-channel audio signal describes the spatial properties of the signal well with compact side information when coherence information, describing the coherence between a first and a second channel, is derived within a hierarchical encoding process only for channel pairs in which the first channel carries only information from the left side with respect to a listening position and the second channel carries only information from the right side. Because the hierarchical process iteratively downmixes the audio channels of the signal into monophonic channels, the relevant parameters can be picked from an encoding step that involves only channel pairs carrying the information needed to describe the spatial properties of the multi-channel audio signal.
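The selection rule above can be illustrated with a minimal sketch. The five-channel layout, the averaging downmix, the tree order, and the names `coherence`, `downmix` and `encode_five_channels` are assumptions made for illustration only; they are not taken from the abstract.

```python
import math

def coherence(left, right):
    """Normalized cross-correlation at lag zero between two channels."""
    if len(left) != len(right):
        raise ValueError("channels must have the same length")
    cross = sum(l * r for l, r in zip(left, right))
    energy = math.sqrt(sum(l * l for l in left) * sum(r * r for r in right))
    return cross / energy if energy > 0.0 else 0.0

def downmix(a, b):
    """Hypothetical downmix: plain average of two channels."""
    return [0.5 * (x + y) for x, y in zip(a, b)]

def encode_five_channels(lf, ls, rf, rs, c):
    """Hierarchical downmix that keeps one coherence value only.

    The tree order (front/surround first, then left/right, then center)
    is an assumption of this sketch.
    """
    left = downmix(lf, ls)        # left-only pair: no coherence stored
    right = downmix(rf, rs)       # right-only pair: no coherence stored
    icc = coherence(left, right)  # the only pure left/right pair
    stereo = downmix(left, right)
    mono = downmix(stereo, c)
    return mono, icc
```

In this sketch the side information carries a single coherence value per frame, taken from the one encoding step whose inputs are a pure left channel and a pure right channel.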
Abstract:
The temporal envelope of a decorrelated signal derived from an original signal can be shaped without introducing additional distortion when a spectral flattener is used to flatten the spectra of the decorrelated signal and of the original signal, the flattened spectra are then used to derive a gain factor describing the energy distribution between them, and an envelope shaper uses the derived gain factor to shape the temporal envelope of the decorrelated signal.
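A rough time-domain sketch of this chain follows. It assumes a first-order linear-prediction residual as the spectral flattener, block-wise energies, and a square-root energy ratio as the gain factor; these choices and the names `flatten` and `shape_envelope` are illustrative assumptions, not details stated in the abstract.

```python
import math

def flatten(x):
    """Crude spectral flattener: first-order linear-prediction residual."""
    r0 = sum(v * v for v in x)
    r1 = sum(x[n] * x[n - 1] for n in range(1, len(x)))
    a = r1 / r0 if r0 > 0.0 else 0.0
    return [x[0]] + [x[n] - a * x[n - 1] for n in range(1, len(x))]

def shape_envelope(original, decorrelated, block=4, eps=1e-12):
    """Scale each block of the decorrelated signal by a gain derived
    from the energies of the two flattened signals (equal lengths assumed)."""
    flat_o = flatten(original)
    flat_d = flatten(decorrelated)
    shaped = []
    for start in range(0, len(decorrelated), block):
        stop = start + block
        e_o = sum(v * v for v in flat_o[start:stop])
        e_d = sum(v * v for v in flat_d[start:stop])
        gain = math.sqrt(e_o / (e_d + eps))  # energy ratio of flattened blocks
        shaped.extend(gain * v for v in decorrelated[start:stop])
    return shaped
```

Because the gain is computed from the flattened signals, it tracks the temporal energy distribution rather than differences in spectral tilt between the two signals.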
Abstract:
An audio signal decoder includes a transform domain path configured to obtain a time-domain representation of a portion of an audio content on the basis of a first set of spectral coefficients, a representation of an aliasing-cancellation stimulus signal and a plurality of linear-prediction-domain parameters. The transform domain path applies spectral shaping to the first set of spectral coefficients to obtain a spectrally shaped version thereof, and obtains a time-domain representation of the audio content on the basis of that spectrally shaped version. The transform domain path includes an aliasing-cancellation stimulus filter to filter the aliasing-cancellation stimulus signal in dependence on at least a subset of the linear-prediction-domain parameters. The transform domain path also includes a combiner configured to combine the time-domain representation of the audio content with an aliasing-cancellation synthesis signal to obtain an aliasing-reduced time-domain signal.
Abstract:
A parameter transformer generates level parameters indicating an energy relation between a first and a second audio channel of a multi-channel audio signal associated with a multi-channel loudspeaker configuration. The level parameters are generated based on object parameters for a plurality of audio objects associated with a down-mix channel, which is generated using object audio signals associated with the audio objects. The object parameters comprise an energy parameter indicating an energy of the object audio signal. To derive the coherence and the level parameters, a parameter generator is used, which combines the energy parameter with object rendering parameters that depend on a desired rendering configuration.
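As a minimal sketch of how an energy parameter and rendering parameters might be combined, the function below computes a level difference in dB between two rendered channels. It assumes mutually uncorrelated objects and per-object rendering gains; the name `level_parameter` and the dB formulation are assumptions for illustration, not the method claimed in the abstract.

```python
import math

def level_parameter(object_energies, gains_ch1, gains_ch2, eps=1e-12):
    """Level difference in dB between two rendered channels.

    object_energies: energy of each audio object
    gains_ch1, gains_ch2: hypothetical rendering gains of each object
    into the first and second channel. Objects are assumed uncorrelated,
    so channel energies are sums of weighted object energies.
    """
    e1 = sum(g * g * e for g, e in zip(gains_ch1, object_energies))
    e2 = sum(g * g * e for g, e in zip(gains_ch2, object_energies))
    return 10.0 * math.log10((e1 + eps) / (e2 + eps))
```

For example, two objects with energies 4 and 1, each rendered with unit gain to a different channel, give a level difference of about 6 dB.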
Abstract:
Binaural rendering of a multi-channel audio signal into a binaural output signal (24) is described. The multi-channel audio signal comprises a stereo downmix signal (18) into which a plurality of audio signals are downmixed, and side information comprising downmix information (DMG, DCLD) indicating, for each audio signal, to what extent the respective audio signal has been mixed into a first channel and a second channel of the stereo downmix signal (18), respectively, as well as object level information of the plurality of audio signals and inter-object cross-correlation information describing similarities between pairs of audio signals of the plurality of audio signals. Based on a first rendering prescription, a preliminary binaural signal (54) is computed from the first and second channels of the stereo downmix signal (18). A decorrelated signal (X_d^{n,k}) is generated as a perceptual equivalent to a mono downmix (58) of the first and second channels of the stereo downmix signal (18) that is, however, decorrelated from the mono downmix (58). Depending on a second rendering prescription (P_2^{1,m}), a corrective binaural signal (64) is computed from the decorrelated signal (62), and the preliminary binaural signal (54) is mixed with the corrective binaural signal (64) to obtain the binaural output signal (24).
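The signal flow described above can be sketched as follows. The sketch works on real-valued time-domain samples, models the first rendering prescription as a 2x2 gain matrix `g`, models the second rendering prescription as two gains `p2`, and uses a plain delay as a stand-in decorrelator; all of these, and the name `render_binaural`, are simplifying assumptions rather than the processing defined by the abstract.

```python
def render_binaural(down_l, down_r, g, p2, delay=2):
    """Mix a preliminary binaural signal with a corrective one.

    g:  hypothetical 2x2 matrix standing in for the first rendering prescription
    p2: hypothetical pair of gains standing in for the second prescription
    """
    # Mono downmix of the two stereo downmix channels.
    mono = [0.5 * (l + r) for l, r in zip(down_l, down_r)]
    # Stand-in decorrelator: a plain delay (a real decorrelator would
    # typically use all-pass or reverberation-like filtering).
    decorr = [0.0] * delay + mono[:len(mono) - delay]
    out_l, out_r = [], []
    for l, r, d in zip(down_l, down_r, decorr):
        # Preliminary binaural signal plus corrective binaural signal.
        out_l.append(g[0][0] * l + g[0][1] * r + p2[0] * d)
        out_r.append(g[1][0] * l + g[1][1] * r + p2[1] * d)
    return out_l, out_r
```

Feeding the decorrelated signal into the two ears with opposite signs, as in `p2 = [0.5, -0.5]`, lowers the correlation between the output channels without changing the preliminary part.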
Abstract:
An apparatus for generating a bandwidth-extended audio signal from an input signal includes a patch generator for generating one or more patch signals from the input signal, wherein the patch generator is configured to perform a time stretching of subband signals from an analysis filterbank, and wherein the patch generator further includes a phase adjuster for adjusting phases of the subband signals using a filterbank-channel-dependent phase correction.
Abstract:
An audio signal decoder, configured to provide a decoded audio signal representation on the basis of an encoded audio signal representation that includes sampling frequency information, encoded time warp information and an encoded spectrum representation, includes a time warp calculator and a warp decoder. The time warp calculator is configured to adapt, in dependence on the sampling frequency information, a mapping rule for mapping codewords of the encoded time warp information onto decoded time warp values describing the decoded time warp information. The warp decoder is configured to provide the decoded audio signal representation on the basis of the encoded spectrum representation and in dependence on the decoded time warp information.
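One way such a sampling-frequency-dependent mapping rule might look is sketched below. The table values, the reference rate, the inverse-proportional scaling, and the names `adapt_mapping` and `decode_time_warp` are all hypothetical; the abstract does not specify how the mapping rule is adapted.

```python
# Hypothetical codebook of relative warp values at a reference sampling rate.
BASE_WARP_TABLE = (0.97, 0.98, 0.99, 1.0, 1.01, 1.02, 1.03)
REFERENCE_RATE_HZ = 24000  # hypothetical reference rate

def adapt_mapping(sampling_rate_hz):
    """Scale the deviation from 1.0 inversely with the sampling rate.

    Assumption: a frame of fixed sample count lasts longer at lower rates,
    so the same codeword should cover a larger warp per frame.
    """
    scale = REFERENCE_RATE_HZ / sampling_rate_hz
    return tuple(1.0 + (v - 1.0) * scale for v in BASE_WARP_TABLE)

def decode_time_warp(codewords, sampling_rate_hz):
    """Map codewords onto decoded time warp values using the adapted table."""
    table = adapt_mapping(sampling_rate_hz)
    return [table[c] for c in codewords]
```

With this sketch, the codeword that means "no warp" decodes to 1.0 at every sampling rate, while the other codewords span a wider range at lower rates.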