摘要:
An audio signal decoder has a time warp contour calculator, a time warp contour data rescaler and a warp decoder. The time warp contour calculator is configured to generate time warp contour data repeatedly restarting from a predetermined time warp contour start value, based on time warp contour evolution information describing a temporal evolution of the time warp contour. The time warp contour data rescaler is configured to rescale at least a portion of the time warp contour data such that a discontinuity at a restart is avoided, reduced or eliminated in a rescaled version of the time warp contour. The warp decoder is configured to provide the decoded audio signal representation, based on an encoded audio signal representation and using the rescaled version of the time warp contour.
摘要:
An encoder for providing an audio stream on the basis of a transform-domain representation of an input audio signal includes a quantization error calculator configured to determine a multi-band quantization error over a plurality of frequency bands of the input audio signal for which separate band gain information is available. The encoder also includes an audio stream provider for providing the audio stream such that the audio stream includes information describing an audio content of the frequency bands and information describing the multi-band quantization error.A decoder for providing a decoded representation of an audio signal on the basis of an encoded audio stream representing spectral components of frequency bands of the audio signal includes a noise filler for introducing noise into spectral components of a plurality of frequency bands to which separate frequency band gain information is associated on the basis of a common multi-band noise intensity value.
摘要:
An apparatus for encoding an audio signal includes the windower for windowing a first block of the audio signal using an analysis window having an aliasing portion and a further portion. The apparatus furthermore includes a processor for processing the first sub-block of the audio signal associated with the aliasing portion by transforming the sub-block from a domain into a different domain subsequent to windowing the first sub-block to obtain the processed first sub-block, and for processing a second sub-block of the audio signal associated with the further portion by transforming the second sub-block from the domain into the different domain before windowing the second sub-block to obtain a processed second sub-block. Thus, a critically sampled switch between two coding modes can be obtained.
摘要:
For generating a signal to be transmitted original information is encoded into a main channel and a side channel, wherein the side channel is more robust against channel influences than the main channel. On the receiver side, when the receive quality is above a threshold, which is necessitated to execute a successful decoding of the main channel, the main channel is reproduced. If the receive quality falls below this threshold, however, the side channel is reproduced which may have less bits than the main channel and which is a correspondingly lower quality representation of the original information than the main channel.
摘要:
An audio encoder adapted for encoding frames of a sampled audio signal to obtain encoded frames, wherein a frame includes a number of time domain audio samples. The audio encoder includes a predictive coding analysis stage for determining information on coefficients of a synthesis filter and a prediction domain frame based on a frame of audio samples. The audio encoder further includes a time-aliasing introducing transformer for transforming overlapping prediction domain frames to the frequency domain to obtain prediction domain frame spectra, wherein the time-aliasing introducing transformer is adapted for transforming the overlapping prediction domain frames in a critically-sampled way. Moreover, the audio encoder includes a redundancy reducing encoder for encoding the prediction domain frame spectra to obtain the encoded frames based on the coefficients and the encoded prediction domain frame spectra.
摘要:
A noise filler for providing a noise-filled spectral representation of an audio signal on the basis of an input spectral representation of the audio signal has a spectral region identifier configured to identify spectral regions of the input spectral representation spaced from non-zero spectral regions of the input spectral representation by at least one intermediate spectral region, to obtain identified spectral regions, and a noise inserter configured to selectively introduce noise into the identified spectral regions to obtain the noise-filled spectral representation of the audio signal. A noise filling parameter calculator for providing a noise filling parameter on the basis of a quantized spectral representation of an audio signal has a spectral region identifier, as mentioned above, and a noise value calculator configured to selectively consider quantization errors of the identified spectral regions for a calculation of the noise filling parameter. Accordingly, an encoded audio signal representation representing the audio signal can be obtained.
摘要:
The central idea of the present invention is that the prior procedure, namely interpolation relative to the filter coefficients and the amplification value, for obtaining interpolated values for the intermediate audio values starting from the nodes has to be dismissed. Coding containing less audible artifacts can be obtained by not interpolating the amplification value, but rather taking the power limit derived from the masking threshold, for each node, i.e. for each parameterization to be transferred, and then performing the interpolation between these power limits of neighboring nodes, such as, for example, a linear interpolation. On both the coder and the decoder side, an amplification value can then be calculated from the intermediate power limit determined such that the quantizing noise caused by quantization, which has a constant frequency before post-filtering on the decoder side, is below the power limit or corresponds thereto after post-filtering.
摘要:
For generating a signal to be transmitted original information is encoded into a main channel and a side channel, wherein the side channel is more robust against channel influences than the main channel. On the receiver side, when the receive quality is above a threshold, which is necessitated to execute a successful decoding of the main channel, the main channel is reproduced. If the receive quality falls below this threshold, however, the side channel is reproduced which may have less bits than the main channel and which is a correspondingly lower quality representation of the original information than the main channel.
摘要:
An embodiment of an apparatus for generating audio subband values in audio subband channels has an analysis windower for windowing a frame of time-domain audio input samples being in a time sequence extending from an early sample to a later sample using an analysis window function having a sequence of window coefficients to obtain windowed samples. The analysis window function has a first group of window coefficients and a second group of window coefficients. The first group of window coefficients is used for windowing later time-domain samples and the second group of window coefficients is used for windowing an earlier time-domain samples. The apparatus further has a calculator for calculating the audio subband values using the windowed samples.
摘要:
For embedding binary payload in a carrier signal, which, for example, is an audio signal, a sequence of time-discrete values of the carrier signal is converted to the frequency domain by means of an integer transform algorithm to obtain binary spectral representation values. Bits of the binary spectral representation values with a valency less than signal limit valency are determined and set according to the payload. The signal limit valency for a spectral representation value is less than the valency of the leading bit of this spectral representation value, so that, with adequate distance, a psychoacoustic transparent insertion of information is achieved. Thus a modified spectral representation with inserted information is generated which is finally converted back to the time domain using an integer back transform algorithm. For extracting the payload, the time-discrete signal with the inserted information is again converted to a spectral representation with the integer forward transform algorithm. Furthermore, signal limit valency information is determined to identify the bits of the binary spectral representation values containing no information regarding the carrier signal, but information regarding the payload signal, to extract these bits. The inventive concept is simple in its implementation and may be scaled with respect to the data rate of the information to be inserted.