摘要:
An audio encoder, an audio decoder or an audio processor includes a filter for generating a filtered audio signal, the filter having a variable warping characteristic, the characteristic being controllable in response to a time-varying control signal, the control signal indicating a small or no warping characteristic or a comparatively high warping characteristic. Furthermore, a controller is connected for providing the time-varying control signal, which depends on the audio signal. The filtered audio signal can be introduced to an encoding processor having different encoding algorithms, one of which is a coding algorithm adapted to a specific signal pattern. Alternatively, the filter is a post-filter receiving a decoded audio signal.
摘要:
An audio encoder has a common preprocessing stage, an information sink based encoding branch such as spectral domain encoding branch, a information source based encoding branch such as an LPC-domain encoding branch and a switch for switching between these branches at inputs into these branches or outputs of these branches controlled by a decision stage. An audio decoder has a spectral domain decoding branch, an LPC-domain decoding branch, one or more switches for switching between the branches and a common post-processing stage for post-processing a time-domain audio signal for obtaining a post-processed audio signal.
摘要:
An apparatus for encoding an audio signal includes the windower for windowing a first block of the audio signal using an analysis window having an aliasing portion and a further portion. The apparatus furthermore includes a processor for processing the first sub-block of the audio signal associated with the aliasing portion by transforming the sub-block from a domain into a different domain subsequent to windowing the first sub-block to obtain the processed first sub-block, and for processing a second sub-block of the audio signal associated with the further portion by transforming the second sub-block from the domain into the different domain before windowing the second sub-block to obtain a processed second sub-block. The apparatus furthermore includes a transformer for converting the processed first sub-block and the processed second sub-block from the different domain into a further different domain using the same block transform rule to obtain a converted first block which may then be compressed using any of the well-known data compression algorithms. Thus, a critically sampled switch between two coding modes can be obtained, since aliasing portions occurring in two different domains are matched to each other.
摘要:
An audio encoder, an audio decoder or an audio processor includes a filter for generating a filtered audio signal, the filter having a variable warping characteristic, the characteristic being controllable in response to a time-varying control signal, the control signal indicating a small or no warping characteristic or a comparatively high warping characteristic. Furthermore, a controller is connected for providing the time-varying control signal, which depends on the audio signal. The filtered audio signal can be introduced to an encoding processor having different encoding algorithms, one of which is a coding algorithm adapted to a specific signal pattern. Alternatively, the filter is a post-filter receiving a decoded audio signal.
摘要:
An encoder for providing an audio stream on the basis of a transform-domain representation of an input audio signal includes a quantization error calculator configured to determine a multi-band quantization error over a plurality of frequency bands of the input audio signal for which separate band gain information is available. The encoder also includes an audio stream provider for providing the audio stream such that the audio stream includes information describing an audio content of the frequency bands and information describing the multi-band quantization error.A decoder for providing a decoded representation of an audio signal on the basis of an encoded audio stream representing spectral components of frequency bands of the audio signal includes a noise filler for introducing noise into spectral components of a plurality of frequency bands to which separate frequency band gain information is associated on the basis of a common multi-band noise intensity value.
摘要:
An apparatus for encoding an audio signal includes the windower for windowing a first block of the audio signal using an analysis window having an aliasing portion and a further portion. The apparatus furthermore includes a processor for processing the first sub-block of the audio signal associated with the aliasing portion by transforming the sub-block from a domain into a different domain subsequent to windowing the first sub-block to obtain the processed first sub-block, and for processing a second sub-block of the audio signal associated with the further portion by transforming the second sub-block from the domain into the different domain before windowing the second sub-block to obtain a processed second sub-block. Thus, a critically sampled switch between two coding modes can be obtained.
摘要:
A noise filler for providing a noise-filled spectral representation of an audio signal on the basis of an input spectral representation of the audio signal has a spectral region identifier configured to identify spectral regions of the input spectral representation spaced from non-zero spectral regions of the input spectral representation by at least one intermediate spectral region, to obtain identified spectral regions, and a noise inserter configured to selectively introduce noise into the identified spectral regions to obtain the noise-filled spectral representation of the audio signal. A noise filling parameter calculator for providing a noise filling parameter on the basis of a quantized spectral representation of an audio signal has a spectral region identifier, as mentioned above, and a noise value calculator configured to selectively consider quantization errors of the identified spectral regions for a calculation of the noise filling parameter. Accordingly, an encoded audio signal representation representing the audio signal can be obtained.
摘要:
If an adaptive prediction algorithm controllable by a speed coefficient is started from to operate with a first adaption speed and a first adaption precision and an accompanying first prediction precision in the case that the speed coefficient has a first value and to operate with a second, compared to the first one, lower adaption speed and a second, but compared to the first one, higher precision in the case that the speed parameter has a second value, the adaption durations occurring after the reset times where the prediction errors are at first increased due to the, not yet, adapted prediction coefficients may be decreased by at first setting the speed parameter to the first value and, after a while, to a second value. After the speed parameter has again been set to the second value after a predetermined duration after the reset times, the prediction errors and thus the residuals to be transmitted are more optimized or smaller than would be possible with the first speed parameter value.
摘要:
A very coarse quantization exceeding the measure determined by the masking threshold without or only very little quality losses is enabled by quantizing not immediately the prefiltered signal, but a prediction error obtained by forward-adaptive prediction of the prefiltered signal. Due to the forward adaptivity, the quantizing error has no negative effect on the prediction on the decoder side.
摘要:
An audio encoder has a common preprocessing stage, an information sink based encoding branch such as spectral domain encoding branch, a information source based encoding branch such as an LPC-domain encoding branch and a switch for switching between these branches at inputs into these branches or outputs of these branches controlled by a decision stage. An audio decoder has a spectral domain decoding branch, an LPC-domain decoding branch, one or more switches for switching between the branches and a common post-processing stage for post-processing a time-domain audio signal for obtaining a post-processed audio signal.