Abstract:
A codec that supports switching between a time-domain aliasing cancellation transform coding mode and a time-domain coding mode is made more robust against frame loss by adding a further syntax portion to the frames. Depending on this syntax portion, the parser of the decoder may select between a first action, in which it expects the current frame to comprise forward aliasing cancellation data and therefore reads that data from the current frame, and a second action, in which it does not expect the current frame to comprise forward aliasing cancellation data and therefore does not read it. In other words, although the new syntax portion costs some coding efficiency, it is this syntax portion alone that makes the codec usable over a communication channel with frame loss. Without it, the decoder would not be able to decode any portion of the data stream following a loss and would crash when trying to resume parsing. Thus, in an error-prone environment, the new syntax portion prevents the coding efficiency from vanishing.
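The parser behaviour described above can be illustrated with a minimal sketch. The frame layout used here (a header byte whose bit 0 carries the coding mode and whose bit 1 is the added syntax portion, followed by an optional length-prefixed block of forward aliasing cancellation data) is purely hypothetical and is not taken from any real bitstream specification; the names `Frame`, `parse_frame` and `parse_stream` are likewise assumptions made for illustration.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Frame:
    mode: str                   # "transform" or "time_domain"
    fac_data: Optional[bytes]   # forward aliasing cancellation data, if present
    payload: bytes              # remaining mode-specific data


def parse_frame(buf: bytes) -> Frame:
    """Parse one frame of a HYPOTHETICAL layout:
    byte 0       : bit 0 = coding mode, bit 1 = FAC-present flag
                   (the flag plays the role of the added syntax portion)
    byte 1       : FAC length in bytes (only if the flag is set)
    next N bytes : FAC data (only if the flag is set)
    rest         : mode-specific payload
    """
    header = buf[0]
    mode = "time_domain" if header & 0x01 else "transform"
    fac_present = bool(header & 0x02)
    pos = 1
    fac_data = None
    if fac_present:
        # First action: FAC data is expected, so read it.
        fac_len = buf[pos]
        pos += 1
        fac_data = buf[pos:pos + fac_len]
        pos += fac_len
    # Second action (flag cleared): nothing is read, parsing continues.
    return Frame(mode, fac_data, buf[pos:])


def parse_stream(frames: List[Optional[bytes]]) -> List[Optional[Frame]]:
    """Parse a sequence of frames; None marks a frame lost in transit."""
    return [None if buf is None else parse_frame(buf) for buf in frames]
```

Because the decision to read forward aliasing cancellation data depends only on the current frame, a frame that follows a lost frame can still be parsed. If the decision had to be derived from the coding mode of the previous frame, the parser would not know how many bytes to skip after a loss.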
Abstract:
An audio signal decoder (200) for providing a decoded representation (212) of an audio content on the basis of an encoded representation (310) of the audio content comprises a transform domain path (230, 240, 242, 250, 260) configured to obtain a time-domain representation (212) of a portion of the audio content encoded in a transform-domain mode on the basis of a first set (220) of spectral coefficients, a representation (224) of an aliasing-cancellation stimulus signal and a plurality of linear-prediction-domain parameters (222). The transform domain path comprises a spectrum processor (230) configured to apply a spectrum shaping to the first set of spectral coefficients in dependence on at least a subset of the linear-prediction-domain parameters, to obtain a spectrally-shaped version (232) of the first set of spectral coefficients. The transform domain path comprises a first frequency-domain-to-time-domain converter (240) configured to obtain a time-domain representation of the audio content on the basis of the spectrally-shaped version of the first set of spectral coefficients. The transform domain path comprises an aliasing-cancellation stimulus filter (250) configured to filter the aliasing-cancellation stimulus signal (324) in dependence on at least a subset of the linear-prediction-domain parameters (222), to derive an aliasing-cancellation synthesis signal (252) from the aliasing-cancellation stimulus signal. The transform domain path also comprises a combiner (260) configured to combine the time-domain representation (242) of the audio content with the aliasing-cancellation synthesis signal (252), or a post-processed version thereof, to obtain an aliasing-reduced time-domain signal.
Abstract:
Embodiments of the invention relate to an error concealment unit for providing an error concealment audio information for concealing a loss of an audio frame in an encoded audio information. The error concealment unit provides a first error concealment audio information component for a first frequency range using a frequency domain concealment. The error concealment unit also provides a second error concealment audio information component for a second frequency range, which includes lower frequencies than the first frequency range, using a time domain concealment. The error concealment unit also combines the first error concealment audio information component and the second error concealment audio information component, to obtain the error concealment audio information. Other embodiments of the invention relate to a decoder including the error concealment unit, as well as related encoders, methods, and computer programs for decoding and/or concealing.
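The split between the two concealment methods can be sketched as follows. This is an illustration under stated assumptions, not the method of the embodiments: the time domain concealment is reduced to repeating the last pitch cycle of the previous frame, the frequency domain concealment is reduced to repeating the previous frame's attenuated spectrum, and a naive DFT with a hard crossover bin stands in for whatever transform and crossover the actual unit uses. All function names and parameters (`pitch_lag`, `crossover_bin`, `fade`) are hypothetical.

```python
import cmath
import math


def dft(x):
    """Naive O(N^2) DFT, sufficient for a short illustrative frame."""
    n_len = len(x)
    return [sum(x[n] * cmath.exp(-2j * math.pi * k * n / n_len)
                for n in range(n_len)) for k in range(n_len)]


def idft(spec):
    """Inverse of dft(); returns the real part of each sample."""
    n_len = len(spec)
    return [(sum(spec[k] * cmath.exp(2j * math.pi * k * n / n_len)
                 for k in range(n_len)) / n_len).real for n in range(n_len)]


def keep_band(spec, lo_bin, hi_bin):
    """Zero every bin whose frequency index lies outside [lo_bin, hi_bin)."""
    n_len = len(spec)
    return [c if lo_bin <= min(k, n_len - k) < hi_bin else 0j
            for k, c in enumerate(spec)]


def time_domain_concealment(prev_frame, pitch_lag):
    """Assumed method: repeat the last pitch cycle of the previous frame."""
    cycle = prev_frame[-pitch_lag:]
    return [cycle[n % pitch_lag] for n in range(len(prev_frame))]


def frequency_domain_concealment(prev_frame, fade):
    """Assumed method: reuse the previous spectrum, attenuated by `fade`."""
    return [fade * c for c in dft(prev_frame)]


def conceal_lost_frame(prev_frame, pitch_lag, crossover_bin, fade=0.5):
    """Combine a low band from time domain concealment with a high band
    from frequency domain concealment."""
    nyquist_end = len(prev_frame) // 2 + 1
    low_spec = keep_band(dft(time_domain_concealment(prev_frame, pitch_lag)),
                         0, crossover_bin)
    high_spec = keep_band(frequency_domain_concealment(prev_frame, fade),
                          crossover_bin, nyquist_end)
    low = idft(low_spec)
    high = idft(high_spec)
    return [l + h for l, h in zip(low, high)]
```

The low band, where pitch periodicity dominates, comes from the time domain method, and the higher band comes from the frequency domain method, matching the assignment of frequency ranges given in the abstract.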
Abstract:
An apparatus for reconstructing a frame including a speech signal as a reconstructed frame is provided. The apparatus includes a determination unit and a frame reconstructor. The frame reconstructor is configured to reconstruct the reconstructed frame such that it completely or partially includes a first reconstructed pitch cycle, such that it completely or partially includes a second reconstructed pitch cycle, and such that the number of samples of the first reconstructed pitch cycle differs from the number of samples of the second reconstructed pitch cycle.
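A minimal sketch of a reconstructed frame built from pitch cycles of differing lengths is given below. The abstract does not say how the cycle lengths are determined or how samples are added or removed, so this sketch assumes a constant per-cycle change `delta` (in samples) and uses linear interpolation to stretch the last available cycle; both choices, and the names `resample_cycle` and `reconstruct_frame`, are hypothetical.

```python
def resample_cycle(cycle, new_len):
    """Linearly interpolate one pitch cycle to new_len samples.
    The cycle is treated as periodic, so the last sample wraps to the first."""
    old_len = len(cycle)
    out = []
    for i in range(new_len):
        pos = i * old_len / new_len      # fractional position in the source
        j = int(pos)
        frac = pos - j
        nxt = cycle[(j + 1) % old_len]
        out.append((1 - frac) * cycle[j] + frac * nxt)
    return out


def reconstruct_frame(last_cycle, frame_len, delta):
    """Fill a frame with successive pitch cycles whose length changes by
    `delta` samples from one cycle to the next (an assumed pitch evolution).
    Returns the frame and the list of cycle lengths used."""
    frame, cycle_lengths = [], []
    length = len(last_cycle)
    while len(frame) < frame_len:
        length = max(2, length + delta)
        frame.extend(resample_cycle(last_cycle, length))
        cycle_lengths.append(length)
    # The final cycle may be only partially included in the frame.
    return frame[:frame_len], cycle_lengths
```

For example, an 8-sample cycle with `delta=1` and a 30-sample frame yields cycles of 9, 10 and 11 samples, so consecutive reconstructed pitch cycles differ in their number of samples.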
Abstract:
An audio decoder provides decoded audio information on the basis of encoded audio information. The audio decoder has an error concealment configured to provide error concealment audio information for concealing a loss of an audio frame, wherein the error concealment is configured to modify a time domain excitation signal obtained for one or more audio frames preceding a lost audio frame in order to obtain the error concealment audio information.
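One way to picture such a modification of the excitation is sketched below. The abstract does not specify which modification is applied, so this example assumes a common pattern: the last pitch cycle of the past excitation is repeated with a gradually decreasing gain and then passed through an all-pole linear-prediction synthesis filter. The function names, the linear fade and the coefficient convention are assumptions made for illustration.

```python
def conceal_excitation(past_exc, pitch_lag, frame_len, gain=0.8):
    """Build a concealment excitation by periodically repeating the last
    pitch cycle of the past excitation while fading linearly from 1.0
    toward `gain` over the frame (an assumed modification)."""
    cycle = past_exc[-pitch_lag:]
    out = []
    for n in range(frame_len):
        fade = 1.0 - (1.0 - gain) * n / frame_len
        out.append(fade * cycle[n % pitch_lag])
    return out


def lpc_synthesis(exc, a, memory=None):
    """All-pole synthesis filter:
        y[n] = exc[n] - sum_{k=1..p} a[k-1] * y[n-k]
    `memory`, if given, must hold the last p output samples, most recent last."""
    p = len(a)
    hist = list(memory) if memory else [0.0] * p
    y = []
    for e in exc:
        s = e - sum(a[k] * hist[-1 - k] for k in range(p))
        y.append(s)
        hist.append(s)
    return y
```

A concealed frame would then be obtained as `lpc_synthesis(conceal_excitation(past_exc, pitch_lag, frame_len), a)`, where `a` holds the linear-prediction coefficients of the last correctly received frame.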
Abstract:
An audio decoder (100; 300) for providing a decoded audio information (112; 312) on the basis of an encoded audio information (110; 310) comprises an error concealment (130; 380; 500) configured to provide an error concealment audio information (132; 382; 512) for concealing a loss of an audio frame following an audio frame encoded in a frequency domain representation (322) using a time domain excitation signal (532).
Abstract:
An apparatus for generating an encoded signal comprises: a window sequence controller (808) for generating a window sequence information (809) for windowing an audio or image signal, the window sequence information indicating a first window function (1500) for generating a first frame of spectral values, a second window function (1502) and at least one third window function (1503) for generating a second frame of spectral values, wherein the first window function (1500), the second window function (1502) and the one or more third window functions overlap within a multi-overlap region (1300); a preprocessor (802) for windowing (902) a second block of samples corresponding to the second window function and the at least one third window function using an auxiliary window function (1100) to obtain a second block of windowed samples, and for preprocessing (904) the second block of windowed samples using a folding-in operation of a portion of the second block overlapping with a first block into the multi-overlap portion (1300) to obtain a preprocessed second block of windowed samples having a modified multi-overlap portion; a spectrum converter (804) for applying an aliasing-introducing transform (906) to the first block of samples using the first window function to obtain the first frame of spectral values, for applying the aliasing-introducing transform to a first portion of the preprocessed second block of windowed samples using the second window function to obtain a first portion of spectral samples of a second frame and for applying the aliasing-introducing transform to a second portion of the preprocessed second block of windowed samples using the one or more third window functions (1503) to obtain a second portion of spectral samples of the second frame; and a processor (806) for processing the first frame and the second frame to obtain encoded frames of the audio or image signal.