摘要:
A time scaler for providing a time scaled version of an input audio signal is configured to compute or estimate a quality of a time scaled version of the input audio signal obtainable by a time scaling of the input audio signal. The time scaler is configured to perform the time scaling of the input audio signal in dependence on the computation or estimation of the quality of the time scaled version of the input audio signal obtainable by the time scaling. An audio decoder has such a time scaler.
摘要:
A decoder for decoding an audio or image signal comprising a sequence of blocks of converted windowed samples and associated window information (160) identifying a specific window function for a block out of at least three different window functions, comprises: a processor (156) for providing a sequence of blocks of spectral values; a controllable converter (158) for converting the sequence of blocks of spectral values into a time domain representation using an overlap-add processing, wherein the controllable converter (158) is controlled by the window information to apply window functions indicated by the window information to the corresponding block to calculate a decoded audio or image signal, wherein the window is selected from a group of at least three windows comprising a first window (201) having a first overlap length (203), a second window (215) having a second overlap length (218), and a third window (224) having a third overlap length (229) or having no overlap, wherein the first overlap length (203) is greater than the second overlap length (218), and wherein the second overlap length (218) is greater than the third overlap length (229) or greater than an overlap of zero.
摘要:
An apparatus for encoding audio information is provided. The apparatus for encoding audio information comprises a selector (110) for selecting a comfort noise generation mode from two or more comfort noise generation modes depending on a background noise characteristic of an audio input signal, and an encoding unit (120) for encoding the audio information, wherein the audio information comprises mode information indicating the selected comfort noise generation mode.
摘要:
An apparatus for generating an encoded signal, comprises: a window sequence controller (808) for generating a window sequence information (809) for windowing an audio or image signal, the window sequence information indicating a first window (1500) for generating a first frame of spectral values, a second window function (1502) and at least one third window function (1503) for generating a second frame of spectral values, wherein the first window function (1500), the second window function (1502) and the one or more third window functions overlap within a multi-overlap region (1300); a preprocessor (802) for windowing (902) a second block of samples corresponding to the second window function and the at least one third window functions using an auxiliary window function (1 100) to obtain a second block of windowed samples, and for preprocessing (904) the second block of windowed samples using a folding-in operation of a portion of the second block overlapping with a first block into the multi-overlap portion (1300) to obtain a preprocessed second block of windowed samples having a modified multi-overlap portion; a spectrum converter (804) for applying an aliasing-introducing transform (906) to the first block of samples using the first window function to obtain the first frame of spectral values, for applying the aliasing introducing transform to a first portion of the preprocessed second block of windowed samples using the second window function to obtain a first portion of spectral samples of a second frame and for applying the aliasing introducing transform to a second portion of the preprocessed second block of windowed samples using the one or more third window functions (1503) to obtain a second portion of spectral samples of the second frame; and a processor (806) for processing the first frame and the second frame to obtain encoded frames of the audio or image signal.
摘要:
A jitter buffer control for controlling a provision of a decoded audio content on the basis of an input audio content is configured to select a frame-based time scaling or a sample-based time scaling in a signal-adaptive manner. An audio decoder uses such a jitter buffer control.
摘要:
An apparatus for generating an encoded signal includes: a window sequence controller for generating a window sequence information for windowing an audio or image signal, the window sequence information indicating a first window for generating a first frame of spectral values, a second window function and at least one third window function for generating a second frame of spectral values, wherein the first window function, the second window function and the one or more third window functions overlap within a multi-overlap region; a preprocessor for windowing a second block of samples corresponding to the second window function and the at least one third window functions using an auxiliary window function to acquire a second block of windowed samples, a spectrum converter for applying an aliasing-introducing transform; and a processor for processing the first frame and the second frame to acquire encoded frames of the audio or image signal.
摘要:
A decoder for decoding an audio or image signal comprising a sequence of blocks of converted windowed samples and associated window information (160) identifying a specific window function for a block out of at least three different window functions, comprises: a processor (156) for providing a sequence of blocks of spectral values; a controllable converter (158) for converting the sequence of blocks of spectral values into a time domain representation using an overlap-add processing, wherein the controllable converter (158) is controlled by the window information to apply window functions indicated by the window information to the corresponding block to calculate a decoded audio or image signal, wherein the window is selected from a group of at least three windows comprising a first window (201) having a first overlap length (203), a second window (215) having a second overlap length (218), and a third window (224) having a third overlap length (229) or having no overlap, wherein the first overlap length (203) is greater than the second overlap length (218), and wherein the second overlap length (218) is greater than the third overlap length (229) or greater than an overlap of zero.