Abstract:
An apparatus for decoding an encoded audio signal comprises: a spectral domain audio decoder (112) for generating a first decoded representation of a first set of first spectral portions, the decoded representation having a first spectral resolution; a parametric decoder (114) for generating a second decoded representation of a second set of second spectral portions having a second spectral resolution lower than the first spectral resolution; a frequency regenerator (116) for regenerating a reconstructed second spectral portion having the first spectral resolution using a first spectral portion and spectral envelope information for the second spectral portion; and a spectrum time converter (118) for converting the first decoded representation and the reconstructed second spectral portion into a time representation.
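The frequency regeneration step described above can be illustrated with a minimal sketch. It assumes that the spectral envelope information is a single target energy per band and that the source tile is chosen by a fixed offset; the abstract specifies neither, and the function and parameter names are hypothetical.

```python
import math

def regenerate_band(core, src_start, dst_start, width, target_energy):
    """Fill one missing band from a decoded source tile (illustrative only).

    core          -- decoded spectral values at the first (full) resolution
    src_start     -- first bin of the source tile (a first spectral portion)
    dst_start     -- first bin of the band to reconstruct (a second spectral portion)
    target_energy -- assumed form of the spectral envelope information
    """
    tile = core[src_start:src_start + width]
    tile_energy = sum(x * x for x in tile)
    # Scale the copied tile so that its energy matches the signalled envelope.
    gain = math.sqrt(target_energy / tile_energy) if tile_energy > 0 else 0.0
    out = list(core)
    out[dst_start:dst_start + width] = [gain * x for x in tile]
    return out

# Toy spectrum: bins 0-3 were decoded, bins 4-7 were not transmitted.
spectrum = [1.0, -2.0, 2.0, 0.0, 0.0, 0.0, 0.0, 0.0]
filled = regenerate_band(spectrum, src_start=0, dst_start=4, width=4,
                         target_energy=1.0)
```

The reconstructed bins keep the fine structure of the source tile while their energy follows the parametric data, so the spectrum time converter can process both portions as one full-resolution spectrum.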
Abstract:
An apparatus for generating a decoded two-channel signal comprises: an audio processor (802) for decoding an encoded two-channel signal to obtain a first set of first spectral portions; a parametric decoder (804) for providing parametric data for a second set of second spectral portions and a two-channel identification identifying either a first or a second different two-channel representation for the second spectral portions; and a frequency regenerator (806) for regenerating a second spectral portion depending on a first spectral portion of the first set of first spectral portions, the parametric data for the second portion and the two-channel identification for the second portion.
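As a rough illustration of how a two-channel identification could steer the regeneration, the sketch below assumes that the two representations are left/right ("LR") and mid/side ("MS"), that the source tile is available as left/right, and that the parametric data reduces to one gain per channel. None of this is stated in the abstract, and all names are hypothetical.

```python
def to_mid_side(left, right):
    """Convert a left/right pair to mid/side (one common normalization)."""
    mid = [(l + r) / 2 for l, r in zip(left, right)]
    side = [(l - r) / 2 for l, r in zip(left, right)]
    return mid, side

def regenerate_two_channel(src_left, src_right, representation, gains):
    """Regenerate one band in the representation named by the identification.

    representation -- assumed two-channel identification: "LR" or "MS"
    gains          -- assumed parametric data: one gain per output channel
    """
    if representation == "MS":
        first, second = to_mid_side(src_left, src_right)
    elif representation == "LR":
        first, second = list(src_left), list(src_right)
    else:
        raise ValueError("unknown two-channel representation")
    gain_first, gain_second = gains
    return ([gain_first * x for x in first],
            [gain_second * x for x in second])

# Source tile decoded as left/right; the band to fill is signalled as mid/side.
band = regenerate_two_channel([1.0, 3.0], [1.0, 1.0], "MS", (1.0, 2.0))
```

Here the source tile is converted into the signalled representation before the parametric data is applied, so the gains act on the channels they were computed for.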
Abstract:
An audio encoder (100) for encoding audio samples comprises a first time domain aliasing introducing encoder (110) for encoding audio samples in a first encoding domain, the first time domain aliasing introducing encoder (110) having a first framing rule, a start window and a stop window. The audio encoder (100) further comprises a second encoder (120) for encoding samples in a second encoding domain, the second encoder (120) having a predetermined frame size number of audio samples and a coding warm-up period number of audio samples, the second encoder (120) having a different second framing rule, a frame of the second encoder (120) being an encoded representation of a number of temporally subsequent audio samples, the number being equal to the predetermined frame size number of audio samples. The audio encoder (100) further comprises a controller (130) for switching from the first encoder (110) to the second encoder (120) in response to a characteristic of the audio samples, and for modifying the second framing rule in response to switching from the first encoder (110) to the second encoder (120) or for modifying the start window or the stop window of the first encoder (110), wherein the second framing rule remains unmodified.
Abstract:
An apparatus for encoding comprises a first domain converter (510), a switchable bypass (50), a second domain converter (410), a first processor (420) and a second processor (520) to obtain an encoded audio signal having different signal portions represented by coded data in different domains, which have been coded by different coding algorithms. Corresponding decoding stages in the decoder together with a bypass for bypassing a domain converter allow the generation of a decoded audio signal with high quality and low bit rate.
Abstract:
A processed representation of an audio signal having a sequence of frames is generated by sampling the audio signal within a first and a second frame of the sequence of frames, the second frame following the first frame, the sampling using information on a pitch contour of the first and the second frame to derive a first sampled representation. The audio signal is sampled within the second and a third frame, the third frame following the second frame in the sequence of frames. The sampling uses the information on the pitch contour of the second frame and information on a pitch contour of the third frame to derive a second sampled representation. A first scaling window is derived for the first sampled representation and a second scaling window is derived for the second sampled representation, the scaling windows depending on the samplings applied to derive the first sampled representation or the second sampled representation.
Abstract:
An apparatus for encoding a multi-channel signal comprising at least two channels comprises: a time-spectral converter (1000) for converting sequences of blocks of sample values of the at least two channels into a frequency domain representation having sequences of blocks of spectral values for the at least two channels, wherein a block of sample values has an associated input sampling rate, and a block of spectral values of the sequences of blocks of spectral values has spectral values up to a maximum input frequency (1211) being related to the input sampling rate; a multi-channel processor (1010) for applying a joint multi-channel processing to the sequences of blocks of spectral values or to resampled sequences of blocks of spectral values to obtain at least one result sequence of blocks of spectral values comprising information related to the at least two channels; a spectral domain resampler (1020) for resampling the blocks of the result sequences in the frequency domain or for resampling the sequences of blocks of spectral values for the at least two channels in the frequency domain to obtain a resampled sequence of blocks of spectral values, wherein a block of the resampled sequence of blocks of spectral values has spectral values up to a maximum output frequency (1231, 1221) being different from the maximum input frequency (1211); a spectral-time converter for converting the resampled sequence of blocks of spectral values into a time domain representation or for converting the result sequence of blocks of spectral values into a time domain representation comprising an output sequence of blocks of sample values having an associated output sampling rate being different from the input sampling rate; and a core encoder (1040) for encoding the output sequence of blocks of sample values to obtain an encoded multi-channel signal (1510).
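One simple way to picture resampling in the spectral domain is to change the number of spectral values in a block while leaving the bin spacing untouched. The sketch below assumes truncation for a lower maximum output frequency and zero-padding for a higher one, and it omits any amplitude rescaling. The abstract does not say how the resampler (1020) works, and `resample_spectrum` is a hypothetical name.

```python
def resample_spectrum(block, out_bins):
    """Return a block of spectral values with out_bins bins (illustrative only).

    With the bin spacing held fixed, fewer bins mean a lower maximum
    frequency and more bins mean a higher one.
    """
    if out_bins <= len(block):
        # Downsampling: drop the bins above the new maximum frequency.
        return list(block[:out_bins])
    # Upsampling: extend the block with empty bins up to the new maximum.
    return list(block) + [0.0] * (out_bins - len(block))

narrow = resample_spectrum([1.0, 2.0, 3.0, 4.0], 2)
wide = resample_spectrum([1.0, 2.0], 4)
```

A spectral-time converter applied to the shorter or longer block would then yield a time domain block at a correspondingly different sampling rate.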
Abstract:
An audio encoder comprises a window function controller (504), a windower (502), a time warper (506) with a final quality check functionality, a time/frequency converter (508), a TNS stage (510) or a quantizer encoder (512), wherein the window function controller (504), the time warper (506), the TNS stage (510) or an additional noise filling analyzer (524) is controlled by signal analysis results obtained by a time warp analyzer (516) or a signal classifier (520). Furthermore, a decoder applies a noise filling operation using a manipulated noise filling estimate depending on a harmonic or speech characteristic of the audio signal.
Abstract:
An apparatus for generating a synthesis audio signal using a patching control signal comprises a first converter, a spectral domain patch generator, a high frequency reconstruction manipulator and a combiner. The first converter is configured for converting a time portion of an audio signal into a spectral representation. The spectral domain patch generator is configured for performing a plurality of different spectral domain patching algorithms, wherein each patching algorithm generates a modified spectral representation comprising spectral components in an upper frequency band derived from corresponding spectral components in a core frequency band of the audio signal. The spectral domain patch generator is furthermore configured to select a first spectral domain patching algorithm from the plurality of patching algorithms for a first time portion and a second spectral domain patching algorithm from the plurality of patching algorithms for a second different time portion in accordance with the patching control signal to obtain the modified spectral representation. The high frequency reconstruction manipulator is configured for manipulating the modified spectral representation or a signal derived from the modified spectral representation in accordance with a spectral band replication parameter to obtain a bandwidth extended signal. Finally, the combiner is configured for combining the audio signal having spectral components in the core frequency band or a signal derived from the audio signal with the bandwidth extended signal to obtain the synthesis audio signal.
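The per-time-portion selection of a patching algorithm can be sketched as a lookup driven by the patching control signal. The two algorithms used here, a plain copy-up and a mirrored copy, are stand-ins chosen for illustration; the abstract does not name the algorithms, and the manipulation and combining stages are left out. All identifiers are hypothetical.

```python
def copy_up(core):
    """Core band followed by an upper band that copies the core band."""
    return list(core) + list(core)

def mirror(core):
    """Core band followed by an upper band that mirrors the core band."""
    return list(core) + list(reversed(core))

# Assumed mapping from control-signal values to patching algorithms.
PATCHERS = {"copy_up": copy_up, "mirror": mirror}

def patch_frames(frames, control):
    """Apply to each time portion the algorithm its control value selects."""
    return [PATCHERS[name](frame) for frame, name in zip(frames, control)]

frames = [[1.0, 2.0], [3.0, 4.0]]      # core-band spectra of two time portions
control = ["copy_up", "mirror"]        # patching control signal, one value per portion
patched = patch_frames(frames, control)
```

Each entry of `patched` holds the core band plus an upper band derived from it, which is the modified spectral representation that a later stage would shape with the spectral band replication parameters.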