摘要:
An audio signal decoder includes a transform domain path configured to obtain a time-domain representation of a portion of an audio content on the basis of a first set of spectral coefficients, a representation of an aliasing-cancellation stimulus signal and a plurality of linear-prediction-domain parameters. The transform domain path applies a spectrum shaping to the first set of spectral coefficients to obtain a spectrally-shaped version thereof. The transform domain path obtains a time-domain representation of the audio content on the basis of the spectrally-shaped version of the first set of spectral coefficients. The transform domain path includes an aliasing-cancellation stimulus filter to filter the aliasing-cancellation stimulus signal in dependence on at least a subset of the linear-prediction-domain parameters. The transform domain path also includes a combiner configured to combine the time-domain representation of the audio content with an aliasing-cancellation synthesis signal to obtain an aliasing reduced time-domain signal.
摘要:
A better compromise between a too high bitstream and decoding overhead on the one hand and flexibility of frame element positioning on the other hand is achieved by arranging that each of the sequence of frames of the bitstream comprises a sequence of N frame elements and, on the other hand, the bitstream comprises a configuration block comprising a field indicating the number of elements N and a type indication syntax portion indicating, for each element position of the sequence of N element positions, an element type out of a plurality of element types with, in the sequences of N frame elements of the frames, each frame element being of the element type indicated, by the type indication portion, for the respective element position at which the respective frame element is positioned within the sequence of N frame elements of the respective frame in the bitstream. Thus, the frames are equally structured in that each frame comprises the same sequence of N frame elements of the frame element type indicated by the type indication syntax portion, positioned within the bitstream in the same sequential order. This sequential order is commonly adjustable for the sequence of frames by use of the type indication syntax portion which indicates, for each element position of the sequence of N element positions, an element type out of a plurality of element types.
摘要:
Frame elements which shall be made available for skipping may are transmitted more efficiently by arranging that a default payload length information is transmitted separately within a configuration block, with the length information within the frame elements, in turn, being subdivided into a default payload length flag followed, if the default payload length flag is not set, by a payload length value explicitly coding the payload length of the respective frame element. However, if the default payload length flag is set, an explicit transmission of the payload length may be avoided. Rather, any frame element, the default extension payload length flag of which is set, has the default payload length and any frame element, the default extension payload length flag of which is not set, has a payload length corresponding to the payload length value. By this measure, transmission effectiveness is increased.
摘要:
An audio signal decoder (200) for providing a decoded representation (212) of an audio content on the basis of an encoded representation (310) of the audio content comprises a transform domain path (230, 240, 242, 250, 260) configured to obtain a time-domain representation (212) of a portion of the audio content encoded in a transform-domain mode on the basis of a first set (220) of spectral coefficients, a representation (224) of an aliasing-cancellation stimulus signal and a plurality of linear-prediction-domain parameters (222). The transform domain path comprises a spectrum processor (230) configured to apply a spectrum shaping to the first set of spectral coefficients in dependence on at least a subset of the linear-prediction-domain parameters, to obtain a spectrally-shaped version (232) of the first set of spectral coefficients. The transform domain path comprises a first frequency-domain-to-time-domain converter (240) configured to obtain a time-domain representation of the audio content on the basis of the spectrally-shaped version of the first set of spectral coefficients. The transform domain path comprises an aliasing-cancellation stimulus filter configured to filter (250) the aliasing-cancellation stimulus signal (324) in dependence on at least a subset of the linear-prediction-domain parameters (222), to derive an aliasing-cancellation synthesis signal (252) from the aliasing-cancellation stimulus signal. The transform domain path also comprises a combiner (260) configured to combine the time-domain representation (242) of the audio content with the aliasing-cancellation synthesis signal (252), or a post-processed version thereof, to obtain an aliasing reduced time-domain signal.
摘要:
An audio signal decoder (200) for providing a decoded representation (212) of an audio content on the basis of an encoded representation (310) of the audio content comprises a transform domain path (230, 240, 242, 250, 260) configured to obtain a time-domain representation (212) of a portion of the audio content encoded in a transform-domain mode on the basis of a first set (220) of spectral coefficients, a representation (224) of an aliasing-cancellation stimulus signal and a plurality of linear-prediction-domain parameters (222). The transform domain path comprises a spectrum processor (230) configured to apply a spectrum shaping to the first set of spectral coefficients in dependence on at least a subset of the linear-prediction-domain parameters, to obtain a spectrally-shaped version (232) of the first set of spectral coefficients. The transform domain path comprises a first frequency-domain-to-time-domain converter (240) configured to obtain a time-domain representation of the audio content on the basis of the spectrally-shaped version of the first set of spectral coefficients. The transform domain path comprises an aliasing-cancellation stimulus filter configured to filter (250) the aliasing-cancellation stimulus signal (324) in dependence on at least a subset of the linear-prediction-domain parameters (222), to derive an aliasing-cancellation synthesis signal (252) from the aliasing-cancellation stimulus signal. The transform domain path also comprises a combiner (260) configured to combine the time-domain representation (242) of the audio content with the aliasing-cancellation synthesis signal (252), or a post-processed version thereof, to obtain an aliasing reduced time-domain signal.
摘要:
An audio decoder for decoding an encoded audio signal (10), the encoded audio signal (10) comprising a first channel element (52a) and a second channel element (52b) in a payload section (52) of a data stream and first decoder configuration data (50c) for the first channel element (52a) and second decoder configuration data (50d) for the second channel element (52b) in a configuration section (50) of the data stream, comprises: a data stream reader (12) for reading the configuration data for each channel element in the configuration section and for reading the payload data for each channel element in the payload section; a configurable decoder (16) for decoding the plurality of channel elements; and a configuration controller (14) for configuring the configurable decoder (16) so that the configurable decoder (16) is configured in accordance with the first decoder configuration data when decoding the first channel element and in accordance with the second decoder configuration data when decoding the second channel element.
摘要:
Audio splicing is rendered more effective by the use of one or more truncation unit packets inserted into the audio data stream so as to indicate to an audio decoder, for a predetermined access unit, an end portion of an audio frame with which the predetermined access unit is associated, as to be discarded in playout.
摘要:
An encoder for encoding an audio signal. The encoder is configured to encode the audio signal in a transform domain or filter-bank domain, wherein the encoder is configured to determine spectral coefficients of the audio signal for a current frame and at least one previous frame, wherein the encoder is configured to selectively apply predictive encoding to a plurality of individual spectral coefficients or groups of spectral coefficients which are separated by at least one spectral coefficient.