Abstract:
A better compromise between excessive bitstream and decoding overhead on the one hand and flexibility of frame element positioning on the other hand is achieved as follows. Each frame in the sequence of frames of the bitstream comprises a sequence of N frame elements, and the bitstream comprises a configuration block comprising a field indicating the number of elements N and a type indication syntax portion indicating, for each element position of the sequence of N element positions, an element type out of a plurality of element types. In the sequences of N frame elements of the frames, each frame element is of the element type indicated by the type indication syntax portion for the element position at which that frame element is positioned within the sequence of N frame elements of its frame. Thus, the frames are equally structured: each frame comprises the same sequence of N frame elements of the indicated element types, positioned within the bitstream in the same sequential order. This sequential order is adjustable in common for the whole sequence of frames by means of the type indication syntax portion.
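The arrangement described above can be illustrated with a minimal sketch. The names `ConfigBlock`, `parse_frame`, and `ELEMENT_SIZES`, the element types "mono" and "stereo", and the fixed element sizes are all assumptions made for illustration; the abstract does not specify them.

```python
from dataclasses import dataclass
from typing import Dict, List

# Hypothetical element types with fixed sizes in bytes (an assumption for this sketch).
ELEMENT_SIZES: Dict[str, int] = {"mono": 1, "stereo": 2}


@dataclass
class ConfigBlock:
    num_elements: int          # the field indicating N
    element_types: List[str]   # type indication: one element type per element position


def parse_frame(config: ConfigBlock, frame: bytes) -> List[dict]:
    """Split one frame into N elements using only the configuration block."""
    assert len(config.element_types) == config.num_elements
    elements = []
    offset = 0
    # The element type at each position comes from the configuration block,
    # so the frame itself carries no per-element type information.
    for position, element_type in enumerate(config.element_types):
        size = ELEMENT_SIZES[element_type]
        elements.append({
            "position": position,
            "type": element_type,
            "payload": frame[offset:offset + size],
        })
        offset += size
    return elements


config = ConfigBlock(num_elements=3, element_types=["mono", "stereo", "mono"])
first = parse_frame(config, bytes([1, 2, 3, 4]))
second = parse_frame(config, bytes([5, 6, 7, 8]))
```

Both frames are parsed with the same type order; changing `element_types` in the configuration block would change the order for every frame at once.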
Abstract:
Frame elements which shall be made available for skipping may be transmitted more efficiently by arranging that default payload length information is transmitted separately within a configuration block, with the length information within the frame elements, in turn, being subdivided into a default payload length flag followed, if the default payload length flag is not set, by a payload length value explicitly coding the payload length of the respective frame element. If the default payload length flag is set, however, an explicit transmission of the payload length may be avoided. Rather, any frame element whose default extension payload length flag is set has the default payload length, and any frame element whose default extension payload length flag is not set has a payload length corresponding to the payload length value. By this measure, transmission efficiency is increased.
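A minimal sketch of the flag-plus-optional-length scheme follows. The byte-aligned layout (one byte for the flag, one byte for the explicit length) and the function names are assumptions chosen for readability; the abstract does not define field widths.

```python
from typing import Tuple


def read_payload_length(element: bytes, default_length: int) -> Tuple[int, int]:
    """Return (payload_length, header_size) for one frame element.

    Assumed layout (hypothetical): byte 0 is the default payload length flag;
    byte 1 holds an explicit payload length only when the flag is 0.
    """
    if element[0] == 1:
        # Flag set: the length comes from the configuration block.
        return default_length, 1
    # Flag not set: the length is coded explicitly in the element.
    return element[1], 2


def skip_element(stream: bytes, offset: int, default_length: int) -> int:
    """Return the offset of the next frame element without decoding the payload."""
    length, header_size = read_payload_length(stream[offset:], default_length)
    return offset + header_size + length


# Element 1 uses the default length (3); element 2 codes its length (2) explicitly.
stream = bytes([1, 0xAA, 0xBB, 0xCC, 0, 2, 0xDD, 0xEE])
after_first = skip_element(stream, 0, default_length=3)
after_second = skip_element(stream, after_first, default_length=3)
```

In this layout, an element with the default length spends one header byte instead of two, which is where the saving comes from.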
Abstract:
An audio decoder for decoding an encoded audio signal (10), the encoded audio signal (10) comprising a first channel element (52a) and a second channel element (52b) in a payload section (52) of a data stream and first decoder configuration data (50c) for the first channel element (52a) and second decoder configuration data (50d) for the second channel element (52b) in a configuration section (50) of the data stream, comprises: a data stream reader (12) for reading the configuration data for each channel element in the configuration section and for reading the payload data for each channel element in the payload section; a configurable decoder (16) for decoding the plurality of channel elements; and a configuration controller (14) for configuring the configurable decoder (16) so that the configurable decoder (16) is configured in accordance with the first decoder configuration data when decoding the first channel element and in accordance with the second decoder configuration data when decoding the second channel element.
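The control flow described above can be sketched as follows. `ElementConfig`, its `gain` field, and the trivial `decode` step are hypothetical stand-ins; the abstract does not say what the decoder configuration data contains.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class ElementConfig:
    gain: float  # hypothetical configuration parameter, for illustration only


class ConfigurableDecoder:
    def __init__(self) -> None:
        self.config: Optional[ElementConfig] = None

    def configure(self, config: ElementConfig) -> None:
        self.config = config

    def decode(self, payload: List[int]) -> List[float]:
        # Stand-in for real decoding: scale samples by the configured gain.
        return [sample * self.config.gain for sample in payload]


def decode_stream(configs: List[ElementConfig], payloads: List[List[int]]) -> List[List[float]]:
    """Play the role of the configuration controller: reconfigure per channel element."""
    decoder = ConfigurableDecoder()
    outputs = []
    for config, payload in zip(configs, payloads):
        decoder.configure(config)  # apply this element's own configuration
        outputs.append(decoder.decode(payload))
    return outputs


decoded = decode_stream(
    [ElementConfig(gain=1.0), ElementConfig(gain=0.5)],
    [[2, 4], [2, 4]],
)
```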
Abstract:
A method for loss concealment of spatial audio parameters, the spatial audio parameters comprising at least direction of arrival information, the method comprising the following steps: - receiving a first set of spatial audio parameters comprising at least first direction of arrival information; - receiving a second set of spatial audio parameters comprising at least second direction of arrival information; and - replacing the second direction of arrival information of the second set with replacement direction of arrival information derived from the first direction of arrival information, if at least the second direction of arrival information or a portion of the second direction of arrival information is lost or damaged.
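The replacement step can be sketched as below. Representing a direction as an (azimuth, elevation) pair in degrees, signalling loss with `None`, and deriving the replacement by simply repeating the first direction are assumptions; the abstract leaves the derivation open.

```python
from typing import Optional, Tuple

Direction = Tuple[float, float]  # (azimuth, elevation) in degrees, assumed representation


def conceal_direction(first: Direction, second: Optional[Direction]) -> Direction:
    """Return the second direction, or a replacement derived from the first.

    `second` is None when it was lost or damaged. The replacement used here is
    a plain copy of the first direction, the simplest possible derivation.
    """
    if second is None:
        return first
    return second


received = conceal_direction((30.0, 10.0), (45.0, 0.0))
concealed = conceal_direction((30.0, 10.0), None)
```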
Abstract:
An audio encoder (100) for encoding segments of coefficients, the segments of coefficients representing different time or frequency resolutions of a sampled audio signal, comprises a processor (110) for deriving a coding context for a currently encoded coefficient of a current segment based on a previously encoded coefficient of a previous segment, the previously encoded coefficient representing a different time or frequency resolution than the currently encoded coefficient. The audio encoder (100) further comprises an entropy encoder (120) for entropy encoding the current coefficient based on the coding context to obtain an encoded audio stream.
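One way to picture a context that crosses resolutions is sketched below. The index mapping (scaling the current index onto the previous segment's grid) and the context definition (clipped magnitude of the mapped coefficient) are assumptions for illustration, not the method of the abstract.

```python
from typing import List


def coding_context(previous: List[int], current_length: int, index: int) -> int:
    """Derive a small context index for coefficient `index` of the current segment.

    `previous` holds already coded coefficients of a segment with a different
    number of coefficients (a different resolution). The current index is mapped
    onto the previous grid, and the context is the clipped magnitude found there.
    """
    previous_index = index * len(previous) // current_length
    return min(abs(previous[previous_index]), 3)


# Previous segment with 8 coefficients, current segment with 4.
previous_fine = [0, 5, 1, 2, 0, 0, 7, 1]
contexts = [coding_context(previous_fine, 4, i) for i in range(4)]

# Previous segment with 2 coefficients, current segment with 8.
coarse_context = coding_context([4, 1], 8, 5)
```

An entropy coder would then select a probability model according to the returned context index.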
Abstract:
An encoder for providing an audio stream on the basis of a transform-domain representation of an input audio signal comprises a quantization error calculator configured to determine a multi-band quantization error over a plurality of frequency bands of the input audio signal for which separate band gain information is available. The encoder also comprises an audio stream provider configured to provide the audio stream such that the audio stream comprises information describing an audio content of the frequency bands and information describing the multi-band quantization error. A decoder for providing a decoded representation of an audio signal on the basis of an encoded audio stream representing spectral components of frequency bands of the audio signal comprises a noise filler configured to introduce noise, on the basis of a common multi-band noise intensity value, into spectral components of a plurality of frequency bands with which separate frequency band gain information is associated.
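The encoder and decoder sides can be sketched together as follows. Measuring the error as a mean absolute difference, filling only zero-quantized components, using random-sign noise, and applying the band gain after filling are all assumptions of this sketch.

```python
import random
from typing import List


def multi_band_error(original: List[List[float]], quantized: List[List[float]]) -> float:
    """Encoder side: one error value computed jointly over all bands (mean absolute error)."""
    errors = [
        abs(o - q)
        for band_o, band_q in zip(original, quantized)
        for o, q in zip(band_o, band_q)
    ]
    return sum(errors) / len(errors)


def fill_noise(bands: List[List[float]], band_gains: List[float],
               noise_level: float, rng: random.Random) -> List[List[float]]:
    """Decoder side: fill zero components using one common noise level, then apply band gains."""
    filled = []
    for band, gain in zip(bands, band_gains):
        filled.append([
            value * gain if value != 0 else rng.choice((-1.0, 1.0)) * noise_level * gain
            for value in band
        ])
    return filled


original = [[0.4, 2.0], [0.2, -1.0]]
quantized = [[0.0, 2.0], [0.0, -1.0]]
level = multi_band_error(original, quantized)
decoded = fill_noise(quantized, band_gains=[2.0, 0.5], noise_level=level, rng=random.Random(0))
```

A single `level` serves both bands, while each band's own gain still scales the inserted noise.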
Abstract:
An audio encoder for providing an encoded audio information on the basis of an input audio information comprises a bandwidth extension information provider configured to provide bandwidth extension information using a variable temporal resolution and a detector configured to detect an onset of a fricative or affricate. The audio encoder is configured to adjust a temporal resolution used by the bandwidth extension information provider such that bandwidth extension information is provided with an increased temporal resolution at least for a predetermined period of time before a time at which an onset of a fricative or affricate is detected and for a predetermined period of time following the time at which the onset of the fricative or affricate is detected. Alternatively or in addition, the bandwidth extension information is provided with an increased temporal resolution in response to a detection of an offset of a fricative or affricate. Audio encoders and methods use a corresponding concept.
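The resolution adjustment around a detected onset can be sketched per frame. Working on frame indices, expressing the two predetermined periods as frame counts, and the function name are assumptions; raising the resolution before the onset also presumes the encoder can look ahead or buffer frames.

```python
from typing import List


def high_resolution_frames(num_frames: int, onsets: List[int],
                           frames_before: int, frames_after: int) -> List[bool]:
    """Mark frames that should carry bandwidth extension data at increased temporal resolution."""
    flags = [False] * num_frames
    for onset in onsets:
        start = max(0, onset - frames_before)
        end = min(num_frames - 1, onset + frames_after)
        for frame in range(start, end + 1):
            flags[frame] = True
    return flags


# Onset of a fricative detected in frame 4; one frame before and two after are covered.
flags = high_resolution_frames(8, onsets=[4], frames_before=1, frames_after=2)
```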
Abstract:
An audio decoder device for decoding a bitstream comprises: a predictive decoder for producing a decoded audio frame from the bitstream, the predictive decoder comprising a parameter decoder for producing one or more audio parameters for the decoded audio frame from the bitstream and a synthesis filter device for producing the decoded audio frame by synthesizing the one or more audio parameters; a memory device comprising one or more memories, each configured to store a memory state for the decoded audio frame, the memory state being used by the synthesis filter device for synthesizing the one or more audio parameters; and a memory state resampling device. The decoded audio frame has a sampling rate, and a preceding decoded audio frame has a preceding sampling rate different from that sampling rate. For one or more of said memories, the memory state resampling device determines the memory state for synthesizing the one or more audio parameters for the decoded audio frame by resampling a preceding memory state, which was used for synthesizing one or more audio parameters for the preceding decoded audio frame, and stores the resulting memory state into the respective memory.
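The resampling of a memory state at a sampling rate switch can be sketched as below. Linear interpolation without any anti-aliasing filter is an assumption chosen for brevity; the abstract does not name a resampling method, and the function name is hypothetical.

```python
from typing import List


def resample_memory(state: List[float], old_rate: int, new_rate: int) -> List[float]:
    """Resample a filter memory state from old_rate to new_rate by linear interpolation."""
    new_length = len(state) * new_rate // old_rate
    resampled = []
    for i in range(new_length):
        # Position of output sample i on the time grid of the old state.
        position = i * old_rate / new_rate
        low = int(position)
        high = min(low + 1, len(state) - 1)  # hold the last sample at the end
        fraction = position - low
        resampled.append((1 - fraction) * state[low] + fraction * state[high])
    return resampled


# The preceding frame ran at 16 kHz; the current frame runs at 32 kHz.
upsampled = resample_memory([0.0, 1.0, 2.0, 3.0], old_rate=16000, new_rate=32000)
# The opposite switch, from 32 kHz down to 16 kHz.
downsampled = resample_memory([0.0, 1.0, 2.0, 3.0], old_rate=32000, new_rate=16000)
```

The resampled state covers the same time span as the preceding one, so a synthesis filter could continue from it at the new rate instead of starting from an empty memory.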