Abstract:
A method for decoding an encoded audio signal is described. The encoded audio signal may comprise a sequence of frames and may be indicative of a plurality of different dynamic range control (DRC) profiles for a corresponding plurality of different rendering modes. Different subsets of DRC profiles may be comprised within different frames. The method may comprise determining a first rendering mode from the plurality of different rendering modes; determining one or more DRC profiles from a subset of DRC profiles comprised within a current frame; determining whether at least one of the DRC profiles is applicable to the first rendering mode; selecting a default DRC profile as a current DRC profile, if none of the DRC profiles is applicable to the first rendering mode; wherein definition data of the default DRC profile is known at a decoder; and decoding the current frame using the current DRC profile.
Abstract:
Methods for generating an AV bitstream (e.g., an MPEG-2 transport stream or bitstream segment having adaptive streaming format) such that the AV bitstream includes at least one video I-frame synchronized with at least one audio I-frame, e.g., including by re-authoring at least one video or audio frame (as a re-authored I-frame or a re-authored P-frame). Typically, a segment of content of the AV bitstream which includes the re-authored frame starts with an I-frame and includes at least one subsequent P-frame. Other aspects are methods for adapting such an AV bitstream, audio/video processing units configured to perform any embodiment of the inventive method, and audio/video processing units which include a buffer memory which stores at least one segment of an AV bitstream generated in accordance with any embodiment of the inventive method.
Abstract:
The present document relates to the design of anti-aliasing and/or anti-imaging filters for resamplers using rational resampling factors. In particular, the present document relates to a method for implementing such anti-aliasing and/or anti-imaging filters with reduced computational complexity. In addition, the present document relates to further aspects of an audio encoding and decoding system, such as the phase relation between the channels of a multi-channel audio signal and/or the structure of the bitstream of an encoded audio signal.
Abstract:
An audio bitstream is decoded into audio objects and audio metadata for the audio objects. The audio objects include a specific audio object. The audio metadata specifies frame-level gains that include a first gain and a second gain respectively for a first audio frame and a second audio frame. It is determined, based on the first and second gains, whether sub-frame gains are to be generated for the specific audio object. If so, a ramp length is determined for a ramp used to generate the sub-frame gains for the specific audio object. The ramp of the ramp length is used to generate the sub-frame gains for the specific audio object. A sound field represented by the audio objects with the sub-frame gains is rendered by audio speakers.
Abstract:
A method performed in an audio decoder for reconstructing an original audio signal having a lowband portion and a highband portion is disclosed. The method includes receiving an encoded audio signal and extracting reconstruction parameters from the encoded audio signal. The method further includes decoding the encoded audio signal with a core audio decoder to obtain a decoded lowband portion and regenerating the highband portion based at least in part on a cross over frequency and the decoded lowband portion to obtain a regenerated highband portion. The method also includes creating a synthetic sinusoid with a level based at least in part on a spectral envelope value for the particular subband and a noise floor value for the particular subband and adding the synthetic sinusoid to the regenerated highband portion in the particular frequency band specified by the location information. Finally, the method includes combining the lowband portion and the regenerated highband portion to obtain a full bandwidth audio signal.
Abstract:
In an audio encoder, for audio content received in a source audio format, default gains are generated based on a default dynamic range compression (DRC) curve, and non-default gains are generated for a non-default gain profile. Based on the default gains and non-default gains, differential gains are generated. An audio signal comprising the audio content, the default DRC curve, and differential gains is generated. In an audio decoder, the default DRC curve and the differential gains are identified from the audio signal. Default gains are re-generated based on the default DRC curve. Based on the combination of the re-generated default gains and the differential gains, operations are performed on the audio content extracted from the audio signal.
Abstract:
Methods for generating an AV bitstream (e.g., an MPEG-2 transport stream or bitstream segment having adaptive streaming format) such that the AV bitstream includes at least one video I-frame synchronized with at least one audio I-frame, e.g., including by re-authoring at least one video or audio frame (as a re-authored I-frame or a re-authored P-frame). Typically, a segment of content of the AV bitstream which includes the re-authored frame starts with an I-frame and includes at least one subsequent P-frame. Other aspects are methods for adapting such an AV bitstream, audio/video processing units configured to perform any embodiment of the inventive method, and audio/video processing units which include a buffer memory which stores at least one segment of an AV bitstream generated in accordance with any embodiment of the inventive method.
Abstract:
The present document describes a method (800) for allocating bits to a frame (301) of a sequence of frames (301) to yield a bitstream having a constant average bitrate, wherein the frame (301) comprises audio data and metadata. The method (800) comprises maintaining (801) an overall bit reservoir (100) and maintaining (802) a virtual bit reservoir (510) being a subset of the overall bit reservoir (100), such that bits for the metadata of the frame (301) are allocated from the virtual bit reservoir (510) and such that bits for the audio data of the frame (301) are allocated from the overall bit reservoir (100).
Abstract:
In an audio encoder, for audio content received in a source audio format, default gains are generated based on a default dynamic range compression (DRC) curve, and non-default gains are generated for a non-default gain profile. Based on the default gains and non-default gains, differential gains are generated. An audio signal comprising the audio content, the default DRC curve, and differential gains is generated. In an audio decoder, the default DRC curve and the differential gains are identified from the audio signal. Default gains are re-generated based on the default DRC curve. Based on the combination of the re-generated default gains and the differential gains, operations are performed on the audio content extracted from the audio signal.
Abstract:
A method (600) for decoding an encoded audio signal (102) is described. The encoded audio signal (102) comprises a sequence of frames. Furthermore, the encoded audio signal (102) is indicative of a plurality of different dynamic range control (DRC) profiles for a corresponding plurality of different rendering modes. Different subsets of DRC profiles from the plurality of DRC profiles are comprised within different frames of the sequence of frames, such that two or more frames of the sequence of frames jointly comprise the plurality of DRC profiles. The method (600) comprises determining a first rendering mode from the plurality of different rendering modes; determining (609, 610) one or more DRC profiles from a subset of DRC profiles comprised within a current frame of the sequence of frames; determining (611) whether at least one of the one or more DRC profiles is applicable to the first rendering mode; selecting (604) a default DRC profile as a current DRC profile, if none of the one or more DRC profiles is applicable to the first rendering mode; wherein definition data of the default DRC profile is known at a decoder (100) for decoding the encoded audio signal (102); and decoding the current frame using the current DRC profile.