Abstract:
The present document relates to audio decoding and/or audio transcoding. In particular, the present document relates to a scheme for efficiently decoding a number M of audio channels from a bitstream comprising a higher number N of audio channels. In this context, a method and system for merging a first and a second source set of spectral band replication (SBR) parameters into a target set of SBR parameters are described. The first and second source sets comprise a first and a second frequency band partitioning, respectively, which differ from one another. The first source set comprises a first set of energy related values associated with frequency bands of the first frequency band partitioning. The second source set comprises a second set of energy related values associated with frequency bands of the second frequency band partitioning. The target set comprises a target energy related value associated with an elementary frequency band. The method comprises the steps of breaking up the first and the second frequency band partitioning into a joint grid comprising the elementary frequency band; assigning a first value of the first set of energy related values to the elementary frequency band; assigning a second value of the second set of energy related values to the elementary frequency band; and combining the first and second values to yield the target energy related value for the elementary frequency band.
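The merging steps in this abstract can be illustrated with a small sketch. It assumes each partitioning is given as a sorted list of band borders (bin indices), that both partitionings cover the same frequency range, and that the two assigned values are combined by an arithmetic mean. The abstract does not specify the combination rule, and the function and parameter names are hypothetical.

```python
from bisect import bisect_right


def merge_band_energies(borders_a, energies_a, borders_b, energies_b):
    """Merge two banded energy sets onto a joint grid of elementary bands.

    borders_x: sorted band borders, len(borders_x) == len(energies_x) + 1.
    Returns (joint_borders, merged_energies).
    """
    if borders_a[0] != borders_b[0] or borders_a[-1] != borders_b[-1]:
        raise ValueError("both partitionings must cover the same range")

    # Joint grid: the union of all band borders of both partitionings.
    joint = sorted(set(borders_a) | set(borders_b))

    merged = []
    for lower in joint[:-1]:
        # Find the source band of each partitioning that contains this
        # elementary band and assign that band's value to it.
        value_a = energies_a[bisect_right(borders_a, lower) - 1]
        value_b = energies_b[bisect_right(borders_b, lower) - 1]
        # Assumed combination rule: arithmetic mean of the two values.
        merged.append(0.5 * (value_a + value_b))
    return joint, merged


if __name__ == "__main__":
    grid, energies = merge_band_energies([0, 4, 8], [2.0, 6.0],
                                         [0, 2, 8], [4.0, 8.0])
    print(grid, energies)
```

Because the joint grid contains every border of both partitionings, each elementary band lies entirely inside exactly one band of each source set, so the lookup is unambiguous.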
Abstract:
The present document relates to the technical field of audio coding, decoding and processing. It specifically relates to methods of recovering high frequency content of an audio signal from low frequency content of the same audio signal in an efficient manner. A method for determining a first banded tonality value (311, 312) for a first frequency subband (205) of an audio signal is described. The first banded tonality value (311, 312) is used for approximating a high frequency component of the audio signal based on a low frequency component of the audio signal. The method comprises determining a set of transform coefficients in a corresponding set of frequency bins based on a block of samples of the audio signal; determining a set of bin tonality values (341) for the set of frequency bins using the set of transform coefficients, respectively; and combining a first subset of two or more of the set of bin tonality values (341) for two or more corresponding adjacent frequency bins of the set of frequency bins lying within the first frequency subband, thereby yielding the first banded tonality value (311, 312) for the first frequency subband.
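The three steps named in this abstract (transform, per-bin tonality, combination over adjacent bins) can be sketched as follows. The abstract does not say how a bin tonality value is computed or how the values are combined, so the per-bin measure (a bin's share of the power in its three-bin neighbourhood) and the averaging rule are both assumptions chosen for illustration. All function names are hypothetical.

```python
import cmath


def dft(block):
    """Naive DFT of a real block; returns bins 0 .. len(block) // 2."""
    n = len(block)
    return [
        sum(x * cmath.exp(-2j * cmath.pi * k * t / n) for t, x in enumerate(block))
        for k in range(n // 2 + 1)
    ]


def bin_tonality(coeffs):
    """Assumed per-bin measure: a bin's share of its 3-bin neighbourhood power."""
    power = [abs(c) ** 2 for c in coeffs]
    values = []
    for k in range(len(power)):
        total = sum(power[max(0, k - 1):k + 2])
        values.append(power[k] / total if total > 0 else 0.0)
    return values


def banded_tonality(bin_values, first_bin, last_bin):
    """Combine two or more adjacent bin values (assumed rule: mean)."""
    subset = bin_values[first_bin:last_bin + 1]
    if len(subset) < 2:
        raise ValueError("a band must span at least two adjacent bins")
    return sum(subset) / len(subset)


if __name__ == "__main__":
    block = [1.0, 0.0, -1.0, 0.0, 1.0, 0.0, -1.0, 0.0]  # sinusoid at bin 2
    per_bin = bin_tonality(dft(block))
    print(banded_tonality(per_bin, 1, 3))
```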
Abstract:
Methods and apparatuses are provided for decoding and encoding audio signals. In particular, a method for decoding includes receiving a waveform-coded signal having a spectral content corresponding to a subset of the frequency range above a cross-over frequency. The waveform-coded signal is interleaved with a parametric high frequency reconstruction of the audio signal above the cross-over frequency. In this way, an improved reconstruction of the high frequency bands of the audio signal is achieved.
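One way to picture the interleaving is a per-bin selection above the cross-over frequency. The sketch below assumes that waveform-coded bins replace the parametric reconstruction wherever they are present; the abstract does not state whether the two are added or substituted. The data layout (lists and a dict keyed by absolute bin index) and all names are hypothetical.

```python
def interleave_high_band(low_band, parametric_high, waveform_high):
    """Build a full spectrum from low band, parametric HFR, and waveform bins.

    low_band:        decoded bins below the cross-over (index 0 .. crossover-1)
    parametric_high: parametric reconstruction for every bin above the cross-over
    waveform_high:   {absolute_bin_index: value} for the waveform-coded subset
    """
    crossover = len(low_band)
    top = crossover + len(parametric_high)
    if any(k < crossover or k >= top for k in waveform_high):
        raise ValueError("waveform-coded bins must lie above the cross-over")

    spectrum = list(low_band)
    for offset, parametric_value in enumerate(parametric_high):
        k = crossover + offset
        # Assumed rule: a waveform-coded bin replaces the parametric value.
        spectrum.append(waveform_high.get(k, parametric_value))
    return spectrum


if __name__ == "__main__":
    print(interleave_high_band([1.0, 2.0], [0.5, 0.5, 0.5], {3: 0.9}))
```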
Abstract:
A method, an apparatus, a computer-readable storage medium configured with instructions for carrying out a method, and logic encoded in one or more computer-readable tangible media to carry out actions are described. The method is to decode audio data that includes N.n channels to M.m decoded audio channels, including unpacking metadata and unpacking and decoding frequency domain exponent and mantissa data; determining transform coefficients from the unpacked and decoded frequency domain exponent and mantissa data; inverse transforming the frequency domain data; and in the case M
Abstract:
A method comprising: receiving and decoding a coded bitstream encoded with audio content including first audio objects corresponding to a first media content type of two consecutive media content types and second audio objects corresponding to a second media content type of the two consecutive media content types, and audio metadata corresponding to the audio content, the audio metadata including first and second audio object gains, respectively for the first and second audio objects, generated at least in part based on a first fading curve of the first media content type and a second fading curve of the second media content type, respectively; applying the first and second audio object gains to the first and second audio objects, respectively; and rendering a sound field represented by the first audio object with the applied first audio object gain and the second audio object with the applied second audio object gain.
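The gain application described in this abstract can be sketched on toy data. The sketch assumes linear fading curves and stands in for rendering with a plain sample-wise sum of the gained objects. Neither choice is stated in the abstract, a real renderer would also use object position metadata, and all names are hypothetical.

```python
def linear_fade_gains(num_samples, fade_out):
    """Assumed fading curve: a linear ramp over num_samples gain values."""
    if num_samples < 2:
        raise ValueError("need at least two samples for a ramp")
    ramp = [i / (num_samples - 1) for i in range(num_samples)]
    return [1.0 - r for r in ramp] if fade_out else ramp


def apply_gains(samples, gains):
    """Apply per-sample object gains to one audio object."""
    if len(samples) != len(gains):
        raise ValueError("samples and gains must have equal length")
    return [s * g for s, g in zip(samples, gains)]


def render_mix(objects):
    """Stand-in renderer: sample-wise sum of all gained objects."""
    return [sum(frame) for frame in zip(*objects)]


if __name__ == "__main__":
    first = apply_gains([1.0, 1.0, 1.0], linear_fade_gains(3, fade_out=True))
    second = apply_gains([2.0, 2.0, 2.0], linear_fade_gains(3, fade_out=False))
    print(render_mix([first, second]))
```

In the scheme described above, the gains arrive in the bitstream metadata; `linear_fade_gains` only generates plausible values so that the example is self-contained.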