Abstract:
An apparatus for generating a merged audio data stream is provided. The apparatus comprises a demultiplexer (180) for obtaining a plurality of single-layer audio data streams. The demultiplexer (180) is adapted to receive one or more input audio data streams, each comprising one or more layers, and to demultiplex each input audio data stream having two or more layers into two or more demultiplexed audio data streams having exactly one layer each, such that the demultiplexed audio data streams together comprise the layers of that input audio data stream. Furthermore, the apparatus comprises a merging module (190) for generating the merged audio data stream, having one or more layers, based on the plurality of single-layer audio data streams. Each layer of the input audio data streams, of the demultiplexed audio data streams, of the single-layer audio data streams and of the merged audio data stream comprises a pressure value of a pressure signal, a position value and a diffuseness value as audio data.
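The demultiplex-then-merge data flow can be illustrated with a minimal sketch. The `Layer` class, the function names and the concatenating merge are assumptions made for illustration only; the abstract does not say how the merging module (190) combines the single-layer streams.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass(frozen=True)
class Layer:
    """One layer of audio data (hypothetical container)."""
    pressure: complex                      # pressure value of a pressure signal
    position: Tuple[float, float, float]   # position value
    diffuseness: float                     # diffuseness value


def demultiplex(stream: List[Layer]) -> List[List[Layer]]:
    """Split one input stream into streams holding exactly one layer each."""
    return [[layer] for layer in stream]


def merge(single_layer_streams: List[List[Layer]]) -> List[Layer]:
    """Naive merge (assumption): collect every single layer into one stream."""
    return [stream[0] for stream in single_layer_streams]


def merge_inputs(input_streams: List[List[Layer]]) -> List[Layer]:
    """Demultiplex all input streams, then merge the single-layer streams."""
    singles = [s for stream in input_streams for s in demultiplex(stream)]
    return merge(singles)
```

In this sketch an input stream that already has a single layer passes through the demultiplexer unchanged.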
Abstract:
An audio signal encoder (600) for providing a downmix signal representation (614) and object-related parametric information (616) on the basis of a plurality of object signals (x_1 to x_N) comprises a downmixer (620) and a side information provider (630). The downmixer (620) is configured to provide one or more downmix signals in dependence on downmix coefficients (d_1 to d_N) associated with the object signals (x_1 to x_N), such that the one or more downmix signals comprise a superposition of a plurality of object signals. The side information provider (630) is configured to provide inter-object-relationship side information (OLD, IOC) describing level differences and correlation characteristics of the object signals (x_1 to x_N), and individual-object side information describing one or more individual properties of the individual object signals (x_1 to x_N).
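A rough sketch of the two encoder outputs follows: a mono downmix formed as a weighted superposition, plus level-difference and correlation parameters. The formulas used for OLD (energy relative to the strongest object) and IOC (normalized cross-correlation) are one common convention and are assumed here; the abstract does not define them, and the function names are hypothetical.

```python
import math
from typing import List, Sequence


def downmix(objects: Sequence[Sequence[float]],
            coeffs: Sequence[float]) -> List[float]:
    """Mono downmix: sample-wise sum of d_i * x_i over all objects."""
    length = len(objects[0])
    return [sum(d * x[n] for d, x in zip(coeffs, objects))
            for n in range(length)]


def object_level_differences(objects: Sequence[Sequence[float]]) -> List[float]:
    """Energy of each object relative to the strongest one (assumed definition)."""
    energies = [sum(s * s for s in x) for x in objects]
    peak = max(energies) or 1.0   # avoid division by zero for all-silent input
    return [e / peak for e in energies]


def inter_object_correlation(x: Sequence[float], y: Sequence[float]) -> float:
    """Normalized cross-correlation of two objects (assumed definition)."""
    num = sum(a * b for a, b in zip(x, y))
    den = math.sqrt(sum(a * a for a in x) * sum(b * b for b in y))
    return num / den if den else 0.0
```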
Abstract:
An apparatus for encoding an audio signal having a plurality of channels is provided. The apparatus comprises a downmixer (1010) for downmixing the plurality of channels to obtain a downmix signal. Moreover, the apparatus comprises a residual signal calculator (1020) adapted for calculating a residual signal. Furthermore, the apparatus comprises a phase information calculator (1030) adapted for calculating information on a phase difference between the downmix signal and the residual signal to obtain phase information. Finally, the apparatus comprises an output generator (1040) for outputting the phase information.
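For a single complex spectral bin of a two-channel signal, the chain of downmixer, residual calculator and phase information calculator might look like the sketch below. The sum/difference (mid/side style) definitions of downmix and residual are assumptions; the abstract does not state how either signal is computed.

```python
import cmath
from typing import Tuple


def encode_bin(left: complex, right: complex) -> Tuple[complex, complex, float]:
    """Return (downmix, residual, phase difference) for one spectral bin."""
    downmix = 0.5 * (left + right)    # assumed downmix rule
    residual = 0.5 * (left - right)   # assumed residual rule
    # Phase of downmix relative to residual, in radians within (-pi, pi].
    phase_diff = cmath.phase(downmix * residual.conjugate())
    return downmix, residual, phase_diff
```

With these assumed definitions, `left = 1` and `right = 1j` give a downmix of `0.5 + 0.5j`, a residual of `0.5 - 0.5j`, and a phase difference of π/2.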
Abstract:
An apparatus for providing an upmix signal representation on the basis of a downmix signal representation and object-related parametric information, which are included in a bitstream representation of an audio content, and in dependence on rendering information, has a distortion limiter configured to adjust upmix parameters using a distortion control scheme to avoid or limit audible distortions caused by an inappropriate choice of rendering parameters. The distortion limiter is configured to obtain a distortion limitation control parameter, which is included in the bitstream representation of the audio content, and to adjust the distortion control scheme in dependence on the distortion limitation control parameter.
Abstract:
The invention relates to a device (100) for encoding a sequence of sample values of an audio signal, wherein each sample value within the sequence has an original position. The device (100) comprises a system (110) for sorting the sample values according to size in order to obtain a sorted sequence of sample values, wherein each sample value has a sorting position within the sorted sequence. Furthermore, the device (100) comprises a system (120) for encoding the sorted sample values and information about the relationship between the original positions and the sorting positions of the sample values.
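The sort-plus-position idea can be sketched as follows, assuming the values are sorted in ascending order and the position relationship is stored as a plain list of original indices. How the sorted values and the permutation are actually entropy-coded is not described in the abstract and is left out; the function names are hypothetical.

```python
from typing import List, Sequence, Tuple


def sort_encode(samples: Sequence[int]) -> Tuple[List[int], List[int]]:
    """Return the sorted values and, per sorting position, the original position."""
    order = sorted(range(len(samples)), key=lambda i: samples[i])
    return [samples[i] for i in order], order


def sort_decode(sorted_vals: Sequence[int], order: Sequence[int]) -> List[int]:
    """Put every sorted value back at its original position."""
    out = [0] * len(order)
    for sorting_pos, original_pos in enumerate(order):
        out[original_pos] = sorted_vals[sorting_pos]
    return out
```

The sorted sequence is monotonic, which is what makes it cheap to describe; the cost moves into the permutation that maps sorting positions back to original positions.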
Abstract:
A noise filler for providing a noise-filled spectral representation of an audio signal on the basis of an input spectral representation of the audio signal comprises a spectral region identifier configured to identify spectral regions of the input spectral representation spaced from non-zero spectral regions of the input spectral representation by at least one intermediate spectral region, to obtain identified spectral regions, and a noise inserter configured to selectively introduce noise into the identified spectral regions to obtain the noise-filled spectral representation of the audio signal. A noise filling parameter calculator for providing a noise filling parameter on the basis of a quantized spectral representation of an audio signal comprises a spectral region identifier, as mentioned above, and a noise value calculator configured to selectively consider quantization errors of the identified spectral regions for a calculation of the noise filling parameter. Accordingly, an encoded audio signal representation representing the audio signal can be obtained.
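A minimal sketch of the region identification and noise insertion follows, assuming a spectrum given as a list of quantized bins, a guard distance of one bin as the "intermediate spectral region", and uniformly distributed noise. The names `identify_regions`, `fill_noise`, `guard` and `level` are hypothetical.

```python
import random
from typing import List, Optional, Sequence


def identify_regions(spectrum: Sequence[float], guard: int = 1) -> List[int]:
    """Indices of zero bins with no non-zero bin within `guard` bins."""
    n = len(spectrum)
    identified = []
    for k in range(n):
        lo, hi = max(0, k - guard), min(n, k + guard + 1)
        if all(spectrum[j] == 0 for j in range(lo, hi)):
            identified.append(k)
    return identified


def fill_noise(spectrum: Sequence[float], level: float, guard: int = 1,
               rng: Optional[random.Random] = None) -> List[float]:
    """Insert noise only into the identified bins; other bins stay untouched."""
    if rng is None:
        rng = random.Random(0)   # fixed seed keeps the sketch reproducible
    out = list(spectrum)
    for k in identify_regions(spectrum, guard):
        out[k] = rng.uniform(-level, level)
    return out
```

Zero bins directly adjacent to a non-zero bin are left at zero, so the inserted noise stays spaced from the tonal components.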
Abstract:
A compact encoded representation of information values not exceeding a predefined size can be derived by comparing a first encoding rule, which generates a variable-length encoded representation of the information values, with a second encoding rule, which generates a fixed-length encoded representation, and choosing the encoding rule whose encoded representation requires the lower number of information units. Thus, the maximum bit rate can be guaranteed to be at most the bit rate of the second encoding rule deriving the second encoded representation. By signaling the choice of the encoding rule through rule information transmitted together with the encoded representation of the information values, the correct information values can later be derived on the decoder side, using a decoding rule that matches the encoding rule used during encoding.
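The selection between the two encoding rules can be sketched with bit strings. A unary code stands in for the variable-length rule and a one-bit flag for the rule information; both are assumptions chosen for brevity, not the codes used in the original work. Values are assumed to be non-negative integers below `2**width`.

```python
from typing import List, Sequence


def unary(value: int) -> str:
    """Stand-in variable-length code: `value` ones followed by a zero."""
    return "1" * value + "0"


def fixed(value: int, width: int) -> str:
    """Fixed-length code: plain binary, zero-padded to `width` bits."""
    return format(value, f"0{width}b")


def encode(values: Sequence[int], width: int) -> str:
    """Encode with both rules, keep the shorter one, prefix a one-bit flag."""
    var_bits = "".join(unary(v) for v in values)
    fix_bits = "".join(fixed(v, width) for v in values)
    if len(var_bits) < len(fix_bits):
        return "0" + var_bits
    return "1" + fix_bits


def decode(bits: str, count: int, width: int) -> List[int]:
    """Read the flag, then apply the matching decoding rule."""
    flag, payload = bits[0], bits[1:]
    if flag == "1":
        return [int(payload[i * width:(i + 1) * width], 2) for i in range(count)]
    values, run = [], 0
    for bit in payload:
        if bit == "1":
            run += 1
        else:
            values.append(run)
            run = 0
    return values
```

In this sketch the output never exceeds `1 + count * width` bits, which is the fixed-length size plus the flag.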