Abstract:
An improved concept for coding sample values of a spectral envelope is obtained by combining spectrotemporal prediction with context-based entropy coding of the prediction residuals, where the context for a current sample value is determined depending on a measure of the deviation between a pair of already coded/decoded sample values of the spectral envelope in a spectrotemporal neighborhood of the current sample value. This combination of spectrotemporal prediction and context-based entropy coding of the prediction residuals, with the context selected depending on the deviation measure, harmonizes with the nature of spectral envelopes.
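The interplay of prediction and context selection described above can be illustrated with a minimal sketch. Everything concrete here is an assumption for illustration and not taken from the abstract: the envelope is a 2D list `env[t][f]` of integer sample values, the predictor is a simple plane predictor `a + b - c`, the deviation measure is the signed difference `b - c`, and the names `neighbors`, `predict`, `context_index` and `residuals_with_contexts` are hypothetical. The entropy coder itself is omitted; the context index would merely select which probability table it uses.

```python
def neighbors(env, t, f):
    """Return three already coded neighbors of the sample at (t, f)."""
    a = env[t][f - 1]      # same time, next lower frequency
    b = env[t - 1][f]      # previous time, same frequency
    c = env[t - 1][f - 1]  # previous time, next lower frequency
    return a, b, c


def predict(a, b, c):
    """Assumed plane predictor; the abstract does not fix the predictor."""
    return a + b - c


def context_index(b, c, max_dev=2):
    """Map the deviation between a pair of neighbors to a context index.

    The deviation b - c is clamped to [-max_dev, max_dev] and shifted so
    that the index lies in [0, 2 * max_dev].
    """
    dev = max(-max_dev, min(max_dev, b - c))
    return dev + max_dev


def residuals_with_contexts(env):
    """Yield (t, f, residual, context) for all samples with full neighborhoods."""
    out = []
    for t in range(1, len(env)):
        for f in range(1, len(env[t])):
            a, b, c = neighbors(env, t, f)
            residual = env[t][f] - predict(a, b, c)
            out.append((t, f, residual, context_index(b, c)))
    return out
```

A decoder would mirror this: it derives the same context from the same already decoded neighbors, entropy-decodes the residual with that context, and adds the prediction back.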
Abstract:
An apparatus for decoding an encoded audio signal comprising an encoded representation of a first set of first spectral portions and an encoded representation of parametric data indicating spectral energies for a second set of second spectral portions, comprises: an audio decoder (900) for decoding the encoded representation (901b) of the first set of the first spectral portions to obtain a first set of first spectral portions (904) and for decoding the encoded representation of the parametric data to obtain decoded parametric data (902) for the second set of second spectral portions indicating, for individual reconstruction bands, individual energies; a frequency regenerator (906) for reconstructing spectral values in a reconstruction band (920) comprising a second spectral portion (922, 923) using a first spectral portion of the first set of the first spectral portions and an individual energy for the reconstruction band, the reconstruction band comprising a first spectral portion (921) and the second spectral portion; wherein the frequency regenerator (906) is configured for determining (912) survive energy information comprising an accumulated energy of the first spectral portion having frequency values in the reconstruction band; determining (918) tile energy information of further spectral portions (922, 923) of the reconstruction band (920) for frequency values different from the first spectral portion (921) having frequencies in the reconstruction band (920), wherein the further spectral portions (922, 923) are to be generated by frequency regeneration using a first spectral portion (302) different from the first spectral portion (921, 306) in the reconstruction band; determining (914) a missing energy in the reconstruction band (920) using the individual energy for the reconstruction band and the survive energy information; and adjusting (916) the further spectral portions in the reconstruction band based on the missing energy information and the tile energy information.
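The energy bookkeeping in the frequency regenerator can be sketched as follows. This is a simplified illustration under stated assumptions, not the claimed implementation: the individual energy is taken to be the total energy of the reconstruction band, energies are sums of squared real spectral values, the adjustment is a single gain factor `sqrt(missing / tile)`, and the function name `adjust_reconstruction_band` is hypothetical.

```python
import math


def adjust_reconstruction_band(survive_values, tile_values, band_energy):
    """Scale regenerated (tile) values so the band reaches its target energy.

    survive_values: first-spectral-portion values already in the band
    tile_values:    regenerated values for the remaining frequencies
    band_energy:    transmitted individual energy of the band
                    (assumed here to be a total, not a per-bin mean)
    """
    # Survive energy: accumulated energy of the core-decoded values.
    survive_energy = sum(v * v for v in survive_values)
    # Tile energy: energy of the regenerated values before adjustment.
    tile_energy = sum(v * v for v in tile_values)
    # Missing energy: what the regenerated part still has to contribute.
    missing_energy = max(band_energy - survive_energy, 0.0)
    if tile_energy == 0.0:
        # Nothing to scale; return the tile unchanged.
        return list(tile_values)
    gain = math.sqrt(missing_energy / tile_energy)
    return [gain * v for v in tile_values]
```

With survive values `[3.0, 4.0]` (energy 25), tile values `[1.0, 1.0]` (energy 2) and a band energy of 33, the missing energy is 8 and the gain is 2, so the surviving values stay untouched while the tile is scaled to `[2.0, 2.0]`.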
Abstract:
An audio scene encoder for encoding an audio scene, the audio scene comprising at least two component signals, comprises: a core encoder (160) for core encoding the at least two component signals, wherein the core encoder (160) is configured to generate a first encoded representation (310) for a first portion of the at least two component signals, and to generate a second encoded representation (320) for a second portion of the at least two component signals; a spatial analyzer (200) for analyzing the audio scene to derive one or more spatial parameters (330) or one or more spatial parameter sets for the second portion; and an output interface (300) for forming the encoded audio scene signal (340), the encoded audio scene signal (340) comprising the first encoded representation (310), the second encoded representation (320), and the one or more spatial parameters (330) or one or more spatial parameter sets for the second portion.
Abstract:
An audio data converter comprises: an input interface (100) for receiving an object description of an audio object having audio object metadata; a metadata converter (150, 125, 126, 148) for converting the audio object metadata into DirAC metadata; and an output interface (300) for transmitting or storing the DirAC metadata.
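One conceivable form of such a metadata conversion is sketched below. The abstract does not say which object metadata fields exist or how they map to DirAC metadata, so every detail is an assumption: the object carries a Cartesian position relative to the listener, the DirAC metadata consists of a direction of arrival (azimuth, elevation), a distance and a diffuseness value, a point-like object is given diffuseness 0, and the function name `object_to_dirac` is hypothetical.

```python
import math


def object_to_dirac(x, y, z):
    """Convert an assumed Cartesian object position to DirAC-style metadata.

    Axes are assumed as x = front, y = left, z = up, with the listener
    at the origin. Angles are returned in degrees.
    """
    distance = math.sqrt(x * x + y * y + z * z)
    azimuth = math.degrees(math.atan2(y, x))
    elevation = math.degrees(math.atan2(z, math.hypot(x, y)))
    return {
        "azimuth_deg": azimuth,
        "elevation_deg": elevation,
        "distance": distance,
        # Assumption: a single point-like object is fully directional.
        "diffuseness": 0.0,
    }
```

An object at `(1, 1, 0)` would, under these axis conventions, map to an azimuth of 45 degrees and an elevation of 0 degrees.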