Frequency-domain audio coding supporting transform length switching
    1.
    发明公开
    Frequency-domain audio coding supporting transform length switching 审中-公开
    常见问题解答

    公开(公告)号:EP2830058A1

    公开(公告)日:2015-01-28

    申请号:EP13189334.9

    申请日:2013-10-18

    IPC分类号: G10L19/022

    摘要: A frequency-domain audio codec is is provided with the ability to additionally support a certain transform length in a backward-compatible manner, by the following: the frequency-domain coefficients of a respective frame are transmitted in an interleaved manner irrespective of the signalization signaling for the frames as to which transform length actually applies, and additionally the frequency-domain coefficient extraction and the scale factor extraction operate independent from the signalization. By this measure, old-fashioned frequency-domain audio coders/decoders, insensitive for the signalization, would be able to nevertheless operate without faults and with reproducing a reasonable quality. Concurrently, frequency-domain audio coders/decoders able to support the additional transform length would offer even better quality despite the backward compatibility. As far as coding efficiency penalties due to the coding of the frequency domain coefficients in a manner transparent for older decoders are concerned, same are of comparatively minor nature due to the interleaving.

    摘要翻译: 通过以下方式,向频域音频编解码器提供额外支持特定变换长度的能力:通过交织方式发送相应帧的频域系数,而不管信令信令如何 对于实际应用哪个变换长度的帧,并且频域系数提取和比例因子提取独立于信号化操作。 通过这种措施,对信号化不敏感的老式频域音频编码器/解码器将能够无故障地运行并且再现合理的质量。 同时,即使向后兼容,能够支持附加变换长度的频域音频编码器/解码器将提供更好的质量。 关于由于对于旧解码器而言以透明的方式对频域系数的编码所引起的编码效率的惩罚是由于交织而相同的。

    Audio decoder, audio encoder, method for providing at least four audio channel signals on the basis of an encoded representation, method for providing an encoded representation on the basis of at least four audio channel signals and computer program using a bandwidth extension
    2.
    发明公开
    Audio decoder, audio encoder, method for providing at least four audio channel signals on the basis of an encoded representation, method for providing an encoded representation on the basis of at least four audio channel signals and computer program using a bandwidth extension 审中-公开
    音频解码器,音频编码器,提供基于一个编码表示至少四个音频信道信号的方法,基于至少四个音频信道信号和计算机程序有带宽扩展提供了一个编码表示的方法

    公开(公告)号:EP2830052A1

    公开(公告)日:2015-01-28

    申请号:EP13189306.7

    申请日:2013-10-18

    IPC分类号: G10L19/008 G10L21/038

    摘要: An audio decoder for providing at least four bandwidth-extended channel signals on the basis of an encoded representation is configured to provide a first downmix signal and a second downmix signal on the basis of a jointly encoded representation of the first downmix signal and the second downmix signal using a multi-channel decoding. The audio decoder is configured to provide at least a first audio channel signal and a second audio channel signal on the basis of the first downmix signal using a multi-channel decoding. The audio decoder is configured to provide at least a third audio channel signal and a fourth audio channel signal on the basis of the second downmix signal using a multi-channel decoding. The audio decoder is configured to perform a multi-channel bandwidth extension on the basis of the first audio channel signal and the third audio channel signal, to obtain a first bandwidth-extended channel signal and a third bandwidth-extended channel signal. The audio decoder is configured to perform a multi-channel bandwidth extension on the basis of the second audio channel signal and the fourth audio channel signal, to obtain a second bandwidth extended channel signal and a fourth bandwidth extended channel signal. An audio encoder uses a related concept.

    摘要翻译: 用于编码表示的基础上,提供至少四个带宽扩展信道信号的音频解码器被配置为提供一个第一混合信号和所述第一混合信号的联合编码表示的基础上,第二缩混信号和所述第二下混 利用信号的多声道解码。 音频解码器被配置为提供至少一个第一音频信道信号,并使用多通道解码所述第一缩混信号的基础上的第二音频信道信号。 音频解码器被配置为提供至少一个第三通道的音频信号,并使用多通道解码所述第二缩混信号的基础上的第四信道的音频信号。 音频解码器被配置为将第一音频信道信号和第三音频信道信号的基础上执行多通道带宽扩展,以获得第一带宽扩展信道信号和第三带宽扩展信道信号。 音频解码器被配置为将第二音频信道信号和所述第四音频信道信号的基础上执行多通道带宽扩展,以获得第二带宽扩展信道信号和第四信道带宽扩展信号。 音频编码器使用相关的概念。

    FREQUENCY-DOMAIN AUDIO CODING SUPPORTING TRANSFORM LENGTH SWITCHING

    公开(公告)号:EP4369337A3

    公开(公告)日:2024-06-26

    申请号:EP24165597.6

    申请日:2014-07-15

    摘要: A frequency-domain audio codec is provided with the ability to additionally support a certain transform length in a backward-compatible manner, by the following: the frequency-domain coefficients of a respective frame are transmitted in an interleaved manner irrespective of the signalization signaling for the frames as to which transform length actually applies, and additionally the frequency-domain coefficient extraction and the scale factor extraction operate independent from the signalization. By this measure, old-fashioned frequency-domain audio coders/decoders, insensitive for the signalization, would be able to nevertheless operate without faults and with reproducing a reasonable quality. Concurrently, frequency-domain audio coders/decoders able to support the additional transform length would offer even better quality despite the backward compatibility. As far as coding efficiency penalties due to the coding of the frequency domain coefficients in a manner transparent for older decoders are concerned, same are of comparatively minor nature due to the interleaving.

    APPARATUS AND METHOD FOR SCREEN RELATED AUDIO OBJECT REMAPPING

    公开(公告)号:EP4254988A2

    公开(公告)日:2023-10-04

    申请号:EP23167354.2

    申请日:2015-03-25

    IPC分类号: H04S7/00

    摘要: An apparatus for generating loudspeaker signals is provided. The apparatus comprises an object metadata processor (110) and an object renderer (120). The object renderer (120) is configured to receive an audio object. The object metadata processor (110) is configured to receive metadata, comprising an indication on whether the audio object is screen-related, and further comprising a first position of the audio object. The object metadata processor (110) is configured to calculate a second position of the audio object depending on the first position of the audio object and depending on a size of a screen, if the audio object is indicated in the metadata as being screen-related. The object renderer (120) is configured to generate the loudspeaker signals depending on the audio object and depending on position information. The object metadata processor (110) is configured to feed the first position of the audio object as the position information into the object renderer (120), if the audio object is indicated in the metadata as being not screen-related. The object metadata processor (110) is configured to feed the second position of the audio object as the position information into the object renderer (120), if the audio object is indicated in the metadata as being screen-related.

    FREQUENCY-DOMAIN AUDIO CODING SUPPORTING TRANSFORM LENGTH SWITCHING

    公开(公告)号:EP4369337A2

    公开(公告)日:2024-05-15

    申请号:EP24165597.6

    申请日:2014-07-15

    IPC分类号: G10L19/03

    摘要: A frequency-domain audio codec is provided with the ability to additionally support a certain transform length in a backward-compatible manner, by the following: the frequency-domain coefficients of a respective frame are transmitted in an interleaved manner irrespective of the signalization signaling for the frames as to which transform length actually applies, and additionally the frequency-domain coefficient extraction and the scale factor extraction operate independent from the signalization. By this measure, old-fashioned frequency-domain audio coders/decoders, insensitive for the signalization, would be able to nevertheless operate without faults and with reproducing a reasonable quality. Concurrently, frequency-domain audio coders/decoders able to support the additional transform length would offer even better quality despite the backward compatibility. As far as coding efficiency penalties due to the coding of the frequency domain coefficients in a manner transparent for older decoders are concerned, same are of comparatively minor nature due to the interleaving.

    FREQUENCY-DOMAIN AUDIO DECODING SUPPORTING TRANSFORM LENGTH SWITCHING

    公开(公告)号:EP4191581A1

    公开(公告)日:2023-06-07

    申请号:EP23150061.2

    申请日:2014-07-15

    摘要: A frequency-domain audio codec is provided with the ability to additionally support a certain transform length in a backward-compatible manner, by the following: the frequency-domain coefficients of a respective frame are transmitted in an interleaved manner irrespective of the signalization signaling for the frames as to which transform length actually applies, and additionally the frequency-domain coefficient extraction and the scale factor extraction operate independent from the signalization. By this measure, old-fashioned frequency-domain audio coders/decoders, insensitive for the signalization, would be able to nevertheless operate without faults and with reproducing a reasonable quality. Concurrently, frequency-domain audio coders/decoders able to support the additional transform length would offer even better quality despite the backward compatibility. As far as coding efficiency penalties due to the coding of the frequency domain coefficients in a manner transparent for older decoders are concerned, same are of comparatively minor nature due to the interleaving.

    Renderer controlled spatial upmix
    10.
    发明公开
    Renderer controlled spatial upmix 审中-公开
    渲染器控制空间上混

    公开(公告)号:EP2830336A2

    公开(公告)日:2015-01-28

    申请号:EP13189285.3

    申请日:2013-10-18

    IPC分类号: H04S7/00 H04S5/00

    摘要: An audio decoder device for decoding a compressed input audio signal comprising
    at least one core decoder (6, 24) having one or more processors (36, 36') for generating a processor output signal (37) based on a processor input signal (38, 38'), wherein a number of output channels (37.1, 37.2, 37.1', 37.2') of the processor output signal (37, 37') is higher than a number of input channels (38.1, 38.1') of the processor input signal (38, 38'), wherein each of the one or more processors (36, 36') comprises a decorrelator (39, 39') and a mixer (40, 40'), wherein a core decoder output signal (13) having a plurality of channels (13.1, 13.2, 13.3, 13,4) comprises the processor output signal (37, 37'), and wherein the core decoder output signal (13) is suitable for a reference loudspeaker setup (42);
    at least one format converter device (9, 10) configured to convert the core decoder output signal (13) into an output audio signal (31), which is suitable for a target loudspeaker setup (45); and
    a control device (46) configured to control at least one or more processors (36, 36') in such way that the decorrelator (39, 39') of the processor (36, 36') may be controlled independently from the mixer (40, 40') of the processor (36, 36'), wherein the control device (46) is configured to control at least one of the decorrelators (39, 39') of the one or more processors (36, 36') depending on the target loudspeaker setup (45).

    摘要翻译: 一种用于解码压缩输入音频信号的音频解码器设备,包括至少一个具有一个或多个处理器(36,36')的核心解码器(6,24),用于基于处理器输入信号(38)产生处理器输出信号(37) ,38'),其中处理器输出信号(37,37')的多个输出通道(37.1,37.2,37.1',37.2')高于处理器的输入通道(38.1,38.1')的数量 输入信号(38,38'),其中一个或多个处理器(36,36')中的每一个包括解相关器(39,39')和混合器(40,40'),其中核心解码器输出信号 )具有多个通道(13.1,13.2,13.3,13.4)的处理器包括处理器输出信号(37,37'),并且其中核心解码器输出信号(13)适用于参考扬声器设置(42)。 至少一个格式转换器设备(9,10),被配置为将核心解码器输出信号(13)转换为适合于目标扬声器设置(45)的输出音频信号(31); 以及控制设备(46),其被配置为以如下方式控制至少一个或多个处理器(36,36'):处理器(36,36')的解相关器(39,39')可独立于混频器 (36,36')的所述解相关器(39,40')中的至少一个解相关器(39,39'),其中所述控制装置(46)被配置为控制所述一个或多个处理器 )取决于目标扬声器设置(45)。