Concept for audio encoding and decoding for audio channels and audio objects
    1.
    发明公开
    Concept for audio encoding and decoding for audio channels and audio objects 审中-公开
    Konzept zur Audiocodierung und AudiodecodierungfürAudiokanäleund Audioobjekte

    公开(公告)号:EP2830045A1

    公开(公告)日:2015-01-28

    申请号:EP13177378.0

    申请日:2013-07-22

    IPC分类号: G10L19/008

    摘要: Audio encoder for encoding audio input data (101) to obtain audio output data (501) comprises an input interface (100) for receiving a plurality of audio channels, a plurality of audio objects and metadata related to one or more of the plurality of audio objects; a mixer (200) for mixing the plurality of objects and the plurality of channels to obtain a plurality of pre-mixed channels, each pre-mixed channel comprising audio data of a channel and audio data of at least one object; a core encoder (300) for core encoding core encoder input data; and a metadata compressor (400) for compressing the metadata related to the one or more of the plurality of audio objects, wherein the audio encoder is configured to operate in at least one mode of the group of two modes comprising a first mode, in which the core encoder is configured to encode the plurality of audio channels and the plurality of audio objects received by the input interface as core encoder input data, and a second mode, in which the core encoder (300) is configured for receiving, as the core encoder input data, the plurality of pre-mixed channels generated by the mixer (200).

    摘要翻译: 用于编码音频输入数据(101)以获得音频输出数据(501)的音频编码器包括用于接收多个音频通道的输入接口(100),与多个音频中的一个或多个音频相关的多个音频对象和元数据 对象; 混合器(200),用于混合多个对象和多个通道以获得多个预混合通道,每个预混合通道包括通道的音频数据和至少一个对象的音频数据; 核心编码器(300),用于核心编码核心编码器输入数据; 以及用于压缩与所述多个音频对象中的一个或多个音频对象有关的元数据的元数据压缩器(400),其中所述音频编码器被配置为在包括第一模式的两种模式的组的至少一种模式中操作,其中 核心编码器被配置为将由输入接口接收的多个音频频道和多个音频对象编码为核心编码器输入数据,以及第二模式,其中核心编码器(300)被配置为接收作为核心 编码器输入数据,由混合器(200)产生的多个预混频道。

    Apparatus and method for low delay object metadata coding
    2.
    发明公开
    Apparatus and method for low delay object metadata coding 审中-公开
    Vorrichtung und Verfahren zurverzögerungsarmenCodierung von Objektmetadaten

    公开(公告)号:EP2830047A1

    公开(公告)日:2015-01-28

    申请号:EP13189279.6

    申请日:2013-10-18

    IPC分类号: G10L19/008

    摘要: An apparatus (100) for generating one or more audio channels is provided. The apparatus comprises a metadata decoder (110) for generating one or more reconstructed metadata signals (x 1 ',...,x N ') from one or more processed metadata signals (z 1 ,...,z N ) depending on a control signal (b), wherein each of the one or more reconstructed metadata signals (x 1 ',...,x N ') indicates information associated with an audio object signal of one or more audio object signals, wherein the metadata decoder (110) is configured to generate the one or more reconstructed metadata signals (x 1 ',...,x N ') by determining a plurality of reconstructed metadata samples (x 1 '(n),...,x N '(n)) for each of the one or more reconstructed metadata signals (x 1 ',...,x N '). Moreover, the apparatus comprises an audio channel generator (120) for generating the one or more audio channels depending on the one or more audio object signals and depending on the one or more reconstructed metadata signals (x 1 ',...,x N '). The metadata decoder (110) is configured to receive a plurality of processed metadata samples (z 1 (n),...,z N (n)) of each of the one or more processed metadata signals (z 1 ,...z N ). Moreover, the metadata decoder (110) is configured to receive the control signal (b). Furthermore, the metadata decoder (110) is configured to determine each reconstructed metadata sample (x i '(n)) of the plurality of reconstructed metadata samples (x i '(1),... x i '(n-1), x i '(n)) of each reconstructed metadata signal (x i ') of the one or more reconstructed metadata signals (x 1 ',...,x N '), so that, when the control signal (b) indicates a first state (b(n)=0), said reconstructed metadata sample (x i '(n)) is a sum of one of the processed metadata samples (z i (n)) of one of the one or more processed metadata signals (z i ) and of another already generated reconstructed metadata sample (x i '(n-1)) of said reconstructed metadata signal (x i '), and so that, when the control signal indicates a second state (b(n)=1) being different from the first state, said reconstructed metadata sample (x i '(n)) is said one (z i (n)) of the processed metadata samples (z i (1),...,z i (n)) of said one (z i ) of the one or more processed metadata signals (z 1 ,... ,z N ). Moreover, an apparatus (250) for generating encoded audio information is provided.

    摘要翻译: 提供了一种用于产生一个或多个音频通道的装置(100)。 该装置包括一个元数据解码器(110),用于根据来自一个或多个经处理的元数据信号(z 1,...,z N)生成一个或多个重建的元数据信号(x 1',...,x N'), 控制信号(b),其中所述一个或多个重建的元数据信号(x 1',...,x N')中的每一个指示与一个或多个音频对象信号的音频对象信号相关联的信息,其中所述元数据解码器 (110)被配置为通过确定多个重构的元数据样本(x 1'(n),...,x N')来生成一个或多个重构的元数据信号(x 1',...,x N' (x 1',...,x N')中的每一个重建元数据信号(n)。 此外,该装置包括用于根据一个或多个音频对象信号产生一个或多个音频信道的音频信道发生器(120),并且取决于一个或多个重构的元数据信号(x 1',...,x N “)。 元数据解码器(110)被配置为接收一个或多个处理的元数据信号(z 1,...)中的每一个的多个经处理的元数据样本(z 1(n),...,z N(n) z N)。 此外,元数据解码器(110)被配置为接收控制信号(b)。 此外,元数据解码器(110)被配置为确定多个重构元数据样本(xi'(1),... xi'(n-1),xi'(n-1)中的每个重构的元数据样本(xi' (x 1',...,x N')的每个重构的元数据信号(xi')的每个重建的元数据信号(xi')的值(n)),使得当控制信号(b) b(n)= 0),所述重建的元数据样本(xi'(n))是一个或多个处理的元数据信号(zi)中的一个被处理的元数据样本(zi(n))之一和 另一个已经生成的所述重构的元数据信号(xi')的重构元数据样本(xi'(n-1)),并且使得当控制信号指示第二状态(b(n)= 1) 所述重构的元数据样本(xi'(n))是所述一个(zi)的所处理的元数据样本(zi(1),...,zi(n))的所述一个(zi(n) 一个或多个经处理的元数据信号(z 1,...,z N)。 此外,提供了一种用于生成编码音频信息的装置(250)。

    Audio decoder, audio encoder, method for providing at least four audio channel signals on the basis of an encoded representation, method for providing an encoded representation on the basis of at least four audio channel signals and computer program using a bandwidth extension
    3.
    发明公开
    Audio decoder, audio encoder, method for providing at least four audio channel signals on the basis of an encoded representation, method for providing an encoded representation on the basis of at least four audio channel signals and computer program using a bandwidth extension 审中-公开
    音频解码器,音频编码器,提供基于一个编码表示至少四个音频信道信号的方法,基于至少四个音频信道信号和计算机程序有带宽扩展提供了一个编码表示的方法

    公开(公告)号:EP2830052A1

    公开(公告)日:2015-01-28

    申请号:EP13189306.7

    申请日:2013-10-18

    IPC分类号: G10L19/008 G10L21/038

    摘要: An audio decoder for providing at least four bandwidth-extended channel signals on the basis of an encoded representation is configured to provide a first downmix signal and a second downmix signal on the basis of a jointly encoded representation of the first downmix signal and the second downmix signal using a multi-channel decoding. The audio decoder is configured to provide at least a first audio channel signal and a second audio channel signal on the basis of the first downmix signal using a multi-channel decoding. The audio decoder is configured to provide at least a third audio channel signal and a fourth audio channel signal on the basis of the second downmix signal using a multi-channel decoding. The audio decoder is configured to perform a multi-channel bandwidth extension on the basis of the first audio channel signal and the third audio channel signal, to obtain a first bandwidth-extended channel signal and a third bandwidth-extended channel signal. The audio decoder is configured to perform a multi-channel bandwidth extension on the basis of the second audio channel signal and the fourth audio channel signal, to obtain a second bandwidth extended channel signal and a fourth bandwidth extended channel signal. An audio encoder uses a related concept.

    摘要翻译: 用于编码表示的基础上,提供至少四个带宽扩展信道信号的音频解码器被配置为提供一个第一混合信号和所述第一混合信号的联合编码表示的基础上,第二缩混信号和所述第二下混 利用信号的多声道解码。 音频解码器被配置为提供至少一个第一音频信道信号,并使用多通道解码所述第一缩混信号的基础上的第二音频信道信号。 音频解码器被配置为提供至少一个第三通道的音频信号,并使用多通道解码所述第二缩混信号的基础上的第四信道的音频信号。 音频解码器被配置为将第一音频信道信号和第三音频信道信号的基础上执行多通道带宽扩展,以获得第一带宽扩展信道信号和第三带宽扩展信道信号。 音频解码器被配置为将第二音频信道信号和所述第四音频信道信号的基础上执行多通道带宽扩展,以获得第二带宽扩展信道信号和第四信道带宽扩展信号。 音频编码器使用相关的概念。

    Apparatus and method for efficient object metadata coding
    4.
    发明公开
    Apparatus and method for efficient object metadata coding 审中-公开
    Vorrichtung und Verfahren zur effizienten Codierung von Objektmetadaten

    公开(公告)号:EP2830049A1

    公开(公告)日:2015-01-28

    申请号:EP13189284.6

    申请日:2013-10-18

    IPC分类号: G10L19/008

    摘要: An apparatus (100) for generating one or more audio channels is provided. The apparatus (100) comprises a metadata decoder (110) for receiving one or more compressed metadata signals. Each of the one or more compressed metadata signals comprises a plurality of first metadata samples. The first metadata samples of each of the one or more compressed metadata signals indicate information associated with an audio object signal of one or more audio object signals. The metadata decoder (110) is configured to generate one or more reconstructed metadata signals, so that each of the one or more reconstructed metadata signals comprises the first metadata samples of one of the one or more compressed metadata signals and further comprises a plurality of second metadata samples. Moreover, the metadata decoder (110) is configured to generate each of the second metadata samples of each reconstructed metadata signal of the one or more reconstructed metadata signals depending on at least two of the first metadata samples of said reconstructed metadata signal. Moreover, the apparatus (100) comprises an audio channel generator (120) for generating the one or more audio channels depending on the one or more audio object signals and depending on the one or more reconstructed metadata signals. Furthermore, an apparatus for generating encoded audio information comprising one or more encoded audio signals and one or more compressed metadata signals is provided.

    摘要翻译: 提供了一种用于产生一个或多个音频通道的装置(100)。 装置(100)包括用于接收一个或多个压缩的元数据信号的元数据解码器(110)。 一个或多个压缩的元数据信号中的每一个包括多个第一元数据样本。 一个或多个压缩元数据信号中的每一个的第一元数据样本表示与一个或多个音频对象信号的音频对象信号相关联的信息。 元数据解码器(110)被配置为生成一个或多个重建的元数据信号,使得一个或多个重构的元数据信号中的每一个包括一个或多个压缩的元数据信号之一的第一元数据样本,并且还包括多个第二 元数据样本。 此外,元数据解码器(110)被配置为根据所述重建的元数据信号的第一元数据样本中的至少两个生成所述一个或多个重建的元数据信号的每个重建的元数据信号的每个第二元数据样本。 此外,装置(100)包括音频信道发生器(120),用于根据一个或多个音频对象信号产生一个或多个音频信道,并且取决于一个或多个重建的元数据信号。 此外,提供了一种用于生成包括一个或多个编码音频信号和一个或多个压缩元数据信号的编码音频信息的装置。

    Renderer controlled spatial upmix
    8.
    发明公开
    Renderer controlled spatial upmix 审中-公开
    渲染器控制空间上混

    公开(公告)号:EP2830336A2

    公开(公告)日:2015-01-28

    申请号:EP13189285.3

    申请日:2013-10-18

    IPC分类号: H04S7/00 H04S5/00

    摘要: An audio decoder device for decoding a compressed input audio signal comprising
    at least one core decoder (6, 24) having one or more processors (36, 36') for generating a processor output signal (37) based on a processor input signal (38, 38'), wherein a number of output channels (37.1, 37.2, 37.1', 37.2') of the processor output signal (37, 37') is higher than a number of input channels (38.1, 38.1') of the processor input signal (38, 38'), wherein each of the one or more processors (36, 36') comprises a decorrelator (39, 39') and a mixer (40, 40'), wherein a core decoder output signal (13) having a plurality of channels (13.1, 13.2, 13.3, 13,4) comprises the processor output signal (37, 37'), and wherein the core decoder output signal (13) is suitable for a reference loudspeaker setup (42);
    at least one format converter device (9, 10) configured to convert the core decoder output signal (13) into an output audio signal (31), which is suitable for a target loudspeaker setup (45); and
    a control device (46) configured to control at least one or more processors (36, 36') in such way that the decorrelator (39, 39') of the processor (36, 36') may be controlled independently from the mixer (40, 40') of the processor (36, 36'), wherein the control device (46) is configured to control at least one of the decorrelators (39, 39') of the one or more processors (36, 36') depending on the target loudspeaker setup (45).

    摘要翻译: 一种用于解码压缩输入音频信号的音频解码器设备,包括至少一个具有一个或多个处理器(36,36')的核心解码器(6,24),用于基于处理器输入信号(38)产生处理器输出信号(37) ,38'),其中处理器输出信号(37,37')的多个输出通道(37.1,37.2,37.1',37.2')高于处理器的输入通道(38.1,38.1')的数量 输入信号(38,38'),其中一个或多个处理器(36,36')中的每一个包括解相关器(39,39')和混合器(40,40'),其中核心解码器输出信号 )具有多个通道(13.1,13.2,13.3,13.4)的处理器包括处理器输出信号(37,37'),并且其中核心解码器输出信号(13)适用于参考扬声器设置(42)。 至少一个格式转换器设备(9,10),被配置为将核心解码器输出信号(13)转换为适合于目标扬声器设置(45)的输出音频信号(31); 以及控制设备(46),其被配置为以如下方式控制至少一个或多个处理器(36,36'):处理器(36,36')的解相关器(39,39')可独立于混频器 (36,36')的所述解相关器(39,40')中的至少一个解相关器(39,39'),其中所述控制装置(46)被配置为控制所述一个或多个处理器 )取决于目标扬声器设置(45)。

    Audio encoder, audio decoder, methods and computer program using jointly encoded residual signals
    9.
    发明公开
    Audio encoder, audio decoder, methods and computer program using jointly encoded residual signals 审中-公开
    音频编码器,音频解码器,方法和计算机程序使用联合编码的残差信号

    公开(公告)号:EP2830051A2

    公开(公告)日:2015-01-28

    申请号:EP13189305.9

    申请日:2013-10-18

    IPC分类号: G10L19/008 G10L21/038

    摘要: An audio decoder for providing at least four audio channel signals on the basis of an encoded representation is configured to provide a first residual signal and a second residual signal on the basis of a jointly encoded representation of the first residual signal and of the second residual signal using a multi-channel decoding. The audio decoder is configured to provide a first audio channel signal and a second audio channel signal on the basis of a first downmix signal and the first residual signal using a residual-signal-assisted multi-channel decoding. The audio decoder is configured to provide a third audio channel signal and a fourth audio channel signal on the basis of a second downmix signal and the second residual signal using a residual-signal-assisted multi-channel decoding. An audio encoder is based on corresponding considerations.

    摘要翻译: 用于基于编码表示提供至少四个音频信道信号的音频解码器被配置为基于第一残差信号和第二残差信号的联合编码表示来提供第一残差信号和第二残差信号 使用多通道解码。 音频解码器被配置为基于第一缩混信号和使用残余信号辅助的多通道解码的第一残留信号来提供第一音频通道信号和第二音频通道信号。 音频解码器被配置为基于第二缩混信号和第二残余信号使用残余信号辅助的多通道解码来提供第三音频通道信号和第四音频通道信号。 音频编码器基于相应的考虑。