Audio encoder, audio decoder, methods and computer program using jointly encoded residual signals
    11.
    发明公开
    Audio encoder, audio decoder, methods and computer program using jointly encoded residual signals 审中-公开
    音频编码器,音频解码器,方法和计算机程序使用联合编码的残差信号

    公开(公告)号:EP2830051A2

    公开(公告)日:2015-01-28

    申请号:EP13189305.9

    申请日:2013-10-18

    IPC分类号: G10L19/008 G10L21/038

    摘要: An audio decoder for providing at least four audio channel signals on the basis of an encoded representation is configured to provide a first residual signal and a second residual signal on the basis of a jointly encoded representation of the first residual signal and of the second residual signal using a multi-channel decoding. The audio decoder is configured to provide a first audio channel signal and a second audio channel signal on the basis of a first downmix signal and the first residual signal using a residual-signal-assisted multi-channel decoding. The audio decoder is configured to provide a third audio channel signal and a fourth audio channel signal on the basis of a second downmix signal and the second residual signal using a residual-signal-assisted multi-channel decoding. An audio encoder is based on corresponding considerations.

    摘要翻译: 用于基于编码表示提供至少四个音频信道信号的音频解码器被配置为基于第一残差信号和第二残差信号的联合编码表示来提供第一残差信号和第二残差信号 使用多通道解码。 音频解码器被配置为基于第一缩混信号和使用残余信号辅助的多通道解码的第一残留信号来提供第一音频通道信号和第二音频通道信号。 音频解码器被配置为基于第二缩混信号和第二残余信号使用残余信号辅助的多通道解码来提供第三音频通道信号和第四音频通道信号。 音频编码器基于相应的考虑。

    FREQUENCY-DOMAIN AUDIO CODING SUPPORTING TRANSFORM LENGTH SWITCHING
    14.
    发明公开
    FREQUENCY-DOMAIN AUDIO CODING SUPPORTING TRANSFORM LENGTH SWITCHING 审中-公开
    频域音频编码支持转换长度切换

    公开(公告)号:EP3312836A1

    公开(公告)日:2018-04-25

    申请号:EP17189418.1

    申请日:2014-07-15

    摘要: A frequency-domain audio codec is provided with the ability to additionally support a certain transform length in a backward-compatible manner, by the following: the frequency-domain coefficients of a respective frame are transmitted in an interleaved manner irrespective of the signalization signaling for the frames as to which transform length actually applies, and additionally the frequency-domain coefficient extraction and the scale factor extraction operate independent from the signalization. By this measure, old-fashioned frequency-domain audio coders/decoders, insensitive for the signalization, would be able to nevertheless operate without faults and with reproducing a reasonable quality. Concurrently, frequency-domain audio coders/decoders able to support the additional transform length would offer even better quality despite the backward compatibility. As far as coding efficiency penalties due to the coding of the frequency domain coefficients in a manner transparent for older decoders are concerned, same are of comparatively minor nature due to the interleaving.

    摘要翻译: 频域音频编解码器具有以向后兼容的方式另外支持特定变换长度的能力,通过以下方式:相应帧的频域系数以交织方式传输,而不考虑信号化信令 关于实际应用变换长度的帧,并且另外频域系数提取和比例因子提取独立于信号化进行操作。 通过这种措施,对信号不敏感的老式频域音频编码器/解码器将仍然能够无故障地工作并且重现合理的质量。 同时,尽管具有向后兼容性,但能够支持额外变换长度的频域音频编码器/解码器可以提供更好的质量。 就由于以较老的解码器而言透明的方式对频域系数进行编码而导致的编码效率处罚来说,由于交织相同,其性质相对较小。

    APPARATUS AND METHOD FOR SCREEN RELATED AUDIO OBJECT REMAPPING
    16.
    发明公开
    APPARATUS AND METHOD FOR SCREEN RELATED AUDIO OBJECT REMAPPING 审中-公开
    VORRICHTUNG UND VERFAHRENFÜRBILDSCHIRMBEZOGENE AUDIOOBJEKT-NEUABBILDUNG

    公开(公告)号:EP2928216A1

    公开(公告)日:2015-10-07

    申请号:EP14196769.5

    申请日:2014-12-08

    IPC分类号: H04S7/00 H04S3/00

    摘要: An apparatus for generating loudspeaker signals is provided. The apparatus comprises an object metadata processor (110) and an object renderer (120). The object renderer (120) is configured to receive an audio object. The object metadata processor (110) is configured to receive metadata, comprising an indication on whether the audio object is screen-related, and further comprising a first position of the audio object. The object metadata processor (110) is configured to calculate a second position of the audio object depending on the first position of the audio object and depending on a size of a screen, if the audio object is indicated in the metadata as being screen-related. The object renderer (120) is configured to generate the loudspeaker signals depending on the audio object and depending on position information. The object metadata processor (110) is configured to feed the first position of the audio object as the position information into the object renderer (120), if the audio object is indicated in the metadata as being not screen-related. The object metadata processor (110) is configured to feed the second position of the audio object as the position information into the object renderer (120), if the audio object is indicated in the metadata as being screen-related.

    摘要翻译: 提供一种用于产生扬声器信号的装置。 该装置包括对象元数据处理器(110)和对象渲染器(120)。 对象渲染器(120)被配置为接收音频对象。 对象元数据处理器(110)被配置为接收元数据,其包括关于音频对象是否是屏幕相关的指示,还包括音频对象的第一位置。 对象元数据处理器(110)被配置为根据音频对象的第一位置并根据屏幕的大小来计算音频对象的第二位置,如果音频对象在元数据中被指示为屏幕相关 。 对象渲染器(120)被配置为根据音频对象并根据位置信息产生扬声器信号。 如果音频对象在元数据中被指示为不与屏幕相关,则对象元数据处理器(110)被配置为将音频对象的第一位置作为位置信息馈送到对象渲染器(120)。 如果音频对象在元数据中被指示为与屏幕相关的话,对象元数据处理器(110)被配置为将音频对象的第二位置作为位置信息馈送到对象渲染器(120)。

    Concept for audio encoding and decoding for audio channels and audio objects
    17.
    发明公开
    Concept for audio encoding and decoding for audio channels and audio objects 审中-公开
    Konzept zur Audiocodierung und AudiodecodierungfürAudiokanäleund Audioobjekte

    公开(公告)号:EP2830045A1

    公开(公告)日:2015-01-28

    申请号:EP13177378.0

    申请日:2013-07-22

    IPC分类号: G10L19/008

    摘要: Audio encoder for encoding audio input data (101) to obtain audio output data (501) comprises an input interface (100) for receiving a plurality of audio channels, a plurality of audio objects and metadata related to one or more of the plurality of audio objects; a mixer (200) for mixing the plurality of objects and the plurality of channels to obtain a plurality of pre-mixed channels, each pre-mixed channel comprising audio data of a channel and audio data of at least one object; a core encoder (300) for core encoding core encoder input data; and a metadata compressor (400) for compressing the metadata related to the one or more of the plurality of audio objects, wherein the audio encoder is configured to operate in at least one mode of the group of two modes comprising a first mode, in which the core encoder is configured to encode the plurality of audio channels and the plurality of audio objects received by the input interface as core encoder input data, and a second mode, in which the core encoder (300) is configured for receiving, as the core encoder input data, the plurality of pre-mixed channels generated by the mixer (200).

    摘要翻译: 用于编码音频输入数据(101)以获得音频输出数据(501)的音频编码器包括用于接收多个音频通道的输入接口(100),与多个音频中的一个或多个音频相关的多个音频对象和元数据 对象; 混合器(200),用于混合多个对象和多个通道以获得多个预混合通道,每个预混合通道包括通道的音频数据和至少一个对象的音频数据; 核心编码器(300),用于核心编码核心编码器输入数据; 以及用于压缩与所述多个音频对象中的一个或多个音频对象有关的元数据的元数据压缩器(400),其中所述音频编码器被配置为在包括第一模式的两种模式的组的至少一种模式中操作,其中 核心编码器被配置为将由输入接口接收的多个音频频道和多个音频对象编码为核心编码器输入数据,以及第二模式,其中核心编码器(300)被配置为接收作为核心 编码器输入数据,由混合器(200)产生的多个预混频道。

    APPARATUS AND METHOD FOR SCREEN RELATED AUDIO OBJECT REMAPPING

    公开(公告)号:EP4254988A3

    公开(公告)日:2023-11-01

    申请号:EP23167354.2

    申请日:2015-03-25

    IPC分类号: H04S7/00 H04S3/00

    摘要: An apparatus for generating loudspeaker signals is provided. The apparatus comprises an object metadata processor (110) and an object renderer (120). The object renderer (120) is configured to receive an audio object. The object metadata processor (110) is configured to receive metadata, comprising an indication on whether the audio object is screen-related, and further comprising a first position of the audio object. The object metadata processor (110) is configured to calculate a second position of the audio object depending on the first position of the audio object and depending on a size of a screen, if the audio object is indicated in the metadata as being screen-related. The object renderer (120) is configured to generate the loudspeaker signals depending on the audio object and depending on position information. The object metadata processor (110) is configured to feed the first position of the audio object as the position information into the object renderer (120), if the audio object is indicated in the metadata as being not screen-related. The object metadata processor (110) is configured to feed the second position of the audio object as the position information into the object renderer (120), if the audio object is indicated in the metadata as being screen-related.

    APPARATUS AND METHOD FOR SCREEN RELATED AUDIO OBJECT REMAPPING

    公开(公告)号:EP3487189A1

    公开(公告)日:2019-05-22

    申请号:EP18248305.7

    申请日:2015-03-25

    IPC分类号: H04S7/00 H04S3/00

    摘要: An apparatus for generating loudspeaker signals is provided. The apparatus comprises an object metadata processor (110) and an object renderer (120). The object renderer (120) is configured to receive an audio object. The object metadata processor (110) is configured to receive metadata, comprising an indication on whether the audio object is screen-related, and further comprising a first position of the audio object. The object metadata processor (110) is configured to calculate a second position of the audio object depending on the first position of the audio object and depending on a size of a screen, if the audio object is indicated in the metadata as being screen-related. The object renderer (120) is configured to generate the loudspeaker signals depending on the audio object and depending on position information. The object metadata processor (110) is configured to feed the first position of the audio object as the position information into the object renderer (120), if the audio object is indicated in the metadata as being not screen-related. The object metadata processor (110) is configured to feed the second position of the audio object as the position information into the object renderer (120), if the audio object is indicated in the metadata as being screen-related.