Audio object separation from mixture signal using object-specific time/frequency resolutions
    3.
    发明公开
    Audio object separation from mixture signal using object-specific time/frequency resolutions 审中-公开
    Trennung von Audio-Objekt aus einem Mischsignal mit objektspezifischen Zeit- undFrequenzauflösungen

    公开(公告)号:EP2804176A1

    公开(公告)日:2014-11-19

    申请号:EP13167484.8

    申请日:2013-05-13

    摘要: An audio decoder is proposed for decoding a multi-object audio signal consisting of a downmix signal X and side information PSI. The side information comprises object-specific side information PSI i for an audio object s i in a time/frequency region R(t R ,f R ), and object-specific time/frequency resolution information TFRI i indicative of an object-specific time/frequency resolution TFR h of the object-specific side information for the audio object s i in the time/frequency region R(t R ,f R ). The audio decoder comprises an object-specific time/frequency resolution determiner 110 configured to determine the object-specific time/frequency resolution information TFRI i from the side information PSI for the audio object s i . The audio decoder further comprises an object separator 120 configured to separate the audio object s i from the downmix signal X using the object-specific side information in accordance with the object-specific time/frequency resolution TFRI i . A corresponding encoder and corresponding methods for decoding or encoding are also described.

    摘要翻译: 提出了一种音频解码器,用于对由下混信号X和侧信息PSI组成的多对象音频信号进行解码。 侧面信息包括用于时间/频率区域R(t R,f R)中的音频对象si的对象特定侧信息PSI i以及指示对象特定时间/频率区域的对象特定时间/频率分辨率信息TFRI i, 在时间/频率区域R(t R,f R)中的音频对象si的对象特定侧信息的频率分辨率TFR h。 音频解码器包括对象特定的时间/频率分辨率确定器110,其被配置为从音频对象s i的侧信息PSI确定对象特定的时间/频率分辨率信息TFRI i。 音频解码器还包括对象分离器120,其被配置为根据对象特定时间/频率分辨率TFRI i,使用对象特定侧信息将音频对象s与降混信号X分离。 还描述了相应的编码器和相应的解码或编码方法。

    Encoder, decoder and methods for signal-dependent zoom-transform in spatial audio object coding
    5.
    发明公开
    Encoder, decoder and methods for signal-dependent zoom-transform in spatial audio object coding 审中-公开
    Codierer,Decodierer und VerfahrenfürsignalabhängigeZoomumwandlung beim Spatial-Audio-Object-Coding

    公开(公告)号:EP2717262A1

    公开(公告)日:2014-04-09

    申请号:EP13167487.1

    申请日:2013-05-13

    摘要: A decoder for generating an audio output signal comprising one or more audio output channels from a downmix signal is provided. The downmix signal encodes one or more audio object signals. The decoder comprises a control unit (181) for setting an activation indication to an activation state depending on a signal property of at least one of the one or more audio object signals. Moreover, the decoder comprises a first analysis module (182) for transforming the downmix signal to obtain a first transformed downmix comprising a plurality of first subband channels. Furthermore, the decoder comprises a second analysis module (183) for generating, when the activation indication is set to the activation state, a second transformed downmix by transforming at least one of the first subband channels to obtain a plurality of second subband channels, wherein the second transformed downmix comprises the first subband channels which have not been transformed by the second analysis module and the second subband channels. Moreover, the decoder comprises an un-mixing unit (184), wherein the un-mixing unit (184) is configured to un-mix the second transformed downmix, when the activation indication is set to the activation state, based on parametric side information on the one or more audio object signals to obtain the audio output signal, and to un-mix the first transformed downmix, when the activation indication is not set to the activation state, based on the parametric side information on the one or more audio object signals to obtain the audio output signal. Furthermore, an encoder is provided.

    摘要翻译: 提供了一种用于从降混信号产生包括一个或多个音频输出声道的音频输出信号的解码器。 降混信号对一个或多个音频对象信号进行编码。 解码器包括用于根据一个或多个音频对象信号中的至少一个的信号属性将激活指示设置为激活状态的控制单元(181)。 此外,解码器包括用于变换下混合信号以获得包括多个第一子带信道的第一变换下混合的第一分析模块(182)。 此外,解码器包括第二分析模块(183),用于当激活指示被设置为激活状态时,通过转换第一子带信道中的至少一个以获得多个第二子带信道来产生第二变换下混合,其中 第二变换下混合包括尚未被第二分析模块和第二子带信道变换的第一子带信道。 此外,解码器包括解混合单元(184),其中,当激活指示被设置为激活状态时,解混合单元(184)被配置为基于参数侧信息来解混合第二变换下混合 在一个或多个音频对象信号上获得音频输出信号,并且当激活指示未被设置为激活状态时,基于关于一个或多个音频对象的参数侧信息来解混合第一变换下混合 信号以获得音频输出信号。 此外,提供了一种编码器。

    APPARATUS AND METHOD FOR DETERMINING A MEASURE FOR A PERCEIVED LEVEL OF REVERBERATION, AUDIO PROCESSOR AND METHOD FOR PROCESSING A SIGNAL
    6.
    发明公开
    APPARATUS AND METHOD FOR DETERMINING A MEASURE FOR A PERCEIVED LEVEL OF REVERBERATION, AUDIO PROCESSOR AND METHOD FOR PROCESSING A SIGNAL 审中-公开
    装置和方法,用于确定用于处理信号的感知混响电平,音频处理器和方法的幅度值

    公开(公告)号:EP2541542A1

    公开(公告)日:2013-01-02

    申请号:EP11171488.7

    申请日:2011-06-27

    IPC分类号: G10K15/12 H04S5/00

    摘要: An apparatus for determining a measure for a perceived level of reverberation in a mix signal consisting of a direct signal component (100) and a reverberation signal component (102), comprises a loudness model processor (104) comprising a perceptual filter stage for filtering the dry signal component (100) the reverberation signal component (102) or the mix signal, wherein the perceptual filter stage is configured for modeling an auditory perception mechanism of an entity to obtain a filtered direct signal, a filtered reverberation signal or a filtered mix signal. The apparatus furthermore comprises a loudness estimator for estimating a first loudness measure using the filtered direct signal and for estimating a second loudness measure using the filtered reverberation signal or the filtered mix signal, where the filtered mix signal is derived from a superposition of the direct signal component and the reverberation signal component. The apparatus furthermore comprises a combiner (110) for combining the first and the second loudness measures (106, 108) to obtain a measure (112) for the perceived level of reverberation.

    摘要翻译: 确定性采矿一种用于混响的混合信号由直射信号分量(100)和混响信号分量(102)的感知水平的量度包括响度模型处理器(104),其包括感知滤波级,用于滤波 干信号分量(100),该混响信号分量(102)或所述混合信号,worin感知滤波器级被配置用于一个实体的听觉感知机制,以获得经滤波的直接信号,经滤波的混响信号或经滤波的混合信号建模 , 所述装置还更包括响度估计器,用于利用滤波直接信号估计第一响度度量和使用该过滤混响信号或经滤波的混合信号,其中经滤波的混合信号被从直接信号的叠加衍生估计第二响度度量 分量和混响信号分量。 所述装置还包括多个用于将所述第一和第二响度测量(106,108)以获得用于混响的感知水平的度量(112)的组合器(110)。

    Encoder, decoder and methods for backward compatible multi-resolution spatial-audio-object-coding
    8.
    发明公开
    Encoder, decoder and methods for backward compatible multi-resolution spatial-audio-object-coding 审中-公开
    编码器,解码器和方法,用于向后兼容的空间音频对象编码多分辨率

    公开(公告)号:EP2717261A1

    公开(公告)日:2014-04-09

    申请号:EP13167485.5

    申请日:2013-05-13

    IPC分类号: G10L19/008 G10L19/02

    CPC分类号: G10L19/008 G10L19/02

    摘要: A decoder for generating an un-mixed audio signal comprising a plurality of un-mixed audio channels is provided. Moreover, an encoder and an encoded audio signal is provided. The decoder comprises an un-mixing-information determiner for determining un-mixing information by receiving first parametric side information on the at least one audio object signal and second parametric side information on the at least one audio object signal, wherein the frequency resolution of the second parametric side information is higher than the frequency resolution of the first parametric side information. Moreover, the decoder comprises an un-mix module for applying the un-mixing information on a downmix signal, indicating a downmix of at least one audio object signal, to obtain an un-mixed audio signal comprising the plurality of un-mixed audio channels. The un-mixing-information determiner is configured to determine the un-mixing information by modifying the first parametric information and the second parametric information to obtain modified parametric information, such that the modified parametric information has a frequency resolution which is higher than the first frequency resolution.

    摘要翻译: 提供了一种用于在非混合音频信号产生包括未混合音频信道与多个解码器。 更完了,在编码器和编码的音频信号被提供。 通过接收关于在所述至少一个音频对象信号的所述至少一个音频对象信号,并且第二参数侧信息的第一参数侧信息,worin的频率分辨率的解码器,用于确定开采未混合信息未混合-信息确定包括 第二参数侧信息比的第一参量侧信息的频率分辨率越高。 更上方,所述解码器包括到未混合组件用于施加上的下混信号的未混合的信息,表示至少一个音频对象信号的下混合,以获得包含未混合的音频信道的所述多个未混合音频信号 , 未混合信息确定器被配置为确定矿的未混合通过修改第一参数信息和第二参数的信息,以获得修改的参数信息,检查做了修改的参数信息的频率分辨率的所有比所述第一频率高 分辨率。

    Decoder, encoder and method for informed loudness estimation in object-based audio coding systems
    9.
    发明公开
    Decoder, encoder and method for informed loudness estimation in object-based audio coding systems 审中-公开
    Dekodierer,Kodierer und VerfahrenfürinformierteLautstärkenschätzungin objektbasierten Audiocodierungssystemen

    公开(公告)号:EP2879131A1

    公开(公告)日:2015-06-03

    申请号:EP13194664.2

    申请日:2013-11-27

    IPC分类号: G10L19/008

    摘要: A decoder for generating an audio output signal comprising one or more audio output channels is provided. The decoder comprises a receiving interface (110) for receiving an audio input signal comprising a plurality of audio object signals, for receiving loudness information on the audio object signals, and for receiving rendering information indicating whether one or more of the audio object signals shall be amplified or attenuated. Moreover, the decoder comprises a signal processor (120) for generating the one or more audio output channels of the audio output signal. The signal processor (120) is configured to determine a loudness compensation value depending on the loudness information and depending on the rendering information. Furthermore, the signal processor (120) is configured to generate the one or more audio output channels of the audio output signal from the audio input signal depending on the rendering information and depending on the loudness compensation value. Moreover, an encoder is provided.

    摘要翻译: 提供了一种用于产生包括一个或多个音频输出通道的音频输出信号的解码器。 解码器包括接收接口(110),用于接收包括多个音频对象信号的音频输入信号,用于接收关于音频对象信号的响度信息,并且用于接收指示音频对象信号中的一个或多个是否为 放大或减弱。 此外,解码器包括用于产生音频输出信号的一个或多个音频输出声道的信号处理器(120)。 信号处理器(120)被配置为根据响度信息和取决于渲染信息来确定响度补偿值。 此外,信号处理器(120)被配置为根据呈现信息并根据响度补偿值从音频输入信号产生音频输出信号的一个或多个音频输出声道。 此外,提供了一种编码器。

    Apparatus and method for enhanced spatial audio object coding
    10.
    发明公开
    Apparatus and method for enhanced spatial audio object coding 审中-公开
    Vorrichtung und Verfahren zur verb desserten Codierung einesräumlichenAudioobjekts

    公开(公告)号:EP2830050A1

    公开(公告)日:2015-01-28

    申请号:EP13189290.3

    申请日:2013-10-18

    IPC分类号: G10L19/008 H04S3/00

    摘要: An apparatus for generating one or more audio output channels is provided. The apparatus comprises a parameter processor (110) for calculating mixing information and a downmix processor (120) for generating the one or more audio output channels. The downmix processor (120) is configured to receive an audio transport signal comprising one or more audio transport channels. One or more audio channel signals are mixed within the audio transport signal, and one or more audio object signals are mixed within the transport signal, and wherein the number of the one or more audio transport channels is smaller than the number of the one or more audio channel signals plus the number of the one or more audio object signals. The parameter processor (110) is configured to receive downmix information indicating information on how the one or more audio channel signals and the one or more audio object signals are mixed within the one or more audio transport channels, and wherein the parameter processor (110) is configured to receive covariance information. Moreover, the parameter processor (110) is configured to calculate the mixing information depending on the downmix information and depending on the covariance information. The downmix processor (120) is configured to generate the one or more audio output channels from the audio transport signal depending on the mixing information. The information indicates a level difference information for at least one of the one or more audio channel signals and further indicates a level difference information for at least one of the one or more audio object signals. However, the covariance information does not indicate correlation information for any pair of one of the one or more audio channel signals and one of the one or more audio object signals.

    摘要翻译: 提供了一种用于产生一个或多个音频输出通道的装置。 该装置包括用于计算混合信息的参数处理器(110)和用于生成一个或多个音频输出通道的下混处理器(120)。 下混合处理器(120)被配置为接收包括一个或多个音频传输信道的音频传输信号。 一个或多个音频信道信号在音频传输信号内混合,并且一个或多个音频对象信号在传输信号内混合,并且其中一个或多个音频传输信道的数量小于一个或多个 音频通道信号加上一个或多个音频对象信号的数量。 参数处理器(110)被配置为接收指示关于一个或多个音频信道信号和一个或多个音频对象信号如何在一个或多个音频传输信道内混合的信息的下混信息,并且其中参数处理器(110) 被配置为接收协方差信息。 此外,参数处理器(110)被配置为根据缩混信息并根据协方差信息来计算混合信息。 下混合处理器(120)被配置为根据混合信息从音频传输信号生成一个或多个音频输出信道。 该信息指示一个或多个音频信道信号中的至少一个音频信道信号的电平差信息,并进一步指示一个或多个音频对象信号中的至少一个音频对象信号的电平差信息。 然而,协方差信息并不表示一个或多个音频信道信号中的任何一个与一个或多个音频对象信号之一的相关信息。