APPARATUS AND METHOD FOR SOURCE SEPARATION USING AN ESTIMATION AND CONTROL OF SOUND QUALITY

    Publication No.: EP3671739A1

    Publication date: 2020-06-24

    Application No.: EP18215707.3

    Filing date: 2018-12-21

    Abstract: An apparatus for generating a separated audio signal from an audio input signal is provided. The audio input signal comprises a target audio signal portion and a residual audio signal portion. The residual audio signal portion indicates a residual between the audio input signal and the target audio signal portion. The apparatus comprises a source separator (110), a determining module (120) and a signal processor (130). The source separator (110) is configured to determine an estimated target signal which depends on the audio input signal, the estimated target signal being an estimate of a signal that only comprises the target audio signal portion. The determining module (120) is configured to determine one or more result values depending on an estimated sound quality of the estimated target signal to obtain one or more parameter values, wherein the one or more parameter values are the one or more result values or depend on the one or more result values. The signal processor (130) is configured to generate the separated audio signal depending on the one or more parameter values and depending on at least one of the estimated target signal and the audio input signal and an estimated residual signal, the estimated residual signal being an estimate of a signal that only comprises the residual audio signal portion.
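The interplay of the three modules can be illustrated with a minimal Python sketch. It is not the method claimed in the application: the linear mapping in `residual_gain`, the function names, and the assumption that the estimated sound quality arrives as a score in [0, 1] are all hypothetical, chosen only to show how a quality-dependent parameter could steer the final remix.

```python
def estimate_residual(mixture, est_target):
    """Estimated residual: what is left of the input after removing the target estimate."""
    return [x - s for x, s in zip(mixture, est_target)]


def residual_gain(quality):
    """Hypothetical control rule mapping a quality score in [0, 1] to a residual gain.

    quality = 1.0 -> gain 0.0 (output is the target estimate alone)
    quality = 0.0 -> gain 1.0 (output falls back to the unprocessed input)
    """
    q = min(max(quality, 0.0), 1.0)  # clamp out-of-range estimates
    return 1.0 - q


def generate_separated(mixture, est_target, quality):
    """Remix the target estimate with a quality-dependent share of the residual."""
    p = residual_gain(quality)
    residual = estimate_residual(mixture, est_target)
    return [s + p * r for s, r in zip(est_target, residual)]


mixture = [1.0, 2.0]
est_target = [0.5, 1.0]
print(generate_separated(mixture, est_target, quality=0.5))  # [0.75, 1.5]
```

In this sketch a low quality estimate adds more of the residual back, trading separation depth for fewer audible artifacts.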

    APPARATUS AND METHOD FOR ENHANCED SPATIAL AUDIO OBJECT CODING
    Type: Invention publication
    Status: Under examination (published)

    Publication No.: EP3025335A1

    Publication date: 2016-06-01

    Application No.: EP14747862.2

    Filing date: 2014-07-17

    IPC classification: G10L19/008 H04S3/00

    Abstract: An apparatus for generating one or more audio output channels is provided. The apparatus comprises a parameter processor (110) for calculating mixing information and a downmix processor (120) for generating the one or more audio output channels. The downmix processor (120) is configured to receive an audio transport signal comprising one or more audio transport channels. One or more audio channel signals and one or more audio object signals are mixed within the audio transport signal, wherein the number of the one or more audio transport channels is smaller than the number of the one or more audio channel signals plus the number of the one or more audio object signals. The parameter processor (110) is configured to receive downmix information indicating how the one or more audio channel signals and the one or more audio object signals are mixed within the one or more audio transport channels, and is further configured to receive covariance information. Moreover, the parameter processor (110) is configured to calculate the mixing information depending on the downmix information and depending on the covariance information. The downmix processor (120) is configured to generate the one or more audio output channels from the audio transport signal depending on the mixing information. The covariance information indicates level difference information for at least one of the one or more audio channel signals and further indicates level difference information for at least one of the one or more audio object signals. However, the covariance information does not indicate correlation information for any pair of one of the one or more audio channel signals and one of the one or more audio object signals.
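The structural constraint on the covariance information (no channel/object cross-correlation) can be sketched in Python as follows. The sketch makes several assumptions not stated in the abstract: a single transport channel, real-valued parameters, and the least-squares un-mixing rule g = E d / (dᵀ E d) that is commonly used in parametric object coding. All function names are hypothetical.

```python
def block_diag_cov(cov_channels, cov_objects):
    """Build a joint covariance matrix whose channel/object cross terms are zero."""
    nc, no = len(cov_channels), len(cov_objects)
    n = nc + no
    e = [[0.0] * n for _ in range(n)]
    for i in range(nc):
        for j in range(nc):
            e[i][j] = cov_channels[i][j]
    for i in range(no):
        for j in range(no):
            e[nc + i][nc + j] = cov_objects[i][j]
    return e


def unmix_gains_mono(e, d):
    """Per-signal gains for a mono transport channel: g = E d / (d^T E d).

    e: joint covariance matrix (channels first, then objects)
    d: downmix gains, one per input signal (assumed real-valued)
    """
    n = len(d)
    ed = [sum(e[i][j] * d[j] for j in range(n)) for i in range(n)]
    denom = sum(d[i] * ed[i] for i in range(n))
    if denom <= 0.0:
        raise ValueError("transport channel has no energy")
    return [v / denom for v in ed]


# One channel signal with power 4.0 and one object signal with power 1.0,
# both mixed with unit gain into a single transport channel.
e = block_diag_cov([[4.0]], [[1.0]])
print(unmix_gains_mono(e, [1.0, 1.0]))  # [0.8, 0.2]
```

Because the cross terms are fixed at zero, only the two diagonal blocks need to be transmitted, which is the saving the abstract's last sentence points to.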

    ENCODER, DECODER AND METHODS FOR SIGNAL-DEPENDENT ZOOM-TRANSFORM IN SPATIAL AUDIO OBJECT CODING
    Type: Invention publication
    Status: Under examination (published)

    Publication No.: EP2904610A1

    Publication date: 2015-08-12

    Application No.: EP13776987.3

    Filing date: 2013-10-02

    Abstract: A decoder for generating an audio output signal comprising one or more audio output channels from a downmix signal is provided. The downmix signal encodes one or more audio object signals. The decoder comprises a control unit (181) for setting an activation indication to an activation state depending on a signal property of at least one of the one or more audio object signals. Moreover, the decoder comprises a first analysis module (182) for transforming the downmix signal to obtain a first transformed downmix comprising a plurality of first subband channels. Furthermore, the decoder comprises a second analysis module (183) for generating, when the activation indication is set to the activation state, a second transformed downmix by transforming at least one of the first subband channels to obtain a plurality of second subband channels, wherein the second transformed downmix comprises the first subband channels which have not been transformed by the second analysis module and the second subband channels. Moreover, the decoder comprises an un-mixing unit (184), wherein the un-mixing unit (184) is configured to un-mix the second transformed downmix, when the activation indication is set to the activation state, based on parametric side information on the one or more audio object signals to obtain the audio output signal, and to un-mix the first transformed downmix, when the activation indication is not set to the activation state, based on the parametric side information on the one or more audio object signals to obtain the audio output signal. Furthermore, an encoder is provided.
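The conditional second analysis stage can be illustrated with a small Python sketch. It assumes the first subband channels are already available as lists of samples, and it uses a non-overlapping length-k DFT over time as the "zoom" transform. Both choices, and all function names, are hypothetical simplifications and are not taken from the application.

```python
import cmath


def zoom_subband(subband, k):
    """Split one subband signal into k finer subbands with a length-k DFT over time.

    Non-overlapping blocks are an assumption made for brevity.
    """
    blocks = [subband[i:i + k] for i in range(0, len(subband) - k + 1, k)]
    finer = []
    for b in range(k):
        twiddle = [cmath.exp(-2j * cmath.pi * b * n / k) for n in range(k)]
        finer.append([sum(x * w for x, w in zip(block, twiddle)) for block in blocks])
    return finer


def second_analysis(first_subbands, zoom_indices, k, active):
    """Return the transformed downmix that the un-mixing stage should use.

    active=False: the first transformed downmix is passed through unchanged.
    active=True:  subbands listed in zoom_indices are replaced by their k finer
                  subbands; all other first subbands are kept as they are.
    """
    if not active:
        return list(first_subbands)
    out = []
    for idx, subband in enumerate(first_subbands):
        if idx in zoom_indices:
            out.extend(zoom_subband(subband, k))
        else:
            out.append(subband)
    return out


first = [[1, 1, 1, 1], [1, -1, 1, -1]]
second = second_analysis(first, zoom_indices={1}, k=2, active=True)
print(len(second))  # 3
```

Note that the zoomed subbands trade time resolution for frequency resolution: each of them holds len(subband) // k values per original subband signal.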

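    Abstract (translated from Chinese): A decoder for generating an audio output signal comprising one or more audio output channels from a downmix signal comprising a plurality of time-domain downmix samples is provided. The downmix signal encodes two or more audio object signals. The decoder comprises a window-sequence generator (134) for determining a plurality of analysis windows, wherein each of the analysis windows comprises a plurality of time-domain downmix samples of the downmix signal. Each analysis window of the plurality of analysis windows has a window length indicating the number of time-domain downmix samples of said analysis window. The window-sequence generator (134) is configured to determine the plurality of analysis windows such that the window length of each analysis window depends on a signal property of at least one of the two or more audio object signals. Moreover, the decoder comprises a t/f-analysis module (135) for transforming the plurality of time-domain downmix samples of each analysis window of the plurality of analysis windows from the time domain to a time-frequency domain depending on the window length of said analysis window, to obtain a transformed downmix. Furthermore, the decoder comprises an un-mixing unit (136) for un-mixing the transformed downmix based on parametric side information on the two or more audio object signals to obtain the audio output signal. Furthermore, an encoder is provided.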
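A signal-dependent window sequence of the kind described above could look like the following Python sketch. The transient test (energy ratio between the two halves of a block), the default lengths, the non-overlapping windows, and the function names are all assumptions made for illustration. The abstract only states that the window length depends on a signal property of the object signals.

```python
def block_energy(samples):
    """Sum of squared samples."""
    return sum(x * x for x in samples)


def choose_windows(object_signal, long_len=8, short_len=2, ratio=4.0):
    """Return (start, length) pairs covering the signal.

    A block whose two halves differ in energy by more than `ratio` is treated
    as transient and covered by short windows; otherwise one long window is used.
    Assumes long_len is a multiple of short_len.
    """
    eps = 1e-12  # avoids division by zero on silent blocks
    windows = []
    for start in range(0, len(object_signal), long_len):
        block = object_signal[start:start + long_len]
        if len(block) < long_len:
            windows.append((start, len(block)))  # trailing partial block
            continue
        half = long_len // 2
        e1 = block_energy(block[:half]) + eps
        e2 = block_energy(block[half:]) + eps
        if max(e1, e2) / min(e1, e2) > ratio:
            windows.extend((start + off, short_len)
                           for off in range(0, long_len, short_len))
        else:
            windows.append((start, long_len))
    return windows


signal = [0.1] * 12 + [1.0] * 4  # steady start, sudden onset in the second block
print(choose_windows(signal))
# [(0, 8), (8, 2), (10, 2), (12, 2), (14, 2)]
```

Each (start, length) pair would then be handed to the time-frequency transform, which sizes its transform to the window length.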