Apparatus and method for enhanced spatial audio object coding
    61.
    发明公开
    Apparatus and method for enhanced spatial audio object coding 审中-公开
    Vorrichtung und Verfahren zur verb desserten Codierung einesräumlichenAudioobjekts

    公开(公告)号:EP2830050A1

    公开(公告)日:2015-01-28

    申请号:EP13189290.3

    申请日:2013-10-18

    IPC分类号: G10L19/008 H04S3/00

    摘要: An apparatus for generating one or more audio output channels is provided. The apparatus comprises a parameter processor (110) for calculating mixing information and a downmix processor (120) for generating the one or more audio output channels. The downmix processor (120) is configured to receive an audio transport signal comprising one or more audio transport channels. One or more audio channel signals are mixed within the audio transport signal, and one or more audio object signals are mixed within the transport signal, and wherein the number of the one or more audio transport channels is smaller than the number of the one or more audio channel signals plus the number of the one or more audio object signals. The parameter processor (110) is configured to receive downmix information indicating information on how the one or more audio channel signals and the one or more audio object signals are mixed within the one or more audio transport channels, and wherein the parameter processor (110) is configured to receive covariance information. Moreover, the parameter processor (110) is configured to calculate the mixing information depending on the downmix information and depending on the covariance information. The downmix processor (120) is configured to generate the one or more audio output channels from the audio transport signal depending on the mixing information. The information indicates a level difference information for at least one of the one or more audio channel signals and further indicates a level difference information for at least one of the one or more audio object signals. However, the covariance information does not indicate correlation information for any pair of one of the one or more audio channel signals and one of the one or more audio object signals.

    摘要翻译: 提供了一种用于产生一个或多个音频输出通道的装置。 该装置包括用于计算混合信息的参数处理器(110)和用于生成一个或多个音频输出通道的下混处理器(120)。 下混合处理器(120)被配置为接收包括一个或多个音频传输信道的音频传输信号。 一个或多个音频信道信号在音频传输信号内混合,并且一个或多个音频对象信号在传输信号内混合,并且其中一个或多个音频传输信道的数量小于一个或多个 音频通道信号加上一个或多个音频对象信号的数量。 参数处理器(110)被配置为接收指示关于一个或多个音频信道信号和一个或多个音频对象信号如何在一个或多个音频传输信道内混合的信息的下混信息,并且其中参数处理器(110) 被配置为接收协方差信息。 此外,参数处理器(110)被配置为根据缩混信息并根据协方差信息来计算混合信息。 下混合处理器(120)被配置为根据混合信息从音频传输信号生成一个或多个音频输出信道。 该信息指示一个或多个音频信道信号中的至少一个音频信道信号的电平差信息,并进一步指示一个或多个音频对象信号中的至少一个音频对象信号的电平差信息。 然而,协方差信息并不表示一个或多个音频信道信号中的任何一个与一个或多个音频对象信号之一的相关信息。

    Apparatus and method for realizing a SAOC downmix of 3D audio content
    62.
    发明公开
    Apparatus and method for realizing a SAOC downmix of 3D audio content 审中-公开
    Vorrichtung und Verfahren zur Realisierung eines SAOC-Downmix von 3D-Audioinhalt

    公开(公告)号:EP2830048A1

    公开(公告)日:2015-01-28

    申请号:EP13189281.2

    申请日:2013-10-18

    IPC分类号: G10L19/008 H04S3/00

    摘要: An apparatus for generating one or more audio output channels is provided. The apparatus comprises a parameter processor (110) for calculating output channel mixing information and a downmix processor (120) for generating the one or more audio output channels. The downmix processor (120) is configured to receive an audio transport signal comprising one or more audio transport channels, wherein two or more audio object signals are mixed within the audio transport signal, and wherein the number of the one or more audio transport channels is smaller than the number of the two or more audio object signals. The audio transport signal depends on a first mixing rule and on a second mixing rule. The first mixing rule indicates how to mix the two or more audio object signals to obtain a plurality of premixed channels. Moreover, the second mixing rule indicates how to mix the plurality of premixed channels to obtain the one or more audio transport channels of the audio transport signal. The parameter processor (110) is configured to receive information on the second mixing rule, wherein the information on the second mixing rule indicates how to mix the plurality of premixed signals such that the one or more audio transport channels are obtained. Moreover, the parameter processor (110) is configured to calculate the output channel mixing information depending on an audio objects number indicating the number of the two or more audio object signals, depending on a premixed channels number indicating the number of the plurality of premixed channels, and depending on the information on the second mixing rule. The downmix processor (120) is configured to generate the one or more audio output channels from the audio transport signal depending on the output channel mixing information.

    摘要翻译: 提供了一种用于产生一个或多个音频输出通道的装置。 该装置包括用于计算输出通道混合信息的参数处理器(110)和用于产生一个或多个音频输出通道的下混处理器(120)。 下混合处理器(120)被配置为接收包括一个或多个音频传输信道的音频传输信号,其中在音频传输信号内混合两个或多个音频对象信号,并且其中一个或多个音频传输信道的数量为 小于两个或更多个音频对象信号的数量。 音频传输信号取决于第一混合规则和第二混合规则。 第一混合规则指示如何混合两个或多个音频对象信号以获得多个预混频道。 此外,第二混合规则指示如何混合多个预混频道以获得音频传输信号的一个或多个音频传输信道。 参数处理器(110)被配置为接收关于第二混合规则的信息,其中关于第二混合规则的信息指示如何混合多个预混合信号,使得获得一个或多个音频传输信道。 此外,参数处理器(110)被配置为根据指示多个预混频道的数量的预设频道号码,根据指示两个或多个音频对象信号的数量的音频对象号来计算输出频道混合信息 ,并且取决于关于第二混合规则的信息。 下混合处理器(120)被配置为根据输出信道混合信息从音频传输信号生成一个或多个音频输出信道。

    Concept for audio encoding and decoding for audio channels and audio objects
    63.
    发明公开
    Concept for audio encoding and decoding for audio channels and audio objects 审中-公开
    Konzept zur Audiocodierung und AudiodecodierungfürAudiokanäleund Audioobjekte

    公开(公告)号:EP2830045A1

    公开(公告)日:2015-01-28

    申请号:EP13177378.0

    申请日:2013-07-22

    IPC分类号: G10L19/008

    摘要: Audio encoder for encoding audio input data (101) to obtain audio output data (501) comprises an input interface (100) for receiving a plurality of audio channels, a plurality of audio objects and metadata related to one or more of the plurality of audio objects; a mixer (200) for mixing the plurality of objects and the plurality of channels to obtain a plurality of pre-mixed channels, each pre-mixed channel comprising audio data of a channel and audio data of at least one object; a core encoder (300) for core encoding core encoder input data; and a metadata compressor (400) for compressing the metadata related to the one or more of the plurality of audio objects, wherein the audio encoder is configured to operate in at least one mode of the group of two modes comprising a first mode, in which the core encoder is configured to encode the plurality of audio channels and the plurality of audio objects received by the input interface as core encoder input data, and a second mode, in which the core encoder (300) is configured for receiving, as the core encoder input data, the plurality of pre-mixed channels generated by the mixer (200).

    摘要翻译: 用于编码音频输入数据(101)以获得音频输出数据(501)的音频编码器包括用于接收多个音频通道的输入接口(100),与多个音频中的一个或多个音频相关的多个音频对象和元数据 对象; 混合器(200),用于混合多个对象和多个通道以获得多个预混合通道,每个预混合通道包括通道的音频数据和至少一个对象的音频数据; 核心编码器(300),用于核心编码核心编码器输入数据; 以及用于压缩与所述多个音频对象中的一个或多个音频对象有关的元数据的元数据压缩器(400),其中所述音频编码器被配置为在包括第一模式的两种模式的组的至少一种模式中操作,其中 核心编码器被配置为将由输入接口接收的多个音频频道和多个音频对象编码为核心编码器输入数据,以及第二模式,其中核心编码器(300)被配置为接收作为核心 编码器输入数据,由混合器(200)产生的多个预混频道。

    Encoder, decoder and methods for backward compatible dynamic adaption of time/frequency resolution in spatial-audio-object-coding
    64.
    发明公开
    Encoder, decoder and methods for backward compatible dynamic adaption of time/frequency resolution in spatial-audio-object-coding 审中-公开
    编码器,解码器和在空间音频对象编码为时间/频率分辨率的向后兼容的动态调整的方法

    公开(公告)号:EP2717265A1

    公开(公告)日:2014-04-09

    申请号:EP13167481.4

    申请日:2013-05-13

    IPC分类号: G10L19/025 G10L19/008

    摘要: A decoder for generating an audio output signal comprising one or more audio output channels from a downmix signal comprising a plurality of time-domain downmix samples is provided. The downmix signal encodes two or more audio object signals. The decoder comprises a window-sequence generator (134) for determining a plurality of analysis windows, wherein each of the analysis windows comprises a plurality of time-domain downmix samples of the downmix signal. Each analysis window of the plurality of analysis windows has a window length indicating the number of the time-domain downmix samples of said analysis window. The window-sequence generator (134) is configured to determine the plurality of analysis windows so that the window length of each of the analysis windows depends on a signal property of at least one of the two or more audio object signals. Moreover, the decoder comprises a t/f-analysis module (135) for transforming the plurality of time-domain downmix samples of each analysis window of the plurality of analysis windows from a time-domain to a time-frequency domain depending on the window length of said analysis window, to obtain a transformed downmix. Furthermore, the decoder comprises an un-mixing unit (136) for un-mixing the transformed downmix based on parametric side information on the two or more audio object signals to obtain the audio output signal. Moreover, an encoder is provided.

    摘要翻译: 提供了一种用于在包括一个或从下混信号包括时域混样品的多元性多个音频输出声道生成音频输出信号的解码器。 缩混信号编码两个或多个音频对象信号。 解码器用于确定性采矿包括窗口序列产生器(134)的分析窗口复数,worin每个分析窗口的包括缩混信号的时域混样品的多元性。 的分析窗,所述多个每个分析窗口具有窗口长度指示所述分析窗口的时域混的样本的数目。 窗口序列产生器(134)被配置为确定矿分析窗口多元性所以没有每个分析窗口的窗口长度取决于两个或更多个音频对象信号中的至少一个的信号特性。 更上方,所述解码器包括在/ F-分析模块(135),用于将的分析窗口,所述多个每个分析窗口的时域混样品的多元性从时域变换到时频域取决于窗口长度 所述分析窗口,以获得转化的缩混。 进一步,对于未混合单元(136)的解码器包括未混合的转化缩混基于关于两个或更多个音频对象信号,以获得所述音频输出信号参数侧信息。 更过,在编码器提供。

    Apparatus, method and computer program for deriving a multi-channel audio signal from an audio signal
    65.
    发明授权
    Apparatus, method and computer program for deriving a multi-channel audio signal from an audio signal 有权
    的装置,方法和计算机程序用于从音频信号中导出多声道音频信号

    公开(公告)号:EP2500900B1

    公开(公告)日:2014-04-02

    申请号:EP12168768.5

    申请日:2007-10-23

    IPC分类号: G10L19/008 G10L19/02 H04S5/00

    摘要: An apparatus for deriving a multi-channel audio signal comprising a front-loudspeaker signal and a back-loudspeaker signal from an audio signal, the apparatus comprising an apparatus for generating an ambient signal from the audio signal, wherein the apparatus for generating the ambient signal from the audio signal comprises means for a lossy compression of a representation of the audio signal so as to obtain a compressed representation of the audio signal; and means for calculating a difference between the compressed representation of the audio signal and the representation of the audio signal so as to obtain a discrimination representation, the discrimination representation describing the difference between the representation of the audio signal and the compressed representation of the audio signal and describing those portions of the audio signal not played back in the lossily compressed representation, and wherein the means for lossy compression is configured such that signal portions exhibiting regular distribution of the energy or carrying a large signal energy are preferred to be included in the compressed representation; wherein the discrimination representation forms the ambient signal; an apparatus for providing the audio signal or a signal derived therefrom as the front-loudspeaker signal; and a back-loudspeaker-signal-providing apparatus for providing the ambient signal provided by the apparatus for generating the ambient signal or a signal derived therefrom as the back-loudspeaker signal. An apparatus for generating an ambient signal from an audio signal comprises means for lossy compression of a representation of the audio signal so as to obtain a compressed representation of the audio signal describing a compressed audio signal. The apparatus for generating the ambient signal further comprises means for calculating a difference between the compressed representation of the audio signal and the representation of the audio signal so as to obtain a discrimination representation. The apparatus further comprises means for providing the ambient signal using the discrimination representation.

    Apparatus and method for improving the perceived quality of sound reproduction by combining active noise cancellation and perceptual noise compensation
    66.
    发明公开
    Apparatus and method for improving the perceived quality of sound reproduction by combining active noise cancellation and perceptual noise compensation 审中-公开
    装置和方法,用于通过主动降噪和噪声补偿的感知组合提高感知Tonqualitätswiedergabe

    公开(公告)号:EP2645362A1

    公开(公告)日:2013-10-02

    申请号:EP12169608.2

    申请日:2012-05-25

    IPC分类号: G10K11/178

    摘要: An apparatus for improving a perceived quality of sound reproduction of an audio output signal is provided. The apparatus comprises an active noise cancellation unit (110) for generating a noise cancellation signal based on an environmental audio signal, wherein the environmental audio signal comprises noise signal portions, the noise signal portions resulting from recording environmental noise. Moreover, the apparatus comprises a residual noise characteristics estimator (120) for determining a residual noise characteristic depending on the environmental noise and the noise cancellation signal. Furthermore, the apparatus comprises a perceptual noise compensation unit (130) for generating a noise-compensated signal based on an audio target signal and based on the residual noise characteristic. Moreover, the apparatus comprises a combiner (140) for combining the noise cancellation signal and the noise-compensated signal to obtain the audio output signal.

    摘要翻译: 提供了一种用于改进的音频输出信号的声音再现的感知质量的装置。 有源噪声消除单元(110),用于产生在环境音频信号基于噪声消除信号,worin环境音频信号的设备包括:包括噪声信号部分,所述噪声信号部分从记录环境噪声引起。 更上方,该装置包括一个残留噪声特性估计器(120),用于确定的采矿残余噪声特性根据环境噪声和所述噪声消除信号。 进一步,该装置包括一个感知噪声补偿单元(130),用于基于音频对象的信号,基于所述残留噪声特性的噪声补偿的信号。 更上方,该装置包括用于将所述噪声消除信号和所述噪声补偿的信号,以获得所述音频输出信号的组合器(140)。

    Phase coherence control for harmonic signals in perceptual audio codecs
    67.
    发明公开
    Phase coherence control for harmonic signals in perceptual audio codecs 审中-公开
    Phasenkoherenzsteuerungfürharmonische信号在hörbaren音频编解码器

    公开(公告)号:EP2631906A1

    公开(公告)日:2013-08-28

    申请号:EP12178265.0

    申请日:2012-07-27

    摘要: A decoder for decoding an encoded audio signal to obtain a phase-adjusted audio signal is provided. The decoder comprises a decoding unit (110) and a phase adjustment unit (120). The decoding unit (110) is adapted to decode the encoded audio signal to obtain a decoded audio signal. The phase adjustment unit (120) is adapted to adjust the decoded audio signal to obtain the phase-adjusted audio signal. The phase adjustment unit (120) is configured to receive control information depending on a vertical phase coherence of the encoded audio signal. Moreover, the phase adjustment unit (120) is adapted to adjust the decoded audio signal based on the control information.

    摘要翻译: 提供了一种用于解码编码音频信号以获得相位调整音频信号的解码器。 解码器包括解码单元(110)和相位调整单元(120)。 解码单元(110)适于对编码的音频信号进行解码以获得解码的音频信号。 相位调整单元(120)适于调整解码的音频信号以获得相位调整的音频信号。 相位调整单元(120)被配置为根据编码的音频信号的垂直相位相干接收控制信息。 此外,相位调整单元120适于基于控制信息调整解码音频信号。

    Apparatus and method for merging geometry - based spatial audio coding streams
    68.
    发明公开
    Apparatus and method for merging geometry - based spatial audio coding streams 审中-公开
    Vorrichtung und Verfahren zum Mischen von Raumtoncodierungsstreams auf Geometriebasis

    公开(公告)号:EP2600343A1

    公开(公告)日:2013-06-05

    申请号:EP11191816.5

    申请日:2011-12-02

    IPC分类号: G10L19/00

    摘要: An apparatus for generating a merged audio data stream is provided. The apparatus comprises a demultiplexer (180) for obtaining a plurality of single-layer audio data streams, wherein the demultiplexer (180) is adapted to receive one or more input audio data streams, wherein each input audio data stream comprises one or more layers, wherein the demultiplexer (180) is adapted to demultiplex each one of the input audio data streams having one or more layers into two or more demultiplexed audio data streams having exactly one layer, such that the two or more demultiplexed audio data streams together comprise the one or more layers of the input audio data stream. Furthermore, the apparatus comprises a merging module (190) for generating the merged audio data stream, having one or more layers, based on the plurality of single-layer audio data streams. Each layer of the input data audio streams, of the demultiplexed audio data streams, of the single-layer data streams and of the merged audio data stream comprises a pressure value of a pressure signal, a position value and a diffuseness value as audio data.

    摘要翻译: 提供一种用于产生合并的音频数据流的装置。 该装置包括用于获得多个单层音频数据流的解复用器(180),其中解复用器(180)适于接收一个或多个输入音频数据流,其中每个输入音频数据流包括一个或多个层, 其中解复用器(180)适于将具有一个或多个层的输入音频数据流中的每一个解复用为具有正好一个层的两个或更多个解复用的音频数据流,使得两个或更多个解复用的音频数据流一起构成一个 或更多层的输入音频数据流。 此外,该装置包括用于基于多个单层音频数据流来生成具有一个或多个层的合并音频数据流的合并模块(190)。 单层数据流和合并音频数据流的解复用音频数据流的输入数据音频流的每一层包括作为音频数据的压力信号,位置值和扩散度值的压力值。

    Semantic audio track mixer
    69.
    发明公开
    Semantic audio track mixer 审中-公开
    Vorrichtung zum Mischen von Audiospuren

    公开(公告)号:EP2485213A1

    公开(公告)日:2012-08-08

    申请号:EP11153211.5

    申请日:2011-02-03

    CPC分类号: H04R3/00 G10L15/22 H04H60/04

    摘要: An audio mixer for mixing a plurality of audio tracks to a mixture signal comprises a semantic command interpreter (30; 35) for receiving a semantic mixing command and for deriving a plurality of mixing parameters for the plurality of audio tracks from the semantic mixing command; an audio track processor (70; 75) for processing the plurality of audio tracks in accordance with the plurality of mixing parameters; and an audio track combiner (76) for combining the plurality of audio tracks processed by the audio track processor into the mixture signal (MS). A corresponding method comprises: receiving a semantic mixing command; deriving a plurality of mixing parameters for the plurality of audio tracks from the semantic mixing command; processing the plurality of audio tracks in accordance with the plurality of mixing parameters; and combining the plurality of audio tracks resulting from the processing of the plurality of audio tracks to form the mixture signal.

    摘要翻译: 用于将多个音频轨道混合到混合信号的音频混合器包括语义命令解释器(30; 35),用于接收语义混合命令并从语义混合命令中导出多个音频轨道的多个混合参数; 音轨处理器(70; 75),用于根据多个混合参数来处理多个音轨; 以及用于将由音频轨道处理器处理的多个音频轨道组合成混合信号(MS)的音轨组合器(76)。 相应的方法包括:接收语义混合命令; 从所述语义混合命令导出所述多个音轨的多个混合参数; 根据多个混合参数处理多个音频轨道; 以及组合由多个音频轨道的处理产生的多个音频轨道以形成混合信号。

    Vorrichtung und Verfahren zum Erzeugen eines Umgebungssignals
    70.
    发明公开
    Vorrichtung und Verfahren zum Erzeugen eines Umgebungssignals 有权
    装置和方法用于产生环境信号

    公开(公告)号:EP2402943A2

    公开(公告)日:2012-01-04

    申请号:EP11182965.1

    申请日:2007-01-30

    IPC分类号: G10L19/02 H04S5/00

    CPC分类号: H04S5/005 G10L19/008 H04R5/04

    摘要: Zum Erzeugen eines Umgebungssignals, das zur Ausstrahlung über Lautsprecher geeignet ist, für die kein eigenes Lautsprechersignal existiert, also beispielsweise für Surround-Kanäle, ist ein Transienten-Detektor(11) vorgesehen, um einen Transientenzeitraum zu detektieren. Ein Synthesesignalgenerator(12) erzeugt ein Synthesesignal, das einerseits die Transientenbedingung und andererseits die Kontinuitätsbedingung für das Synthesesignal erfüllt. Ein Signalsubstituierer(14) ersetzt dann einen Abschnitt des Untersuchungssignals durch das Synthesesignal, um ein Umgebungssignal für die Surround-Kanäle zu erhalten.

    摘要翻译: 用于产生环境信号,其适合于在扬声器广播针对不存在自己的扬声器信号,因此,例如,环绕声道,瞬态检测器(11)设置成检测Transientenzeitraum。 内合成信号发生器(12)生成,一方面和另一方面Transientenbedingung满足用于合成信号的连续性条件的合成信号。 甲Signalsubstituierer(14)取代了探测信号的一部分由所述合成信号以获得用于环绕声道环绕信号。