Apparatus and method for constructing a multi-channel output signal or for generating a downmix signal
    1.
    Granted invention patent
    Status: in force

    Publication No.: US07394903B2

    Publication date: 2008-07-01

    Application No.: US10762100

    Filing date: 2004-01-20

    IPC classification: H04R5/00

    Abstract: An apparatus constructs a multi-channel output signal from an input signal and parametric side information. The input signal includes a first input channel and a second input channel derived from an original multi-channel signal, and the parametric side information describes interrelations between the channels of the original multi-channel signal. To synthesize first and second output channels on one side of an assumed listener position, the apparatus uses base channels that differ from each other because of a coherence measure. Coherence between the base channels (for example, those of the reconstructed left and left surround channels) is reduced by calculating the base channel for one of those channels as a combination of the input channels, the combination being determined by the coherence measure. Because the original front/back coherence is approximated, a high subjective quality of the reconstruction can be obtained.

    Apparatus and method for generating a multi-channel output signal
    2.
    Granted invention patent
    Status: in force

    Publication No.: US07391870B2

    Publication date: 2008-06-24

    Application No.: US10935061

    Filing date: 2004-09-07

    IPC classification: H04R5/00 G06F17/00 G10L19/00

    Abstract: An apparatus for generating a multi-channel output signal performs a center channel cancellation to obtain improved base channels for reconstructing left-side or right-side output channels. In particular, the apparatus includes a cancellation channel calculator that calculates a cancellation channel using information, available at the decoder, related to the original center channel. The apparatus furthermore includes a combiner for combining a transmission channel with the cancellation channel, and a reconstructor for generating the multi-channel output signal. Owing to the center channel cancellation, the channel reconstructor not only uses a different base channel for reconstructing the center channel but also reconstructs the left and right output channels from base channels that differ from the transmission channels, so that the influence of the original center channel on those output channels is reduced or even completely cancelled.

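    The cancellation and combination steps named in the abstract can be sketched as follows. This is a minimal illustration only: it assumes the transmission channel was formed as the original left channel plus a weighted center channel and that the decoder has a center estimate. The function names, the toy signals, and the weight 0.7071 are hypothetical and do not come from the patent.

```python
def make_cancellation_channel(center_estimate, weight):
    """Scale the (assumed) decoder-side center estimate and flip its sign."""
    return [-weight * c for c in center_estimate]


def combine(transmission, cancellation):
    """Add the cancellation channel to the transmission channel sample by sample."""
    if len(transmission) != len(cancellation):
        raise ValueError("channels must have the same length")
    return [t + k for t, k in zip(transmission, cancellation)]


# Toy data: hypothetical downmix Lt = L + WEIGHT * C (assumption, not from the patent).
WEIGHT = 0.7071
left = [0.5, -0.25, 0.125, 0.0]
center = [1.0, 1.0, -1.0, 0.5]
transmission_left = [l + WEIGHT * c for l, c in zip(left, center)]

# With a perfect center estimate, the base channel matches the original left channel
# up to floating-point rounding.
base_left = combine(transmission_left, make_cancellation_channel(center, WEIGHT))
```

    In practice the center estimate is only approximate, so the center influence would be reduced rather than removed entirely.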

    Method and device for processing time-discrete audio sampled values
    3.
    Granted invention patent
    Status: in force

    Publication No.: US07512539B2

    Publication date: 2009-03-31

    Application No.: US10479398

    Filing date: 2002-05-28

    IPC classification: G06F17/14 G10L19/00

    CPC classification: G10L19/0212 G06F17/147

    Abstract: An integer transform, which provides integer output values, carries out the TDAC (time-domain aliasing cancellation) function of an MDCT in the time domain before the forward transform. For overlapping windows, this results in a Givens rotation, which may be represented by lifting matrices. Time-discrete sampled values of an audio signal are first grouped pairwise into vectors, and each vector is then multiplied sequentially by the lifting matrices. After each multiplication of a vector by a lifting matrix, a rounding step is carried out so that only integers result on the output side. By transforming the windowed integer sampled values with an integer transform, a spectral representation with integer spectral values may be obtained. The inverse mapping with an inverse rotation matrix and corresponding inverse lifting matrices results in an exact reconstruction.

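    The lifting idea in the abstract can be illustrated with the standard three-step factorization of a Givens rotation, rounding after every step. The sketch below is a generic textbook construction, not the patent's exact procedure. The function names are hypothetical, and the angle is assumed to have a non-zero sine.

```python
import math


def int_rotate(x, y, alpha):
    """Integer-to-integer approximation of a rotation by alpha.

    Uses R(alpha) = [[1, t], [0, 1]] * [[1, 0], [s, 1]] * [[1, t], [0, 1]]
    with t = (cos(alpha) - 1) / sin(alpha) and s = sin(alpha),
    rounding after each lifting step so that only integers result.
    """
    s = math.sin(alpha)
    t = (math.cos(alpha) - 1.0) / s
    x = x + round(t * y)   # first lifting step
    y = y + round(s * x)   # second lifting step
    x = x + round(t * y)   # third lifting step
    return x, y


def int_rotate_inverse(x, y, alpha):
    """Undo int_rotate exactly by subtracting the same rounded terms in reverse order."""
    s = math.sin(alpha)
    t = (math.cos(alpha) - 1.0) / s
    x = x - round(t * y)
    y = y - round(s * x)
    x = x - round(t * y)
    return x, y


# Round trip on integer samples: the reconstruction is exact despite the rounding.
rotated = int_rotate(1000, -37, 0.3)
restored = int_rotate_inverse(rotated[0], rotated[1], 0.3)
```

    Each lifting step changes only one component and leaves the other untouched, so the rounded term can be recomputed and subtracted in the inverse. This is why the reconstruction is exact.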

    Method and apparatus for producing a fingerprint, and method and apparatus for identifying an audio signal
    4.
    Granted invention patent
    Status: in force

    Publication No.: US07460994B2

    Publication date: 2008-12-02

    Application No.: US10483452

    Filing date: 2002-06-20

    IPC classification: G10L15/00

    Abstract: To produce a fingerprint of an audio signal, use is made of information defining a plurality of predetermined fingerprint modi. All of the fingerprint modi relate to the same type of fingerprint but provide fingerprints that differ from each other in their data volume, on the one hand, and in their characterizing strength for characterizing the audio signal, on the other hand. The fingerprint modi are predetermined such that a fingerprint in accordance with a fingerprint modus having a first characterizing strength is convertible to a fingerprint in accordance with a fingerprint modus having a second characterizing strength without using the audio signal. One fingerprint modus of the plurality of predetermined fingerprint modi is set and subsequently used for computing a fingerprint from the audio signal. Because fingerprints produced by the different fingerprint modi are convertible, a flexible compromise between data volume and characterizing strength can be set for a given application without having to regenerate a fingerprint database with each change of the fingerprint modus. Fingerprint representations scaled with regard to time or frequency may readily be converted to a different fingerprint modus.

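    One way to picture the convertibility described in the abstract is a fingerprint stored as a list of frames, each holding one value per frequency band. A smaller fingerprint can then be derived from a larger one without touching the audio. The layout, the function names, and the choice of truncation and averaging are assumptions made for this sketch and are not taken from the patent.

```python
def truncate_bands(fingerprint, n_bands):
    """Frequency scaling: keep only the first n_bands values of every frame."""
    if fingerprint and n_bands > len(fingerprint[0]):
        raise ValueError("cannot convert to a modus with more bands")
    return [frame[:n_bands] for frame in fingerprint]


def merge_frames(fingerprint, factor):
    """Time scaling: average each group of `factor` consecutive frames.

    Trailing frames that do not fill a complete group are dropped.
    """
    if factor < 1:
        raise ValueError("factor must be at least 1")
    merged = []
    for start in range(0, len(fingerprint) - factor + 1, factor):
        group = fingerprint[start:start + factor]
        merged.append([sum(column) / factor for column in zip(*group)])
    return merged


# Toy high-resolution fingerprint: 4 frames x 4 bands (values are made up).
detailed = [
    [1.0, 2.0, 3.0, 4.0],
    [3.0, 4.0, 5.0, 6.0],
    [5.0, 6.0, 7.0, 8.0],
    [7.0, 8.0, 9.0, 10.0],
]
fewer_bands = truncate_bands(detailed, 2)
fewer_frames = merge_frames(detailed, 2)
```

    Both conversions only discard or pool information, which matches the one-way direction from a larger data volume to a smaller one.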

    Device and method for embedding a watermark in an audio signal
    5.
    Granted invention patent
    Status: in force

    Publication No.: US07346514B2

    Publication date: 2008-03-18

    Application No.: US10481860

    Filing date: 2002-05-10

    IPC classification: G10L11/00 G04L9/00

    CPC classification: G10L19/018 G10L19/02

    Abstract: Prior to embedding a watermark in an audio signal, a spectral representation of the audio signal and a spectral representation of the watermark signal are determined. The spectral representation of the watermark signal is then processed on the basis of a psychoacoustic masking threshold of the audio signal. The processed watermark signal is combined with the audio signal to obtain an audio signal bearing a watermark. The spectral representation of the watermark signal is processed iteratively as follows. First, a predetermined watermark initial value is selected. Next, the interference introduced into the spectral representation of the audio signal after a quantization of that spectral representation is determined. If the interference introduced by the watermark initial value exceeds a predetermined interference threshold, the watermark initial value is modified progressively until the resulting interference after quantization is less than or equal to the predetermined interference threshold. The modified watermark initial value at the end of the iteration is used as the processed watermark signal to be combined with the audio signal. As a result, a watermark can no longer be quantized out; instead, full control over the energy of the watermark is achieved. A watermark can therefore be embedded in an audio signal to provide either the best possible watermark detectability or the best possible audio quality.

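    The iteration described in the abstract can be sketched for a single spectral coefficient. The sketch rests on several assumptions: a uniform quantizer, interference measured as the change the watermark causes in the quantized value, and a fixed shrink factor. The function names and parameters are hypothetical and not taken from the patent.

```python
def quantize(value, step):
    """Uniform quantizer (an assumption; the patent does not fix the quantizer here)."""
    return step * round(value / step)


def interference(audio_coeff, watermark, step):
    """Change in the quantized coefficient caused by adding the watermark."""
    return abs(quantize(audio_coeff + watermark, step) - quantize(audio_coeff, step))


def fit_watermark(audio_coeff, initial, threshold, step, shrink=0.5, max_iter=100):
    """Shrink the watermark value until its post-quantization interference fits."""
    watermark = initial
    for _ in range(max_iter):
        if interference(audio_coeff, watermark, step) <= threshold:
            return watermark
        watermark *= shrink  # progressive modification of the initial value
    return 0.0  # fallback: embed nothing if no admissible value was found


# Toy numbers (made up): coefficient 10.3, initial watermark 5.0, threshold 1.0.
fitted = fit_watermark(10.3, 5.0, threshold=1.0, step=1.0)
```

    This sketch only enforces the upper bound on interference. A fuller version would also check that the watermark survives quantization, which is the point the abstract makes about the watermark not being quantized out.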

    Compatible multi-channel coding/decoding by weighting the downmix channel
    7.
    Granted invention patent
    Status: in force

    Publication No.: US07447317B2

    Publication date: 2008-11-04

    Application No.: US10679085

    Filing date: 2003-10-02

    Abstract: In processing a multi-channel audio signal having at least three original channels, a first downmix channel and a second downmix channel, both derived from the original channels, are provided. For a selected one of the original channels, channel side information is calculated such that a downmix channel, or a combined downmix channel including the first and second downmix channels, results in an approximation of the selected original channel when weighted using the channel side information. The channel side information and the first and second downmix channels form output data to be transmitted to a decoder. A low-level decoder decodes only the first and second downmix channels, whereas a high-level decoder provides a full multi-channel audio signal based on the downmix channels and the channel side information.

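    As a rough illustration of weighting a downmix channel with channel side information, the sketch below uses a single energy-matching gain per frame. That choice is an assumption made for the example. The abstract only requires that the weighted downmix approximate the selected original channel, and the function names are hypothetical.

```python
import math


def energy(samples):
    """Sum of squared sample values."""
    return sum(v * v for v in samples)


def channel_gain(original, downmix):
    """Encoder side: gain that makes the weighted downmix match the original's energy."""
    downmix_energy = energy(downmix)
    if downmix_energy == 0.0:
        return 0.0
    return math.sqrt(energy(original) / downmix_energy)


def reconstruct(downmix, gain):
    """High-level decoder side: weight the downmix with the transmitted gain."""
    return [gain * v for v in downmix]


# Toy frame (made up): the downmix carries the original channel at twice its level.
original_left = [1.0, -2.0, 3.0]
downmix_left = [2.0, -4.0, 6.0]
gain = channel_gain(original_left, downmix_left)
approx_left = reconstruct(downmix_left, gain)
```

    A low-level decoder would ignore the gain and play the two downmix channels directly, which is what makes the scheme backward compatible.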

    Apparatus for analyzing an audio signal with regard to rhythm information of the audio signal by using an autocorrelation function
    8.
    Granted invention patent
    Status: in force

    Publication No.: US07012183B2

    Publication date: 2006-03-14

    Application No.: US10713691

    Filing date: 2003-11-14

    IPC classification: G10H1/40

    Abstract: An apparatus for analyzing an audio signal with regard to rhythm information of the audio signal by using an autocorrelation function (ACF) comprises a filter bank for separating the audio signal into at least two sub-band signals. The sub-band signals are examined for periodicities by an autocorrelation function to obtain rhythm raw-information for the at least two sub-band signals. To reduce or eliminate the ambiguities of the autocorrelation function for periodic signals, the rhythm raw-information is postprocessed to obtain postprocessed rhythm raw-information for each sub-band signal. The rhythm information of the audio signal is established based on the postprocessed rhythm raw-information. Because the ACF postprocessing is carried out per sub-band, ACF ambiguities are eliminated where they originate, and rhythm portions are added at double tempi, which autocorrelation processing does not normally provide. The result is a more robust determination of the rhythm information of the audio signal.

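    The ambiguity mentioned in the abstract arises because a signal with period P also produces ACF peaks at 2P, 3P, and so on. The sketch below computes an ACF for one sub-band envelope and suppresses peaks at integer multiples by subtracting stretched copies of the ACF. The subtraction rule, the function names, and the toy envelope are assumptions for illustration. The filter bank and the addition of double-tempo portions are omitted.

```python
def autocorrelation(signal, max_lag):
    """Plain (unnormalized) autocorrelation for lags 0..max_lag."""
    n = len(signal)
    return [
        sum(signal[i] * signal[i + lag] for i in range(n - lag))
        for lag in range(max_lag + 1)
    ]


def suppress_multiples(acf, factors=(2, 3)):
    """Subtract the ACF stretched by each factor, then clip negative values to zero."""
    out = list(acf)
    for factor in factors:
        for lag in range(factor, len(acf), factor):
            out[lag] -= acf[lag // factor]
    return [max(v, 0) for v in out]


def strongest_lag(acf):
    """Lag (excluding lag 0) with the largest ACF value."""
    return max(range(1, len(acf)), key=acf.__getitem__)


# Toy sub-band envelope: one onset every 4 samples (made-up data).
envelope = [1 if i % 4 == 0 else 0 for i in range(32)]
raw_acf = autocorrelation(envelope, 16)
post_acf = suppress_multiples(raw_acf)
period = strongest_lag(post_acf)
```

    In the raw ACF the lags 8, 12, and 16 still carry sizeable peaks. After postprocessing, only the peak at the true period remains in this toy case.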

    Method for coding an audio signal
    9.
    Granted invention patent
    Status: in force

    Publication No.: US06424939B1

    Publication date: 2002-07-23

    Application No.: US09402684

    Filing date: 1999-10-06

    IPC classification: G10L19/00

    CPC classification: H04B1/665 G10L19/028

    Abstract: A method for coding or decoding an audio signal combines the advantages of TNS processing and noise substitution. A time-discrete audio signal is first transformed to the frequency domain to obtain spectral values of the temporal audio signal. Subsequently, a prediction of the spectral values over frequency is carried out to obtain spectral residual values. Within the spectral residual values, areas encompassing spectral residual values with noise properties are detected. The spectral residual values in these noise areas are noise-substituted, and information concerning the noise areas and the noise substitution is incorporated into side information pertaining to the coded audio signal. Thus, considerable bit savings can be achieved for transient signals.

    Compatible multi-channel coding/decoding
    10.
    Granted invention patent
    Status: in force

    Publication No.: US09462404B2

    Publication date: 2016-10-04

    Application No.: US13588139

    Filing date: 2012-08-17

    Abstract: In processing a multi-channel audio signal having at least three original channels, a first downmix channel and a second downmix channel, both derived from the original channels, are provided. For a selected original channel, channel side information is calculated such that a downmix channel, or a combined downmix channel including the first and second downmix channels, results in an approximation of the selected original channel when weighted using the channel side information. The channel side information and the first and second downmix channels form output data to be transmitted to a decoder. A low-level decoder decodes only the first and second downmix channels, whereas a high-level decoder provides a full multi-channel audio signal based on the downmix channels and the channel side information.
