Method and apparatus for producing a fingerprint, and method and apparatus for identifying an audio signal
    1.
    发明授权
    Method and apparatus for producing a fingerprint, and method and apparatus for identifying an audio signal 有权
    用于制造指纹的方法和装置,以及用于识别音频信号的方法和装置

    公开(公告)号:US07460994B2

    公开(公告)日:2008-12-02

    申请号:US10483452

    申请日:2002-06-20

    IPC分类号: G10L15/00

    摘要: For producing a fingerprint of an audio signal, use is made of information defining a plurality of predetermined fingerprint modi, all of the fingerprint modi relating to the same type of fingerprint, the fingerprint modi, however, providing different fingerprints differing from each other with regard to their data volume, on the one hand, and to their characterizing strength for characterizing the audio signal, on the other hand, the fingerprint modi being pre-determined such that a fingerprint in accordance with a fingerprint modus having a first characterizing strength is convertible to a fingerprint in accordance with a fingerprint modus having a second characterizing strength, without using the audio signal. A predetermined fingerprint modus of the plurality of predetermined fingerprint modi is set and subsequently used for computing a fingerprint using the audio signal. The convertibility feature of the fingerprints having been produced by the different fingerprint modi enables setting a flexible compromise between the data volume and the characterizing strength for certain applications without having to re-generate a fingerprint database with each change of the fingerprint modus. Fingerprint representations scaled with regard to time or frequency may readily be converted to a different fingerprint modus.

    摘要翻译: 为了产生音频信号的指纹,使用定义多个预定指纹模式的信息,与相同类型的指纹相关的所有指纹模式,指纹模式,然而,提供彼此不同的不同指纹 一方面涉及它们的数据量,以及它们用于表征音频信号的特征强度,另一方面,预先指定的指纹模式使得根据具有第一特征强度的指纹模式的指纹可转换 根据具有第二特征强度的指纹模式,指纹,而不使用音频信号。 设置多个预定指纹模式的预定指纹模式,并随后用于使用音频信号计算指纹。 由不同的指纹模式产生的指纹的可转换特征使得能够在某些应用的数据量和特征强度之间设置灵活的折衷,而不必随着指纹模式的每次变化重新生成指纹数据库。 关于时间或频率缩放的指纹表示可以容易地转换成不同的指纹模式。

    Method and device for characterizing a signal and method and device for producing an indexed signal
    2.
    发明授权
    Method and device for characterizing a signal and method and device for producing an indexed signal 有权
    用于表征信号的方法和装置以及用于产生索引信号的方法和装置

    公开(公告)号:US07478045B2

    公开(公告)日:2009-01-13

    申请号:US10484513

    申请日:2002-07-15

    IPC分类号: G10L15/02 G10L19/02

    摘要: In a method for characterizing a signal representing an audio content a measure is determined for a tonality of the signal, whereupon a statement is made about the audio content of the signal on the basis of the measure for the tonality of the signal. The measure for the tonality is derived from a quotient whose numerator is the mean of the summed values of spectral components of the signal exponentiated with a first power and whose denominator is the mean of the summed values of spectral components exponentiated with a second power, the first and second powers differing from each other. The measure for the tonality of the signal for the content analysis is robust in relation to a signal distortion, due e.g. to MP3 coding, and has a high correlation with the content of the analyzed signal.

    摘要翻译: 在表征音频内容的信号的表征方法中,针对信号的音调确定了一个度量,然后根据该信号音调的度量,对该信号的音频内容做出声明。 音调的度量来自商,其分子是以第一功率取幂的信号的频谱分量的总和值的平均值,其分母是用第二功率指数的频谱分量的总和值的平均值, 第一和第二权力彼此不同。 用于内容分析的信号的音调的度量相对于信号失真是鲁棒的,例如。 到MP3编码,并且与分析的信号的内容具有高度的相关性。

    Apparatus and method for synthesizing three output channels using two input channels
    3.
    发明授权
    Apparatus and method for synthesizing three output channels using two input channels 有权
    使用两个输入通道合成三个输出通道的装置和方法

    公开(公告)号:US07760886B2

    公开(公告)日:2010-07-20

    申请号:US11313180

    申请日:2005-12-20

    IPC分类号: H04R5/00

    CPC分类号: H04S5/00

    摘要: For synthesizing at least three output channels using two stereo input channels, the stereo input channels are analyzed to detect signal components occurring in both input channels. A signal generator is operative to introduce at least a part of the detected signal components into the second channel associated with a second speaker in an intended speaker scheme, which is positioned between a first and a third speaker in the speaker scheme. When, however, feeding of the complete detected signal components would result in a clipping situation, then only a part of the detected signal components is fed into the second channel as a real center channel and the remainder is located in the first and third channels as a phantom center channel.

    摘要翻译: 为了使用两个立体声输入通道合成至少三个输出通道,分析立体声输入通道以检测在两个输入通道中发生的信号分量。 信号发生器可操作以将所检测的信号分量的至少一部分引入与位于扬声器方案中的第一和第三扬声器之间的预期扬声器方案中与第二扬声器相关联的第二通道。 然而,当提供完整检测到的信号分量将导致削波情况时,则只有一部分检测到的信号分量被馈送到第二信道中作为实际中心信道,其余部分位于第一和第三信道中,如 幻影中心频道。

    Method and device for processing time-discrete audio sampled values
    4.
    发明授权
    Method and device for processing time-discrete audio sampled values 有权
    用于处理时间离散音频采样值的方法和装置

    公开(公告)号:US07512539B2

    公开(公告)日:2009-03-31

    申请号:US10479398

    申请日:2002-05-28

    IPC分类号: G06F17/14 G10L19/00

    CPC分类号: G10L19/0212 G06F17/147

    摘要: An integer transform, which provides integer output values, carries out the TDAC function of a MDCT in the time domain before the forward transform. In overlapping windows, this results in a Givens rotation which may be represented by lifting matrices, wherein time-discrete sampled values of an audio signal may at first be summed up on a pair-wise basis to build a vector so as to be sequentially provided with a lifting matrix. After each multiplication of a vector by a lifting matrix, a rounding step is carried out such that, on the output-side, only integers will result. By transforming the windowed integer sampled value with an integer transform, a spectral representation with integer spectral values may be obtained. The inverse mapping with an inverse rotation matrix and corresponding inverse lifting matrices results in an exact reconstruction.

    摘要翻译: 提供整数输出值的整数变换在正向变换之前的时域中执行MDCT的TDAC功能。 在重叠窗口中,这导致Givens旋转,其可以由提升矩阵表示,其中音频信号的时间离散采样值可以首先在成对的基础上相加以构建向量以便顺序地提供 与提升矩阵。 在通过提升矩阵对向量进行每次乘法之后,执行舍入步骤,使得在输出侧仅将导致整数。 通过用整数变换变换窗口整数采样值,可以获得具有整数频谱值的频谱表示。 具有逆旋转矩阵和对应的反提升矩阵的逆映射导致精确重建。

    Device and method for embedding a watermark in an audio signal
    5.
    发明授权
    Device and method for embedding a watermark in an audio signal 有权
    将音频信号嵌入水印的装置和方法

    公开(公告)号:US07346514B2

    公开(公告)日:2008-03-18

    申请号:US10481860

    申请日:2002-05-10

    IPC分类号: G10L11/00 G04L9/00

    CPC分类号: G10L19/018 G10L19/02

    摘要: Prior to embedding a watermark in an audio signal, a spectral representation of the audio signal and a spectral representation of the watermark signal are determined. The spectral representation of the watermark signal is then processed on the basis of a psychoacoustic masking threshold of the audio signal. The processed watermark signal is combined with the audio signal to obtain an audio signal bearing a watermark. The spectral representation of the watermark signal is processed iteratively as follows: first a predetermined watermark initial value is selected, then the interference introduced into the spectral representation of the audio signal after a quantization of the spectral representation of the audio signal is determined and then, if the interference introduced by the watermark initial value exceeds the predetermined interference threshold, the watermark initial value is modified progressively until the resulting interference introduced into the spectral representation of the audio signal after quantization is less than or equal to the predetermined interference threshold. The modified watermark initial value at the end of the iteration is used as the processed watermark signal to be combined with the audio signal. As a result it is no longer possible for a watermark to be quantized out. Instead, full control over the energy of the watermark is achieved. A watermark can therefore be embedded in an audio signal to provide either the best possible degree of watermark detectability or the best possible audio quality.

    摘要翻译: 在将音频信号嵌入水印之前,确定音频信号的频谱表示和水印信号的频谱表示。 然后基于音频信号的心理声学屏蔽阈值处理水印信号的频谱表示。 经处理的水印信号与音频信号组合以获得带有水印的音频信号。 水印信号的频谱表示如下进行迭代处理:首先选择一个预定的水印初始值,然后确定音频信号的频谱表示量化后引入到音频信号的频谱表示中的干扰, 如果由水印初始值引入的干扰超过预定的干扰阈值,则水印初始值被逐渐修改,直到引入量化后的音频信号的频谱表示的干扰小于或等于预定的干扰阈值。 使用迭代结束时的修改水印初始值作为与音频信号组合的经处理水印信号。 因此,不可能将水印量化出来。 相反,实现了对水印能量的完全控制。 因此,水印可以嵌入在音频信号中以提供最佳可能程度的水印检测能力或最佳音频质量。

    Compatible multi-channel coding/decoding by weighting the downmix channel
    7.
    发明授权
    Compatible multi-channel coding/decoding by weighting the downmix channel 有权
    通过对下混通道进行加权来兼容多通道编码/解码

    公开(公告)号:US07447317B2

    公开(公告)日:2008-11-04

    申请号:US10679085

    申请日:2003-10-02

    摘要: In processing a multi-channel audio signal having at least three original channels, a first downmix channel and a second downmix channel are provided, which are derived from the original channels. For a selected original channel of the original channels, channel side information are calculated such that a downmix channel or a combined downmix channel including the first and the second downmix channels, when weighted using the channel side information, results in an approximation of the selected original channel. The channel side information and the first and second downmix channels form output data to be transmitted to a decoder, which, in case of a low level decoder only decodes the first and second downmix channels or, in case of a high level decoder provides a full multi-channel audio signal based on the downmix channels and the channel side information.

    摘要翻译: 在处理具有至少三个原始信道的多声道音频信号时,提供从原始信道导出的第一下混通道和第二下混通道。 对于原始频道的所选择的原始频道,计算频道侧信息,使得当使用频道侧信息加权时,包括第一和第二下混频道的下混频道或组合缩混频道导致所选原稿的近似 渠道。 信道侧信息以及第一和第二下混通道形成要发送到解码器的输出数据,其在低电平解码器的情况下仅解码第一和第二下混通道,或者在高电平解码器提供满 基于下混频道的多声道音频信号和频道侧信息。

    Apparatus for analyzing an audio signal with regard to rhythm information of the audio signal by using an autocorrelation function
    8.
    发明授权
    Apparatus for analyzing an audio signal with regard to rhythm information of the audio signal by using an autocorrelation function 有权
    用于通过使用自相关函数来分析关于音频信号的节奏信息的音频信号的装置

    公开(公告)号:US07012183B2

    公开(公告)日:2006-03-14

    申请号:US10713691

    申请日:2003-11-14

    IPC分类号: G10H1/40

    摘要: An apparatus for analyzing an audio signal with regard to rhythm information of the audio signal by using an autocorrelation function comprises a filter bank for separating the audio signal into at least two sub-band signals. The sub-band signals are examined with regard to periodicities by an autocorrelation function, to obtain rhythm raw-information for the at least two sub-band signals. To reduce or eliminate the ambiguities of the autocorrelation function for periodical signals, the rhythm raw-information is postprocessed to obtain post-processed rhythm raw-information for the sub-band signal. The rhythm information of the audio signal is established based on the postprocessed rhythm raw-information. By the sub-band-wise ACF postprocessing, ACF ambiguities are already eliminated where they originate, and rhythm portions are added at double tempi, which an autocorrelation function processing does normally not provide, so that, as a result, a more robust determination of the rhythm information of the audio signal arises.

    摘要翻译: 用于通过使用自相关函数来分析关于音频信号的节奏信息的音频信号的装置包括用于将音频信号分离成至少两个子带信号的滤波器组。 通过自相关函数检查子带信号的周期性,以获得用于至少两个子带信号的节奏原始信息。 为了减少或消除周期信号的自相关函数的不确定性,后处理节奏原始信息以获得用于子带信号的后处理节奏原始信息。 基于后处理的节奏原始信息建立音频信号的节奏信息。 通过子带式ACF后处理,ACF模糊度已经被消除,它们起源,并且节奏部分以双重温度被添加,自相关函数处理通常不提供,因此,结果是更稳健地确定 音频信号的节奏信息出现。

    Method for coding an audio signal
    9.
    发明授权
    Method for coding an audio signal 有权
    音频信号编码方法

    公开(公告)号:US06424939B1

    公开(公告)日:2002-07-23

    申请号:US09402684

    申请日:1999-10-06

    IPC分类号: G10L1900

    CPC分类号: H04B1/665 G10L19/028

    摘要: A method for coding or decoding an audio signal combines the advantages of TNS processing and noise substitution. A time-discrete audio signal is initially transformed to the frequency domain in order to obtain spectral values of the temporal audio signal. Subsequently, a prediction of the spectral values in relation to frequency is carried out in order to obtain spectral residual values. Within the spectral residual values, areas are detected encompassing spectral residual values with noise properties. The spectral residual values in the noise areas are noise-substituted, whereupon information concerning the noise areas and noise substitution is incorporated into side information pertaining to a coded audio signal. Thus, considerable bit savings in case of transient signals can be achieved.

    摘要翻译: 用于对音频信号进行编码或解码的方法结合了TNS处理和噪声替换的优点。 时间离散音频信号最初被变换到频域以获得时间音频信号的频谱值。 随后,进行与频率相关的频谱值的预测,以获得谱残差值。 在光谱残差值内,检测到包含具有噪声特性的光谱残差值的区域。 噪声区域中的频谱残差值被噪声替代,因此关于噪声区域和噪声替换的信息被并入与编码音频信号有关的侧面信息中。 因此,可以实现在瞬态信号的情况下相当可观的位节省。

    Compatible multi-channel coding/decoding
    10.
    发明授权
    Compatible multi-channel coding/decoding 有权
    兼容多通道编码/解码

    公开(公告)号:US09462404B2

    公开(公告)日:2016-10-04

    申请号:US13588139

    申请日:2012-08-17

    摘要: In processing a multi-channel audio signal having at least three original channels, a first downmix channel and a second downmix channel are provided, which are derived from the original channels. For a selected original channel, channel side information are calculated such that a downmix channel or a combined downmix channel including the first and the second downmix channels, when weighted using the channel side information, results in an approximation of the selected original channel. The channel side information and the first and second downmix channels form output data to be transmitted to a decoder, which, in case of a low level decoder only decodes the first and second downmix channels or, in case of a high level decoder provides a full multi-channel audio signal based on the downmix channels and the channel side information.

    摘要翻译: 在处理具有至少三个原始信道的多声道音频信号时,提供从原始信道导出的第一下混通道和第二下混通道。 对于所选择的原始信道,计算信道侧信息,使得当使用信道侧信息加权时,包括第一和第二下混通道的下混通道或组合下混通道导致所选原始通道的近似。 信道侧信息以及第一和第二下混通道形成要发送到解码器的输出数据,其在低电平解码器的情况下仅解码第一和第二下混通道,或者在高电平解码器提供满 基于下混频道的多声道音频信号和频道侧信息。