Device and method for analyzing an information signal
    11.
    Type: granted invention patent
    Status: lapsed

    Publication No.: US08175730B2

    Publication date: 2012-05-08

    Application No.: US12495138

    Filing date: 2009-06-30

    CPC classification: G10L25/48

    Abstract: To analyze an information signal, significant short-time spectra are extracted from it; the extracting means is configured to extract those short-time spectra that come closer to a specific characteristic than the other short-time spectra of the information signal. The extracted short-time spectra are then decomposed into component signals using independent component analysis (ICA), each component signal spectrum representing a profile spectrum of a tone source that generates a tone corresponding to the characteristic sought. From a sequence of short-time spectra of the information signal and from the profile spectra determined, an amplitude envelope is finally calculated for each profile spectrum, indicating how the profile spectrum of a tone source changes as a whole over time. The profile spectra and their associated amplitude envelopes provide a description of the information signal that may be evaluated further, for example for transcription in the case of a music signal.

    Envelope shaping of decorrelated signals
    12.
    Type: granted invention patent
    Status: in force

    Publication No.: US07983424B2

    Publication date: 2011-07-19

    Application No.: US11402519

    Filing date: 2006-04-12

    IPC classification: H04R5/00 G10L19/00

    Abstract: The envelope of a decorrelated signal derived from an original signal can be shaped without introducing additional distortion when a spectral flattener is used to flatten the spectra of the decorrelated signal and of the original signal before the flattened spectra are used to derive a gain factor describing the energy distribution between them, and when the gain factor so derived is used by an envelope shaper to temporally shape the envelope of the decorrelated signal.

    Device and method for analyzing an information signal
    13.
    Type: granted invention patent
    Status: lapsed

    Publication No.: US07565213B2

    Publication date: 2009-07-21

    Application No.: US11123474

    Filing date: 2005-05-05

    CPC classification: G10L25/48

    Abstract: Significant short-time spectra are extracted from an information signal; the extracting means is configured to extract those short-time spectra that come closer to a specific characteristic than others. The extracted short-time spectra are then decomposed into component signals using independent component analysis (ICA), each component signal spectrum representing a profile spectrum of a tone source that generates a tone corresponding to the characteristic sought. From a sequence of short-time spectra of the information signal and from the profile spectra determined, an amplitude envelope is calculated for each profile spectrum to indicate how the profile spectrum of a tone source changes over time. The profile spectra and their associated amplitude envelopes provide a description of the information signal that may be evaluated further, for example for transcription in the case of a music signal.

    Apparatus and method for constructing a multi-channel output signal or for generating a downmix signal
    14.
    Type: granted invention patent
    Status: in force

    Publication No.: US07394903B2

    Publication date: 2008-07-01

    Application No.: US10762100

    Filing date: 2004-01-20

    IPC classification: H04R5/00

    Abstract: An apparatus constructs a multi-channel output signal using an input signal and parametric side information, the input signal including a first input channel and a second input channel derived from an original multi-channel signal, and the parametric side information describing interrelations between channels of the original multi-channel signal. The apparatus uses base channels that differ from each other for synthesizing first and second output channels on one side of an assumed listener position. The base channels differ from each other because of a coherence measure. Coherence between the base channels (for example, for the left and the left surround reconstructed channels) is reduced by calculating the base channel for one of those channels as a combination of the input channels, the combination being determined by the coherence measure. A high subjective reconstruction quality can thus be obtained because the original front/back coherence is approximated.

    Method and device for detecting a transient in a discrete-time audio signal
    15.
    Type: granted invention patent
    Status: in force

    Publication No.: US06826525B2

    Publication date: 2004-11-30

    Application No.: US10183139

    Filing date: 2002-06-25

    IPC classification: G10L19/00

    CPC classification: H04B1/665

    Abstract: A method for detecting a transient in a discrete-time audio signal is performed entirely in the time domain. It includes the step of segmenting the discrete-time audio signal to generate consecutive segments of equal length containing unfiltered discrete-time audio signals xs(T−1). The discrete-time audio signal in a current segment is then filtered. Next, either the energy of the filtered signal in the current segment is compared with the energy of the filtered signal in a preceding segment, or a current relationship between the energy of the filtered signal and the energy of the unfiltered signal in the current segment is formed and compared with the corresponding preceding relationship. On the basis of one or both of these comparisons, it is detected whether a transient is present in the discrete-time audio signal.
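
The time-domain procedure described in this abstract can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not the patented implementation: the first-difference high-pass filter, the segment length, and the two threshold factors are choices made for the example, and all function names are hypothetical.

```python
import math

def energy(samples):
    """Sum of squared sample values."""
    return sum(s * s for s in samples)

def highpass(samples):
    """First-order difference; an assumed stand-in for the unspecified filter."""
    return [b - a for a, b in zip(samples, samples[1:])]

def detect_transients(signal, seg_len=64, energy_factor=4.0, ratio_factor=4.0):
    """Return one boolean per full segment: True where a transient is flagged.

    A segment is flagged when its filtered energy, or its ratio of filtered
    to unfiltered energy, exceeds the preceding segment's value by the given
    factor. Both factors are assumed values, not taken from the patent.
    """
    eps = 1e-12  # guards against division by zero in silent segments
    flags = []
    prev_energy = None
    prev_ratio = None
    for start in range(0, len(signal) - seg_len + 1, seg_len):
        segment = signal[start:start + seg_len]
        filtered_energy = energy(highpass(segment))
        ratio = filtered_energy / (energy(segment) + eps)
        hit = False
        if prev_energy is not None:
            hit = (filtered_energy > energy_factor * (prev_energy + eps)
                   or ratio > ratio_factor * (prev_ratio + eps))
        flags.append(hit)
        prev_energy, prev_ratio = filtered_energy, ratio
    return flags

# Demo: a quiet sine wave with a single click inside the third segment.
demo = [0.01 * math.sin(2 * math.pi * i / 64) for i in range(192)]
demo[160] += 1.0
flags = detect_transients(demo)
```

The first segment can never be flagged because it has no predecessor to compare against; a real detector would need a policy for that case.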

    Method and device for processing time-discrete audio sampled values
    16.
    Type: granted invention patent
    Status: in force

    Publication No.: US07512539B2

    Publication date: 2009-03-31

    Application No.: US10479398

    Filing date: 2002-05-28

    IPC classification: G06F17/14 G10L19/00

    CPC classification: G10L19/0212 G06F17/147

    Abstract: An integer transform, which provides integer output values, carries out the time-domain aliasing cancellation (TDAC) function of an MDCT in the time domain before the forward transform. For overlapping windows, this results in a Givens rotation, which may be represented by lifting matrices: time-discrete sampled values of an audio signal are first grouped in pairs to form vectors, to which the lifting matrices are then applied sequentially. After each multiplication of a vector by a lifting matrix, a rounding step is carried out so that only integers result on the output side. By transforming the windowed integer sampled values with an integer transform, a spectral representation with integer spectral values may be obtained. The inverse mapping with an inverse rotation matrix and the corresponding inverse lifting matrices results in an exact reconstruction.
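
The rounding-after-each-lifting-step idea can be illustrated with a short Python sketch. It uses the standard factorization of a rotation by angle a into three shear (lifting) matrices with coefficients (cos a − 1)/sin a and sin a; whether the patent uses exactly this factorization, sign convention, or rounding rule is an assumption, and the function names are hypothetical.

```python
import math

def lifting_coeffs(angle):
    """Return (t, s) for the three-step factorization of a rotation."""
    s = math.sin(angle)
    if s == 0.0:
        raise ValueError("angle must not be a multiple of pi")
    return (math.cos(angle) - 1.0) / s, s

def int_rotate(x1, x2, angle):
    """Integer approximation of a Givens rotation via three rounded lifting steps."""
    t, s = lifting_coeffs(angle)
    x1 += round(t * x2)  # first shear, rounded to an integer
    x2 += round(s * x1)  # second shear
    x1 += round(t * x2)  # third shear
    return x1, x2

def int_rotate_inverse(y1, y2, angle):
    """Undo int_rotate exactly by subtracting the same rounded terms in reverse order."""
    t, s = lifting_coeffs(angle)
    y1 -= round(t * y2)
    y2 -= round(s * y1)
    y1 -= round(t * y2)
    return y1, y2

# Demo: rotate an integer pair and recover it without any error.
rotated = int_rotate(100, -37, 0.3)
restored = int_rotate_inverse(*rotated, 0.3)
```

Each lifting step changes only one component by a rounded function of the other, so the inverse can recompute and subtract exactly the same integer. That is why reconstruction is exact even though the forward result only approximates the real-valued rotation.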

    Method and apparatus for producing a fingerprint, and method and apparatus for identifying an audio signal
    17.
    Type: granted invention patent
    Status: in force

    Publication No.: US07460994B2

    Publication date: 2008-12-02

    Application No.: US10483452

    Filing date: 2002-06-20

    IPC classification: G10L15/00

    Abstract: To produce a fingerprint of an audio signal, use is made of information defining a plurality of predetermined fingerprint modes, all of which relate to the same type of fingerprint. The modes yield fingerprints that differ from each other in their data volume on the one hand and in their characterizing strength for characterizing the audio signal on the other. The modes are predetermined such that a fingerprint produced in a mode having a first characterizing strength is convertible, without using the audio signal, to a fingerprint in a mode having a second characterizing strength. One of the predetermined fingerprint modes is set and then used for computing a fingerprint from the audio signal. Because fingerprints produced in different modes are convertible, the compromise between data volume and characterizing strength can be set flexibly for a given application without regenerating the fingerprint database each time the fingerprint mode changes. Fingerprint representations scaled with regard to time or frequency may readily be converted to a different fingerprint mode.

    Device and method for embedding a watermark in an audio signal
    18.
    Type: granted invention patent
    Status: in force

    Publication No.: US07346514B2

    Publication date: 2008-03-18

    Application No.: US10481860

    Filing date: 2002-05-10

    IPC classification: G10L11/00 G04L9/00

    CPC classification: G10L19/018 G10L19/02

    Abstract: Before a watermark is embedded in an audio signal, a spectral representation of the audio signal and a spectral representation of the watermark signal are determined. The spectral representation of the watermark signal is then processed on the basis of a psychoacoustic masking threshold of the audio signal. The processed watermark signal is combined with the audio signal to obtain an audio signal bearing a watermark. The spectral representation of the watermark signal is processed iteratively as follows: first, a predetermined initial watermark value is selected; next, the interference introduced into the spectral representation of the audio signal after quantization of that spectral representation is determined; then, if the interference introduced by the initial watermark value exceeds the predetermined interference threshold, the value is modified progressively until the resulting interference after quantization is less than or equal to the predetermined interference threshold. The modified watermark value at the end of the iteration is used as the processed watermark signal to be combined with the audio signal. As a result, a watermark can no longer be quantized out; instead, full control over the energy of the watermark is achieved. A watermark can therefore be embedded in an audio signal to provide either the best possible watermark detectability or the best possible audio quality.

    Energy dependent quantization for efficient coding of spatial audio parameters
    19.
    Type: granted invention patent
    Status: in force

    Publication No.: US08054981B2

    Publication date: 2011-11-08

    Application No.: US11406631

    Filing date: 2006-04-19

    IPC classification: H04R5/00

    CPC classification: G10L19/03 G10L19/008

    Abstract: A parameter that is a measure of a characteristic of a channel or of a pair of channels with respect to another channel of a multi-channel signal can be quantized more efficiently using a quantization rule generated from the relation between an energy measure of the channel or pair of channels and an energy measure of the multi-channel signal. When generation of the quantization rule takes a psychoacoustic approach into account, the size of an encoded representation of the multi-channel signal can be decreased by coarser quantization without significantly degrading the perceived quality of the multi-channel signal reconstructed from the encoded representation.
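
As a rough illustration of choosing a quantization rule from an energy relation, the Python sketch below picks a coarser step size for a spatial parameter when its channel carries little of the total signal energy. The two step sizes, the -20 dB threshold, the two-level rule, and the function names are all assumptions made for the example; the abstract does not specify them.

```python
import math

def choose_step(channel_energy, total_energy,
                fine_step=1.0, coarse_step=4.0, threshold_db=-20.0):
    """Pick a quantizer step from the channel's energy relative to the whole signal."""
    if channel_energy <= 0.0 or total_energy <= 0.0:
        return coarse_step  # nothing audible to protect, so quantize coarsely
    relative_db = 10.0 * math.log10(channel_energy / total_energy)
    return fine_step if relative_db >= threshold_db else coarse_step

def quantize(value, step):
    """Map a parameter value to an integer index of a uniform quantizer."""
    return round(value / step)

def dequantize(index, step):
    """Reconstruct the parameter value from its index."""
    return index * step

# Demo: the same parameter value (say, a level difference of 7.3 dB)
# quantized for a loud channel and for a very quiet one.
loud_step = choose_step(1.0, 2.0)      # about -3 dB relative energy
quiet_step = choose_step(0.001, 2.0)   # about -33 dB relative energy
loud_value = dequantize(quantize(7.3, loud_step), loud_step)
quiet_value = dequantize(quantize(7.3, quiet_step), quiet_step)
```

The decoder must be able to derive the same step size, so in practice the energy relation would have to be computed from data available on both sides.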
