Method for coding an audio signal
    2.
    Granted invention patent
    Legal status: In force

    Publication No.: US06424939B1

    Publication date: 2002-07-23

    Application No.: US09402684

    Filing date: 1999-10-06

    IPC classification: G10L19/00

    CPC classification: H04B1/665 G10L19/028

    Abstract: A method for coding or decoding an audio signal combines the advantages of temporal noise shaping (TNS) processing and noise substitution. A time-discrete audio signal is initially transformed to the frequency domain in order to obtain spectral values of the temporal audio signal. Subsequently, a prediction of the spectral values in relation to frequency is carried out in order to obtain spectral residual values. Within the spectral residual values, areas are detected encompassing spectral residual values with noise properties. The spectral residual values in the noise areas are noise-substituted, whereupon information concerning the noise areas and noise substitution is incorporated into side information pertaining to a coded audio signal. Thus, considerable bit savings can be achieved in the case of transient signals.
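
The encoder-side steps in this abstract (prediction over frequency, detection of noise-like areas in the residual, substitution by side information) can be sketched roughly as follows. This is a minimal illustration, not the patented method: the first-order predictor, the crest-factor detector `is_noise_like`, the `crest_limit` threshold, and the fixed `band_size` are all assumptions made for the example.

```python
import math

def first_order_coeff(spectrum):
    """Least-squares predictor coefficient from the autocorrelation over frequency."""
    r0 = sum(x * x for x in spectrum[:-1])
    r1 = sum(spectrum[k] * spectrum[k - 1] for k in range(1, len(spectrum)))
    return r1 / r0 if r0 > 0 else 0.0

def predict_over_frequency(spectrum, coeff):
    """Open-loop first-order prediction along the frequency axis."""
    residual = [spectrum[0]]
    for k in range(1, len(spectrum)):
        residual.append(spectrum[k] - coeff * spectrum[k - 1])
    return residual

def is_noise_like(band, crest_limit=1.5):
    """Hypothetical detector: a low peak-to-RMS ratio is treated as noise."""
    rms = math.sqrt(sum(x * x for x in band) / len(band))
    if rms == 0:
        return False
    return max(abs(x) for x in band) / rms < crest_limit

def encode(spectrum, band_size=4):
    """Return the predictor coefficient and one entry per band of the residual."""
    coeff = first_order_coeff(spectrum)
    residual = predict_over_frequency(spectrum, coeff)
    bands = []
    for start in range(0, len(residual), band_size):
        band = residual[start:start + band_size]
        if is_noise_like(band):
            # Noise substitution: only the band energy goes into the side information.
            bands.append(("noise", sum(x * x for x in band)))
        else:
            bands.append(("coded", band))
    return coeff, bands
```

A real coder would typically use a higher-order predictor and a psychoacoustically motivated noise detector; the sketch only shows where the bit savings come from, namely that a noise band is reduced to a single energy value.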

    Process for coding and decoding stereophonic spectral values
    3.
    Granted invention patent
    Legal status: In force

    Publication No.: US06771777B1

    Publication date: 2004-08-03

    Application No.: US09214656

    Filing date: 1999-05-28

    IPC classification: H04H5/00

    CPC classification: H04S1/007

    Abstract: A method of coding stereo audio spectral values first carries out grouping of those values in scale factor bands, with which scale factors are associated. Sections are formed next, each comprising at least one scale factor band. The spectral values are coded within at least one section with a code book assigned to the section, out of a plurality of code books each with a code book number assigned to it, the number of the code book used being transmitted as side information to the coded stereo audio spectral values. At least one additional code book number is provided, which does not refer to a code book but shows information relevant to the section to which it is assigned. A method of decoding stereo audio spectral values which are partly coded by the intensity stereo process and which have side information uses the relevant information, showing the additional code book numbers, to cancel the existing coding of the stereo audio spectral values.
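
As a rough decoder-side illustration of the idea, the sketch below treats two reserved code book numbers as flags for intensity stereo rather than as references to real code books. The numeric values `14` and `15`, the section dictionary layout, and the linear `scale` field are assumptions chosen for the example, not details taken from the abstract.

```python
# Assumed reserved numbers: they select no code book, they only carry information.
INTENSITY_OUT_OF_PHASE = 14
INTENSITY_IN_PHASE = 15

def decode_right_channel(sections, left):
    """Rebuild the right channel section by section.

    Each section is a dict with 'codebook' and 'width'. Ordinary sections carry
    already-decoded 'values'; intensity sections carry a hypothetical linear
    'scale' that is applied to the left channel.
    """
    right = []
    pos = 0
    for section in sections:
        width = section["width"]
        book = section["codebook"]
        if book in (INTENSITY_IN_PHASE, INTENSITY_OUT_OF_PHASE):
            sign = 1 if book == INTENSITY_IN_PHASE else -1
            right.extend(sign * section["scale"] * v for v in left[pos:pos + width])
        else:
            right.extend(section["values"])
        pos += width
    return right
```

The point of the design is that no extra flag is needed in the bit stream: the section's code book number alone tells the decoder whether to read coded values or to derive them from the other channel.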

    Method for signalling a noise substitution during audio signal coding
    4.
    Granted invention patent
    Legal status: In force

    Publication No.: US06766293B1

    Publication date: 2004-07-20

    Application No.: US09367775

    Filing date: 1999-08-18

    IPC classification: G10L21/02

    CPC classification: G10L19/028 H04B1/665

    Abstract: In a method for signalling a noise substitution when coding an audio signal, the time-domain audio signal is first transformed into the frequency domain to obtain spectral values. The spectral values are subsequently grouped together to form groups of spectral values. On the basis of a detection establishing whether a group of spectral values is a noisy group or not, a codebook is allocated to a non-noisy or tonal group by means of a codebook number for redundancy coding of the same. If a group is noisy, an additional codebook number which does not refer to a codebook is allocated to it in order to signal that this group is noisy and therefore does not have to be redundancy coded. By signalling noise substitution by means of a Huffman codebook number for noisy groups of spectral values, which are, e.g., sections made up of scale factor bands which do not have to be redundancy coded, an opportunity is provided to indicate the presence of a noise substitution in a scale factor band in the bit stream syntax of the MPEG-2 Advanced Audio Coding (AAC) Standard without having to interfere with the basic coding structure and without having to meddle with the structure of the existing bit stream syntax.
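
The signalling scheme can be illustrated with a small encoder-side sketch. The reserved number `NOISE_CODEBOOK = 13`, the mapping in `choose_codebook`, and the side-information layout are assumptions for illustration; the noise decision is passed in as a flag because the detection itself is outside the scope of the sketch.

```python
NOISE_CODEBOOK = 13  # assumed reserved number; it refers to no Huffman table

def choose_codebook(largest_abs):
    """Illustrative mapping from the largest quantized magnitude to a codebook number."""
    if largest_abs == 0:
        return 0
    for limit, book in ((1, 1), (2, 3), (4, 5), (7, 7), (12, 9)):
        if largest_abs <= limit:
            return book
    return 11

def signal_sections(sections):
    """sections: list of (quantized_values, noisy_flag) pairs, one per section."""
    side_info = []
    for values, noisy in sections:
        if noisy:
            # Noisy section: no redundancy coding, only the reserved number and an energy.
            side_info.append({"codebook": NOISE_CODEBOOK,
                              "energy": sum(v * v for v in values)})
        else:
            largest = max(abs(v) for v in values)
            side_info.append({"codebook": choose_codebook(largest),
                              "values": values})
    return side_info
```

Because the noise flag travels in the field that already carries the codebook number, a decoder written this way needs no change to the section syntax; it only has to recognize one more number.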

    Method and a device for coding audio signals and a method and a device for decoding a bit stream
    5.
    Granted invention patent
    Legal status: In force

    Publication No.: US06502069B1

    Publication date: 2002-12-31

    Application No.: US09530001

    Filing date: 2000-04-20

    IPC classification: G10L19/12

    CPC classification: H04B1/665 H04B14/046

    Abstract: The present invention permits a combination of a scalable audio coder with the TNS technique. In a method for coding time signals sampled at a first sampling rate, second time signals are first generated whose sampling rate is lower than the first sampling rate. The second time signals are then coded according to a first coding algorithm and written into a bit stream. The coded second time signals are, however, decoded again, and, like the first time signals, transformed into the frequency domain. From a spectral representation of the first time signals, TNS prediction coefficients are calculated. The transformed output signal of the coder/decoder with the first coding algorithm, like the spectral representation of the first time signal, undergoes a prediction over the frequency to obtain residual spectral values for both signals, though only the prediction coefficients calculated on the basis of the first time signals are used. These two signals are evaluated against each other. The evaluated residual spectral values are then coded by means of a second coding algorithm to obtain coded evaluated residual spectral values, which, together with the side information containing the calculated prediction coefficients, are written into the bit stream.

    Method and device for processing time-discrete audio sampled values
    6.
    Granted invention patent
    Legal status: In force

    Publication No.: US07512539B2

    Publication date: 2009-03-31

    Application No.: US10479398

    Filing date: 2002-05-28

    IPC classification: G06F17/14 G10L19/00

    CPC classification: G10L19/0212 G06F17/147

    Abstract: An integer transform, which provides integer output values, carries out the time-domain aliasing cancellation (TDAC) function of a modified discrete cosine transform (MDCT) in the time domain before the forward transform. In overlapping windows, this results in a Givens rotation which may be represented by lifting matrices, wherein time-discrete sampled values of an audio signal may at first be summed up on a pair-wise basis to build a vector so as to be sequentially provided with a lifting matrix. After each multiplication of a vector by a lifting matrix, a rounding step is carried out such that, on the output side, only integers will result. By transforming the windowed integer sampled value with an integer transform, a spectral representation with integer spectral values may be obtained. The inverse mapping with an inverse rotation matrix and corresponding inverse lifting matrices results in an exact reconstruction.
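
The core mechanism, a Givens rotation factored into three lifting steps with a rounding after each step, can be shown on a single pair of integer samples. The sketch below is a generic illustration of that factorization under the assumption that `sin(angle)` is non-zero; the function names are made up for the example and the window-dependent choice of angles is not covered.

```python
import math

def lifting_rotate(x1, x2, angle):
    """Rotate the integer pair (x1, x2) by `angle` using three rounded lifting steps."""
    c = (math.cos(angle) - 1.0) / math.sin(angle)  # shear factor of steps 1 and 3
    s = math.sin(angle)                            # shear factor of step 2
    x1 += round(c * x2)
    x2 += round(s * x1)
    x1 += round(c * x2)
    return x1, x2

def lifting_rotate_inverse(y1, y2, angle):
    """Undo lifting_rotate exactly by running the steps backwards with subtraction."""
    c = (math.cos(angle) - 1.0) / math.sin(angle)
    s = math.sin(angle)
    y1 -= round(c * y2)
    y2 -= round(s * y1)
    y1 -= round(c * y2)
    return y1, y2
```

Each step changes only one component by a rounded function of the other, so the inverse can recompute the same rounded quantity and subtract it. That is why the reconstruction is exact even though the forward result only approximates the true rotation.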

    Method and device for detecting a transient in a discrete-time audio signal
    7.
    Granted invention patent
    Legal status: In force

    Publication No.: US06826525B2

    Publication date: 2004-11-30

    Application No.: US10183139

    Filing date: 2002-06-25

    IPC classification: G10L19/00

    CPC classification: H04B1/665

    Abstract: A method for detecting a transient in a discrete-time audio signal is performed completely in the time domain and includes the step of segmenting the discrete-time audio signal so as to generate consecutive segments of the same length with unfiltered discrete-time audio signals xs(T−1). The discrete-time audio signal in a current segment is subsequently filtered. Then either the energy of the filtered discrete-time audio signal in the current segment can be compared with the energy of the filtered discrete-time audio signal in a preceding segment, or a current relationship between the energy of the filtered discrete-time audio signal in the current segment and the energy of the unfiltered discrete-time audio signal in the current segment can be formed and this current relationship compared with a preceding corresponding relationship. On the basis of the one and/or the other of these comparisons, it is detected whether a transient is present in the discrete-time audio signal.
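
A time-domain detector along these lines might look like the sketch below. The first-difference high-pass filter, the segment length, and the two jump thresholds are assumptions picked for illustration; the abstract does not specify the filter or the decision thresholds.

```python
def highpass(segment, previous_sample):
    """First-difference filter, used here as a stand-in for the unspecified filter."""
    out = []
    last = previous_sample
    for x in segment:
        out.append(x - last)
        last = x
    return out

def energy(samples):
    return sum(x * x for x in samples)

def detect_transients(signal, segment_len=8, energy_jump=4.0, ratio_jump=4.0):
    """Return one boolean per full segment: True where a transient is flagged."""
    floor = 1e-12  # keeps the comparison meaningful after a silent segment
    flags = []
    prev_filtered = None
    prev_ratio = None
    last_sample = 0.0
    for start in range(0, len(signal) - segment_len + 1, segment_len):
        segment = signal[start:start + segment_len]
        filtered = highpass(segment, last_sample)
        last_sample = segment[-1]
        e_filtered = energy(filtered)
        e_unfiltered = energy(segment)
        ratio = e_filtered / e_unfiltered if e_unfiltered > 0 else 0.0
        # Criterion 1: filtered energy jumps relative to the preceding segment.
        by_energy = (prev_filtered is not None
                     and e_filtered > energy_jump * max(prev_filtered, floor))
        # Criterion 2: filtered/unfiltered ratio jumps relative to the preceding one.
        by_ratio = (prev_ratio is not None
                    and ratio > ratio_jump * max(prev_ratio, floor))
        flags.append(by_energy or by_ratio)
        prev_filtered, prev_ratio = e_filtered, ratio
    return flags
```

For example, a constant low-level signal followed by a burst of alternating full-scale samples is flagged only in the segment where the burst starts, since both the filtered energy and the energy ratio rise sharply there.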

    Method and device for detecting a transient in a discrete-time audiosignal
    8.
    Granted invention patent
    Legal status: In force

    Publication No.: US06453282B1

    Publication date: 2002-09-17

    Application No.: US09424596

    Filing date: 1999-11-24

    IPC classification: G10L19/00

    CPC classification: H04B1/665

    Abstract: A method for detecting a transient in a discrete-time audio signal is performed completely in the time domain and includes the step of segmenting the discrete-time audio signal so as to generate consecutive segments of the same length with unfiltered discrete-time audio signals. The discrete-time audio signal in a current segment is filtered. Either the energy of the filtered discrete-time audio signal in the current segment is compared with the energy of the filtered discrete-time audio signal in a preceding segment, or a current relationship between the energy of the filtered discrete-time audio signal in the current segment and the energy of the unfiltered discrete-time audio signal in the current segment is formed and this current relationship is compared with a preceding corresponding relationship. Whether a transient is present in the discrete-time audio signal is detected using one and/or the other of these comparisons.

    Method for masking defects in a stream of audio data
    9.
    Granted invention patent
    Legal status: In force

    Publication No.: US06421802B1

    Publication date: 2002-07-16

    Application No.: US09331697

    Filing date: 1999-06-23

    IPC classification: G10L19/00

    Abstract: In a method for concealing errors in an audio data stream, the occurrence of an error is detected in the audio data stream, the audio data prior to the occurrence of the error being intact audio data. Thereafter, a spectral energy of a subgroup of the intact audio data is calculated. After forming a pattern for substitute data on the basis of the spectral energy calculated for the subgroup of the intact audio data, substitute data for erroneous or missing audio data which correspond to the subgroup are created on the basis of the pattern.
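
One way to picture the concealment steps is the sketch below, which works on frames of spectral values. The use of `None` to mark a damaged frame, the averaging over the last `history` intact frames, and the random substitute values scaled to the target band energy are all assumptions for illustration. The frame length is assumed to be a multiple of `band_size`.

```python
import math
import random

def band_energies(spectrum, band_size):
    """Spectral energy of each subgroup (band) of one frame."""
    return [sum(x * x for x in spectrum[i:i + band_size])
            for i in range(0, len(spectrum), band_size)]

def energy_pattern(intact_frames, band_size):
    """Pattern for substitute data: mean band energy over the given intact frames."""
    per_frame = [band_energies(frame, band_size) for frame in intact_frames]
    return [sum(column) / len(column) for column in zip(*per_frame)]

def substitute_frame(pattern, band_size, rng):
    """Create random spectral values whose band energies match the pattern."""
    frame = []
    for target in pattern:
        raw = [rng.uniform(-1.0, 1.0) for _ in range(band_size)]
        raw_energy = sum(x * x for x in raw)
        gain = math.sqrt(target / raw_energy) if raw_energy > 0 else 0.0
        frame.extend(x * gain for x in raw)
    return frame

def conceal(frames, frame_len, band_size=4, history=2, seed=0):
    """Replace frames marked None (erroneous or missing) with substitute data."""
    rng = random.Random(seed)
    intact = []
    output = []
    for frame in frames:
        if frame is None:
            if intact:
                pattern = energy_pattern(intact[-history:], band_size)
                frame = substitute_frame(pattern, band_size, rng)
            else:
                frame = [0.0] * frame_len  # nothing intact yet to build a pattern from
        else:
            intact.append(frame)
        output.append(frame)
    return output
```

Only intact frames feed the pattern, so a long run of lost frames keeps reusing the last reliable energy estimate instead of drifting on its own substitutes.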

    Method and apparatus for producing a fingerprint, and method and apparatus for identifying an audio signal
    10.
    Granted invention patent
    Legal status: In force

    Publication No.: US07460994B2

    Publication date: 2008-12-02

    Application No.: US10483452

    Filing date: 2002-06-20

    IPC classification: G10L15/00

    Abstract: For producing a fingerprint of an audio signal, use is made of information defining a plurality of predetermined fingerprint modi, all of which relate to the same type of fingerprint. The fingerprint modi, however, provide different fingerprints that differ from each other with regard to their data volume, on the one hand, and their characterizing strength for characterizing the audio signal, on the other hand. The fingerprint modi are predetermined such that a fingerprint in accordance with a fingerprint modus having a first characterizing strength is convertible to a fingerprint in accordance with a fingerprint modus having a second characterizing strength, without using the audio signal. A predetermined fingerprint modus of the plurality of predetermined fingerprint modi is set and subsequently used for computing a fingerprint using the audio signal. The convertibility of the fingerprints produced by the different fingerprint modi enables a flexible compromise to be set between the data volume and the characterizing strength for certain applications without having to regenerate a fingerprint database with each change of the fingerprint modus. Fingerprint representations scaled with regard to time or frequency may readily be converted to a different fingerprint modus.
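
The convertibility property can be illustrated with a deliberately simple, time-scaled feature. In the sketch below the fingerprint is the energy per block of samples and the "modus" is the block length; both choices, as well as the function names, are hypothetical and stand in for whatever feature a real system would use.

```python
def fingerprint(samples, block_len):
    """Energy per block of samples; block_len plays the role of the fingerprint modus."""
    return [sum(x * x for x in samples[i:i + block_len])
            for i in range(0, len(samples) - block_len + 1, block_len)]

def convert_to_coarser(fine_fingerprint, factor):
    """Derive the fingerprint of a modus with `factor` times longer blocks.

    Only the finer fingerprint is needed, not the audio signal itself.
    """
    return [sum(fine_fingerprint[i:i + factor])
            for i in range(0, len(fine_fingerprint) - factor + 1, factor)]
```

With this feature, a database stored in the finest modus can serve queries in any coarser modus: the entries are merged on the fly instead of being recomputed from the audio. The conversion only goes from the more detailed modus to the less detailed one.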
