Method and apparatus for scalable encoding and method and apparatus for scalable decoding
    1.
    Granted patent (status: in force)

    Publication number: US07343287B2

    Publication date: 2008-03-11

    Application number: US10636149

    Filing date: 2003-08-07

    IPC classification: G10L19/00

    Abstract: An apparatus for scalably encoding a spectrum of a signal that includes audio and/or video information, the spectrum comprising binary spectral values, includes a means for generating a first sub-scaling layer and a second sub-scaling layer, and a means for forming the encoded signal. The means for forming includes the first and second sub-scaling layers in the encoded signal such that they are decodable separately from each other. In contrast to a full-scaling layer, a sub-scaling layer includes the bits of a certain order for only a part of the binary spectral values in the band, so that additionally decoding a sub-scaling layer yields a more finely controllable and more finely scalable precision gain.
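
    The sub-scaling idea in this abstract can be illustrated with a minimal Python sketch. It assumes non-negative integer spectral values and an interleaved split of each bit plane into `n_sub` parts; the function names, the interleaving rule and the sample band are hypothetical and are not taken from the patent.

```python
from typing import Dict, List, Tuple

Layers = Dict[Tuple[int, int], List[int]]

def encode_sub_layers(band: List[int], n_bits: int, n_sub: int) -> Layers:
    """Map (bit order, sub-layer index) to the bits of that order
    for every n_sub-th spectral value of the band."""
    layers: Layers = {}
    for order in range(n_bits - 1, -1, -1):      # most significant plane first
        for k in range(n_sub):
            layers[(order, k)] = [(v >> order) & 1 for v in band[k::n_sub]]
    return layers

def decode_sub_layers(layers: Layers, band_len: int, n_sub: int) -> List[int]:
    """Rebuild the band from whichever sub-layers arrived; missing bits stay 0."""
    out = [0] * band_len
    for (order, k), bits in layers.items():
        for j, bit in enumerate(bits):
            out[k + j * n_sub] |= bit << order
    return out

band = [5, 3, 6, 1]                               # 3-bit magnitudes (made up)
layers = encode_sub_layers(band, n_bits=3, n_sub=2)
full = decode_sub_layers(layers, len(band), 2)

# Drop one sub-layer of the least significant plane: only the values at
# indices 1 and 3 lose their last bit, the others keep full precision.
partial = {key: bits for key, bits in layers.items() if key != (0, 1)}
coarse = decode_sub_layers(partial, len(band), 2)
```

    Because every sub-layer is stored under its own key, a decoder can stop after any of them, which is the finer precision step the abstract describes.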

    Apparatus and method for coding a time-discrete audio signal to obtain coded audio data and for decoding coded audio data
    2.
    Granted patent (status: in force)

    Publication number: US07275036B2

    Publication date: 2007-09-25

    Application number: US10966780

    Filing date: 2004-10-15

    IPC classification: G10L19/00

    Abstract: A time-discrete audio signal is processed to provide a quantization block with quantized spectral values. In addition, an integer spectral representation is generated from the time-discrete audio signal using an integer transform algorithm. The quantization block, which was generated using a psychoacoustic model, is inversely quantized and rounded, and a difference is then formed between the integer spectral values and the inversely quantized, rounded spectral values. After decoding, the quantization block alone provides a lossy, psychoacoustically coded audio signal, whereas the quantization block together with the combination block provides a lossless or almost lossless decoded audio signal. Generating the differential signal in the frequency domain results in a simpler coder/decoder structure.
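
    The lossy-plus-difference structure can be sketched as follows. This is only an illustration under stated assumptions: the integer spectral values are taken as given (no integer transform is implemented), and a uniform quantizer with a made-up step size stands in for the psychoacoustically controlled quantizer. The names `encode`, `decode` and `step` are hypothetical.

```python
from typing import List, Optional, Tuple

def encode(int_spectrum: List[int], step: float) -> Tuple[List[int], List[int]]:
    """Return (lossy quantization indices, integer difference values)."""
    q = [round(v / step) for v in int_spectrum]      # lossy layer
    recon = [round(qi * step) for qi in q]           # inverse quantize and round
    diff = [v - r for v, r in zip(int_spectrum, recon)]
    return q, diff

def decode(q: List[int], step: float,
           diff: Optional[List[int]] = None) -> List[int]:
    """Lossy decode from q alone, exact decode when diff is also available."""
    recon = [round(qi * step) for qi in q]           # same rounding as the encoder
    if diff is None:
        return recon
    return [r + d for r, d in zip(recon, diff)]

int_spectrum = [100, -37, 12, 0, 5]                  # made-up integer spectral values
q, diff = encode(int_spectrum, step=8.0)
lossy = decode(q, 8.0)
exact = decode(q, 8.0, diff)
```

    The exact reconstruction only works because encoder and decoder apply the identical inverse quantization and rounding, so the difference values cancel the quantization error sample by sample.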

    Apparatus and method for coding a time-discrete audio signal and apparatus and method for decoding coded audio data
    3.
    Patent application (status: in force)

    Publication number: US20050114126A1

    Publication date: 2005-05-26

    Application number: US10966780

    Filing date: 2004-10-15

    IPC classification: G10L19/00 G10L19/02 G10L19/14

    Abstract: A time-discrete audio signal is processed to provide a quantization block with quantized spectral values. In addition, an integer spectral representation is generated from the time-discrete audio signal using an integer transform algorithm. The quantization block, which was generated using a psychoacoustic model, is inversely quantized and rounded, and a difference is then formed between the integer spectral values and the inversely quantized, rounded spectral values. After decoding, the quantization block alone provides a lossy, psychoacoustically coded audio signal, whereas the quantization block together with the combination block provides a lossless or almost lossless decoded audio signal. Generating the differential signal in the frequency domain results in a simpler coder/decoder structure.

    Device and method for determining a coding block raster of a decoded signal
    4.
    Granted patent (status: in force)

    Publication number: US06750789B2

    Publication date: 2004-06-15

    Application number: US10168456

    Filing date: 2002-10-25

    IPC classification: H03M7/00

    CPC classification: G10L19/02

    Abstract: To determine the coding block raster on which a decoded signal is based, a segment of the decoded signal is first picked out, the segment beginning at a certain output sampling value of the decoded signal. The segment is converted into a spectral representation, which is then evaluated against a predetermined criterion to obtain an evaluation result for the segment. This procedure is repeated for a plurality of different segments, each beginning at a different output sampling value, to obtain a plurality of evaluation results. Finally, the evaluation results are searched for the one that has an extreme value compared to the others; the segment to which this evaluation result is allocated can be assumed to match the coding block raster on which the decoded signal is based. The method can determine the coding block raster of any decoded signal that carries no explicit information about its coding block raster.
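
    The search described in this abstract can be tried out on a toy example. The sketch below makes several assumptions that are not in the patent: a non-overlapping DCT block coder that simply discards the upper coefficients, a block length of 8, and "energy in the discarded coefficients" as the evaluation criterion, whose minimum marks the raster. All names are hypothetical.

```python
import math
import random

N = 8      # assumed coding block length
KEEP = 3   # assumed number of coefficients the toy coder retains per block

def dct(x):
    """Unnormalised DCT-II of one block."""
    n = len(x)
    return [sum(x[i] * math.cos(math.pi * (i + 0.5) * k / n) for i in range(n))
            for k in range(n)]

def idct(c):
    """Inverse of dct() above."""
    n = len(c)
    return [c[0] / n + (2.0 / n) * sum(c[k] * math.cos(math.pi * (i + 0.5) * k / n)
                                       for k in range(1, n))
            for i in range(n)]

def toy_codec(signal):
    """Code block by block, discarding all but the KEEP lowest coefficients."""
    out = []
    for start in range(0, len(signal), N):
        coeffs = dct(signal[start:start + N])
        out.extend(idct(coeffs[:KEEP] + [0.0] * (N - KEEP)))
    return out

def evaluate(segment):
    """Criterion: energy in the coefficients the coder would have discarded."""
    return sum(c * c for c in dct(segment)[KEEP:])

def find_raster(decoded):
    """Evaluate one segment per candidate start sample, return the minimum."""
    results = [evaluate(decoded[off:off + N]) for off in range(N)]
    return min(range(N), key=results.__getitem__)

rng = random.Random(0)
original = [rng.uniform(-1.0, 1.0) for _ in range(4 * N)]
decoded = toy_codec(original)[3:]   # drop 3 samples so the raster is "unknown"
detected = find_raster(decoded)     # block boundaries now fall at offset 5
```

    Only the segment that lines up with an original block reproduces a spectrum whose upper coefficients are numerically zero; every other start sample straddles two blocks and leaves energy there, so the extreme value identifies the raster.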

    Method and Apparatus for Introducing Information into a Data Stream and Method and Apparatus for Encoding an Audio Signal
    7.
    Patent application (status: under examination, published)

    Publication number: US20090138259A1

    Publication date: 2009-05-28

    Application number: US12366586

    Filing date: 2009-02-05

    IPC classification: G10L19/00

    Abstract: A method for introducing information into a data stream, the data stream including data about spectral values that represent a short-term spectrum of an audio signal, first processes the data stream to obtain the spectral values of the short-term spectrum. Separately, the information to be introduced is combined with a spread sequence to obtain a spread information signal. A spectral representation of the spread information signal is then generated and weighted with an established psychoacoustically maskable noise energy to produce a weighted information signal, such that the energy of the introduced information is substantially equal to or below the psychoacoustic masking threshold. The weighted information signal and the spectral values of the short-term spectrum are then summed and processed again to obtain a processed data stream that includes both the audio information and the information to be introduced. Because the information is introduced into the data stream without changing to the time domain, the block rastering underlying the short-term spectrum is not touched, so introducing a watermark does not lead to tandem encoding effects.
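
    The spread, weight and sum steps can be sketched in a few lines of Python. The sketch is heavily simplified and rests on assumptions: the spread values are treated as if they were already a spectral representation (one value per spectral line, no transform), and the maskable noise energy is a made-up 4% of each line's energy rather than the output of a psychoacoustic model. The names `spread`, `embed` and `mask_energy` are hypothetical.

```python
import math
import random
from typing import List

def spread(bits: List[int], chips: List[int]) -> List[int]:
    """Multiply each information bit (as +1/-1) by the whole chip sequence."""
    return [(1 if b else -1) * c for b in bits for c in chips]

def embed(spectrum: List[float], spread_signal: List[int],
          mask_energy: List[float]) -> List[float]:
    """Weight each spread value by the maskable noise amplitude, then add it
    to the corresponding spectral value."""
    return [s + w * math.sqrt(m)
            for s, w, m in zip(spectrum, spread_signal, mask_energy)]

rng = random.Random(1)
chips = [rng.choice((-1, 1)) for _ in range(4)]     # pseudo-noise spread sequence
bits = [1, 0]                                       # information to introduce
spectrum = [10.0, -8.0, 6.0, -4.0, 3.0, -2.0, 1.5, -1.0]
mask_energy = [0.04 * v * v for v in spectrum]      # hypothetical masking energies

marked = embed(spectrum, spread(bits, chips), mask_energy)
added_energy = [(a - b) ** 2 for a, b in zip(marked, spectrum)]
```

    Each spectral line receives at most its own maskable energy, and the signal never leaves the spectral domain, which mirrors the abstract's point that the block rastering of the short-term spectrum is left untouched.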

    Method and Apparatus for Introducing Information into a Data Stream and Method and Apparatus for Encoding an Audio Signal

    Publication number: US20090076801A1

    Publication date: 2009-03-19

    Application number: US12238365

    Filing date: 2008-09-25

    IPC classification: G10L19/00

    Abstract: A method for introducing information into a data stream, the data stream including data about spectral values that represent a short-term spectrum of an audio signal, first processes the data stream to obtain the spectral values of the short-term spectrum. Separately, the information to be introduced is combined with a spread sequence to obtain a spread information signal. A spectral representation of the spread information signal is then generated and weighted with an established psychoacoustically maskable noise energy to produce a weighted information signal, such that the energy of the introduced information is substantially equal to or below the psychoacoustic masking threshold. The weighted information signal and the spectral values of the short-term spectrum are then summed and processed again to obtain a processed data stream that includes both the audio information and the information to be introduced. Because the information is introduced into the data stream without changing to the time domain, the block rastering underlying the short-term spectrum is not touched, so introducing a watermark does not lead to tandem encoding effects.

    Method and apparatus for introducing information into a data stream and method and apparatus for encoding an audio signal
    9.
    Granted patent (status: in force)

    Publication number: US08117027B2

    Publication date: 2012-02-14

    Application number: US12238365

    Filing date: 2008-09-25

    IPC classification: G10L19/02 H04B1/66 H04B1/69

    Abstract: Techniques for introducing information into a data stream first obtain the spectral values of the short-term spectrum of the audio signal. Separately, the information to be introduced is combined with a spread sequence to obtain a spread information signal, whereupon a spectral representation of the spread information is generated and then weighted with an established psychoacoustic maskable noise energy to generate a weighted information signal, wherein the energy of the introduced information is substantially equal to or below the psychoacoustic masking threshold. The weighted information signal and the spectral values of the short-term spectrum of the audio signal are then summed and afterwards processed again to obtain a processed data stream including audio information and the information to be introduced. Because the information is introduced without changing to the time domain, the block rastering underlying the short-term spectrum is not touched; thus, introducing a watermark will not lead to tandem encoding effects.

    Method and apparatus for introducing information into a data stream and method and apparatus for encoding an audio signal
    10.
    Granted patent (status: in force)

    Publication number: US07454327B1

    Publication date: 2008-11-18

    Application number: US10089950

    Filing date: 2000-10-05

    IPC classification: G10L19/02 H04B1/69

    Abstract: A method for introducing information into a data stream, the data stream including data about spectral values that represent a short-term spectrum of an audio signal, first processes the data stream to obtain the spectral values of the short-term spectrum. Separately, the information to be introduced is combined with a spread sequence to obtain a spread information signal. A spectral representation of the spread information signal is then generated and weighted with an established psychoacoustically maskable noise energy to produce a weighted information signal, such that the energy of the introduced information is substantially equal to or below the psychoacoustic masking threshold. The weighted information signal and the spectral values of the short-term spectrum are then summed and processed again to obtain a processed data stream that includes both the audio information and the information to be introduced. Because the information is introduced into the data stream without changing to the time domain, the block rastering underlying the short-term spectrum is not touched, so introducing a watermark does not lead to tandem encoding effects.
