Method and Apparatus for Introducing Information into a Data Stream and Method and Apparatus for Encoding an Audio Signal

    Publication No.: US20090076801A1

    Publication date: 2009-03-19

    Application No.: US12238365

    Filing date: 2008-09-25

    IPC classification: G10L19/00

    Abstract: An inventive method for introducing information into a data stream that includes data about spectral values representing a short-term spectrum of an audio signal first processes the data stream to obtain the spectral values of the short-term spectrum. The information to be introduced is combined with a spread sequence to obtain a spread information signal, and a spectral representation of the spread information signal is generated. This representation is then weighted with an established psychoacoustically maskable noise energy to generate a weighted information signal, so that the energy of the introduced information is substantially equal to or below the psychoacoustic masking threshold. The weighted information signal and the spectral values of the short-term spectrum of the audio signal are then summed and processed again to obtain a processed data stream including both the audio information and the information to be introduced. Because the information is introduced into the data stream without changing to the time domain, the block raster underlying the short-term spectrum is not touched, so that introducing a watermark does not lead to tandem encoding effects.
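
The embedding steps in this abstract (spread, weight by the maskable noise energy, sum with the spectral values) can be illustrated with a minimal Python sketch. It is not the patented implementation: the function names, the seed-driven chip generator and the sample values are hypothetical, and the chips are treated directly as the spectral representation of the spread signal, which is a simplification.

```python
import random

def spread_bits(bits, chips_per_bit, seed=0):
    """Spread each payload bit over a pseudo-random +/-1 chip sequence (assumed scheme)."""
    rng = random.Random(seed)
    chips = []
    for bit in bits:
        symbol = 1 if bit else -1
        chips.extend(symbol * rng.choice((-1, 1)) for _ in range(chips_per_bit))
    return chips

def embed(spectral_values, chips, masking_energy):
    """Add the chips to the spectral values, weighted by the maskable noise energy."""
    marked = []
    for value, chip, energy in zip(spectral_values, chips, masking_energy):
        amplitude = energy ** 0.5  # energy -> amplitude, so the added energy equals the threshold
        marked.append(value + chip * amplitude)
    return marked

# Hypothetical short-term spectrum and per-line maskable noise energy.
spectrum = [10.0, -4.0, 2.5, 0.5, -7.0, 3.0, 1.0, -0.25]
masking = [0.04, 0.04, 0.01, 0.01, 0.09, 0.01, 0.0025, 0.0025]

chips = spread_bits([1, 0], chips_per_bit=4, seed=42)
marked = embed(spectrum, chips, masking)
```

Everything happens on the spectral values themselves, so the block raster of the original encoder is left untouched, which is the property the abstract relies on to avoid tandem encoding effects.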

    72. METHOD AND APPARATUS FOR CONVERSION BETWEEN MULTI-CHANNEL AUDIO FORMATS
    Type: Patent application (status: in force)

    Publication No.: US20080232616A1

    Publication date: 2008-09-25

    Application No.: US11742502

    Filing date: 2007-04-30

    IPC classification: H04R5/02 H04R5/00

    Abstract: An input multi-channel representation is converted into a different output multi-channel representation of a spatial audio signal, in that an intermediate representation of the spatial audio signal is derived, the intermediate representation having direction parameters indicating a direction of origin of a portion of the spatial audio signal; and in that the output multi-channel representation of the spatial audio signal is generated using the intermediate representation of the spatial audio signal.
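
As a rough illustration of converting through a direction-based intermediate representation, the Python sketch below estimates one direction of origin for a signal portion from the input channel gains and re-pans that portion onto a different loudspeaker layout. The energy-weighted direction estimate and the sine/cosine panning law are assumptions chosen for brevity, not the method claimed in the patent; the function names and layouts are hypothetical, and the output azimuths are assumed to be sorted in ascending order.

```python
import math

def estimate_direction(gains, azimuths_deg):
    """Energy-weighted mean direction (degrees) of one signal portion."""
    x = sum(g * g * math.cos(math.radians(a)) for g, a in zip(gains, azimuths_deg))
    y = sum(g * g * math.sin(math.radians(a)) for g, a in zip(gains, azimuths_deg))
    return math.degrees(math.atan2(y, x))

def pan_to_layout(direction_deg, energy, out_azimuths_deg):
    """Distribute the energy over the two output loudspeakers enclosing the direction."""
    # Clamp the direction into the arc covered by the (ascending) output layout.
    d = min(max(direction_deg, out_azimuths_deg[0]), out_azimuths_deg[-1])
    gains = [0.0] * len(out_azimuths_deg)
    for i in range(len(out_azimuths_deg) - 1):
        a, b = out_azimuths_deg[i], out_azimuths_deg[i + 1]
        if a <= d <= b:
            t = (d - a) / (b - a)  # 0 at loudspeaker i, 1 at loudspeaker i + 1
            gains[i] = math.sqrt(energy) * math.cos(t * math.pi / 2)
            gains[i + 1] = math.sqrt(energy) * math.sin(t * math.pi / 2)
            break
    return gains

# Stereo input (left at +30 degrees, right at -30 degrees) with equal gains.
in_gains = [1.0, 1.0]
direction = estimate_direction(in_gains, [30.0, -30.0])
energy = sum(g * g for g in in_gains)

# Hypothetical three-loudspeaker output layout: right, center, left.
out_gains = pan_to_layout(direction, energy, [-30.0, 0.0, 30.0])
```

A phantom source between two stereo loudspeakers ends up on the center loudspeaker of the output layout, with the total energy preserved.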

    73. Watermark Embedding
    Type: Patent application (status: in force)

    Publication No.: US20080027729A1

    Publication date: 2008-01-31

    Application No.: US11554492

    Filing date: 2006-10-30

    IPC classification: G10L21/00

    CPC classification: H04H20/31

    Abstract: According to an inventive scheme for introducing a watermark into an information signal, the information signal is first transferred from a time representation to a spectral/modulation spectral representation. The information signal is then manipulated in the spectral/modulation spectral representation in dependence on the watermark to be introduced to obtain a modified spectral/modulation spectral representation, and subsequently an information signal provided with a watermark is formed based on the modified spectral/modulation spectral representation. An advantage is that, because the watermark is embedded and/or derived in the spectral/modulation spectral representation or range, traditional correlation attacks, as used against watermark methods based on spread-band modulation, cannot easily succeed.

    74. Enhanced Method for Signal Shaping in Multi-Channel Audio Reconstruction
    Type: Patent application (status: in force)

    Publication No.: US20070236858A1

    Publication date: 2007-10-11

    Application No.: US11384000

    Filing date: 2006-05-18

    IPC classification: H01G4/255

    Abstract: The present invention is based on the finding that a reconstructed output channel, reconstructed with a multi-channel reconstructor using at least one downmix channel derived by downmixing a plurality of original channels and using a parameter representation including additional information on a temporal fine structure of an original channel, can be reconstructed efficiently with high quality when a generator for generating a direct signal component and a diffuse signal component based on the downmix channel is used. The quality can be substantially enhanced if only the direct signal component is modified such that the temporal fine structure of the reconstructed output channel fits a desired temporal fine structure, as indicated by the transmitted additional information on the temporal fine structure.

    75. Apparatus and method for coding a time-discrete audio signal to obtain coded audio data and for decoding coded audio data
    Type: Granted patent (status: in force)

    Publication No.: US07275036B2

    Publication date: 2007-09-25

    Application No.: US10966780

    Filing date: 2004-10-15

    IPC classification: G10L19/00

    Abstract: A time-discrete audio signal is processed to provide a quantization block with quantized spectral values. Furthermore, an integer spectral representation is generated from the time-discrete audio signal using an integer transform algorithm. The quantization block, which has been generated using a psychoacoustic model, is inversely quantized and rounded, and a difference is then formed between the integer spectral values and the inversely quantized, rounded spectral values. The quantization block alone provides a lossy psychoacoustically coded/decoded audio signal after the decoding, whereas the quantization block, together with the combination block, provides a lossless or almost lossless coded and again decoded audio signal in the decoding. By generating the differential signal in the frequency domain, a simpler coder/decoder structure results.
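
The layered scheme in this abstract can be sketched in a few lines of Python: quantize the integer spectrum, inversely quantize and round it, and keep the difference as a second layer. This is only an illustration under stated assumptions: a single uniform step size stands in for the psychoacoustic model, the integer transform is assumed to have already produced `int_spectrum`, and all names are hypothetical.

```python
def encode(int_spectrum, step):
    """Return the lossy layer and the frequency-domain residual."""
    quantized = [round(v / step) for v in int_spectrum]       # lossy quantization block
    reconstructed = [round(q * step) for q in quantized]      # inverse quantize, then round
    residual = [v - r for v, r in zip(int_spectrum, reconstructed)]
    return quantized, residual

def decode(quantized, step, residual=None):
    """Decode the lossy layer alone, or add the residual for exact reconstruction."""
    approx = [round(q * step) for q in quantized]
    if residual is None:
        return approx
    return [a + r for a, r in zip(approx, residual)]

# Hypothetical integer spectral values and step size.
int_spectrum = [120, -37, 5, 0, -3, 64]
step = 8.0

quantized, residual = encode(int_spectrum, step)
lossy = decode(quantized, step)
lossless = decode(quantized, step, residual)
```

Because the decoder repeats exactly the same inverse quantization and rounding as the encoder, adding the residual restores the integer spectrum bit for bit, while the quantized layer on its own still decodes to a lossy approximation.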

    76. Temporal and spatial shaping of multi-channel audio signals
    Type: Patent application (status: in force)

    Publication No.: US20070081597A1

    Publication date: 2007-04-12

    Application No.: US11363985

    Filing date: 2006-02-27

    IPC classification: H04B14/04

    CPC classification: G10L19/008 H04S3/008

    Abstract: A selected channel of a multi-channel signal which is represented by frames composed of sampling values having a high time resolution can be encoded with higher quality when a wave form parameter representation representing a wave form of an intermediate resolution representation of the selected channel is derived, the wave form parameter representation including a sequence of intermediate wave form parameters having a time resolution lower than the high time resolution of the sampling values and higher than a time resolution defined by a frame repetition rate. The wave form parameter representation with the intermediate resolution can be used to shape a reconstructed channel to retrieve a channel having a signal envelope close to that of the selected original channel. The time scale on which the shaping is performed is shorter than the time scale of a framewise processing, thus enhancing the quality of the reconstructed channel. On the other hand, the shaping time scale is longer than the time scale of the sampling values, significantly reducing the amount of data needed by the wave form parameter representation.
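
A minimal Python sketch of the intermediate-resolution idea: the encoder derives one envelope parameter per time slot (several slots per frame, many samples per slot), and the decoder rescales each slot of the reconstructed channel to match. Using an RMS value as the wave form parameter, the slot length and the sample values are assumptions made for illustration only; the signal length is assumed to be a multiple of the slot length.

```python
import math

def envelope(samples, slot_len):
    """One RMS parameter per slot of slot_len samples (intermediate time resolution)."""
    return [
        math.sqrt(sum(s * s for s in samples[i:i + slot_len]) / slot_len)
        for i in range(0, len(samples), slot_len)
    ]

def shape(reconstructed, target_env, slot_len, eps=1e-12):
    """Scale each slot of the reconstructed channel to the transmitted envelope."""
    current_env = envelope(reconstructed, slot_len)
    shaped = []
    for k, (cur, tgt) in enumerate(zip(current_env, target_env)):
        gain = tgt / max(cur, eps)  # eps avoids division by zero for silent slots
        shaped.extend(s * gain for s in reconstructed[k * slot_len:(k + 1) * slot_len])
    return shaped

# One hypothetical frame of 8 samples, described by 4 parameters (slot length 2).
original = [0.0, 0.1, 0.9, -0.8, 0.2, -0.1, 0.0, 0.05]
reconstructed = [0.3, -0.3, 0.3, -0.3, 0.3, -0.3, 0.3, -0.3]

params = envelope(original, slot_len=2)
shaped = shape(reconstructed, params, slot_len=2)
```

Four parameters per frame are far fewer than the eight sample values, yet they restore the transient in the second slot that a single per-frame gain would smear across the whole frame.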

    77. Frequency-based coding of channels in parametric multi-channel coding systems
    Type: Patent application (status: in force)

    Publication No.: US20050195981A1

    Publication date: 2005-09-08

    Application No.: US10827900

    Filing date: 2004-04-20

    IPC classification: G10L19/00 H04S3/00 H04R5/00

    Abstract: For a multi-channel audio signal, parametric coding is applied to different subsets of audio input channels for different frequency regions. For example, for a 5.1 surround sound signal having five regular channels and one low-frequency (LFE) channel, binaural cue coding (BCC) can be applied to all six audio channels for sub-bands at or below a specified cut-off frequency, but to only five audio channels (excluding the LFE channel) for sub-bands above the cut-off frequency. Such frequency-based coding of channels can reduce the encoding and decoding processing loads and/or size of the encoded audio bitstream relative to parametric coding techniques that are applied to all input channels over the entire frequency range.
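
The channel selection described here can be sketched as a small Python function that returns the channel subset for a sub-band, plus a rough count of the cue sets that would be transmitted. The channel labels, the 120 Hz cut-off, the band centers and the assumption of one cue set per non-reference channel are all hypothetical and chosen only to make the saving visible.

```python
CHANNELS = ["L", "R", "C", "Ls", "Rs", "LFE"]

def channels_for_band(band_center_hz, cutoff_hz):
    """All six channels at or below the cut-off, five (no LFE) above it."""
    if band_center_hz <= cutoff_hz:
        return list(CHANNELS)
    return [c for c in CHANNELS if c != "LFE"]

def cue_count(band_centers_hz, cutoff_hz):
    """Number of cue sets, assuming one per non-reference channel in each band."""
    return sum(len(channels_for_band(f, cutoff_hz)) - 1 for f in band_centers_hz)

# Hypothetical sub-band centers in Hz and a hypothetical 120 Hz cut-off.
bands = [50, 100, 200, 400, 800, 1600]
selective = cue_count(bands, cutoff_hz=120)
full_range = cue_count(bands, cutoff_hz=float("inf"))
```

With these numbers the frequency-based scheme needs 26 cue sets instead of 30, since the LFE channel is dropped from the four sub-bands above the cut-off.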

    78. Device and method for analysing a decoded time signal
    Type: Patent application (status: in force)

    Publication No.: US20050175252A1

    Publication date: 2005-08-11

    Application No.: US10220651

    Filing date: 2001-02-16

    Abstract: An apparatus analyzes an analysis time signal that has been generated by encoding and decoding an original time signal according to an encoding algorithm. First, the encoding block raster used by the encoding algorithm and underlying the analysis time signal is determined. The analysis time signal is then converted, using the established encoding block raster, from its time representation to a spectral representation comprising a plurality of analysis spectral coefficients. Next, at least two analysis spectral coefficients, or at least two spectral coefficients derived from them by multiplication with an encoding amplification factor or with a compression function, are grouped. The greatest common divisor of the grouped coefficients is then calculated; it corresponds to the quantization step width used for quantization in the encoding algorithm, or to an integer multiple of it. In the case of an audio signal, the scale factor for this group of spectral coefficients, i.e. for a scale factor band, can then easily be established from the quantization step width. Thus, all parameters used for the quantization of the original time signal are known, so that full iteration loops, which are very computationally intensive and introduce tandem encoding distortions, no longer have to be performed when quantizing the analysis time signal.
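
The greatest-common-divisor step can be illustrated with a short Python sketch. It assumes noise-free coefficients that are exact multiples of the step width, and it maps them onto an integer grid (the hypothetical `resolution` parameter) so that an integer GCD can be used; real decoded signals would need a tolerance-based search instead.

```python
from functools import reduce
from math import gcd

def estimate_step(coefficients, resolution=1000):
    """Estimate the quantization step width as the GCD of the coefficient magnitudes."""
    scaled = [abs(round(c * resolution)) for c in coefficients]
    nonzero = [s for s in scaled if s]
    if not nonzero:
        return None  # an all-zero group carries no step information
    return reduce(gcd, nonzero) / resolution

# Hypothetical scale factor band quantized with step width 0.25.
true_step = 0.25
band = [index * true_step for index in (3, -5, 0, 8)]
estimated = estimate_step(band)

# If all quantization indices share a factor, only a multiple of the step is found.
even_band = [index * true_step for index in (2, 4, -6)]
multiple = estimate_step(even_band)
```

The second group shows the case the abstract mentions: when every quantization index in the group is even, the GCD yields 0.5, an integer multiple of the true step width of 0.25.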

    79. Method and apparatus for conversion between multi-channel audio formats
    Type: Granted patent (status: in force)

    Publication No.: US08908873B2

    Publication date: 2014-12-09

    Application No.: US12530645

    Filing date: 2008-02-01

    IPC classification: H04R5/00

    Abstract: An input multi-channel representation is converted into a different output multi-channel representation of a spatial audio signal, in that an intermediate representation of the spatial audio signal is derived, the intermediate representation having direction parameters indicating a direction of origin of a portion of the spatial audio signal; and in that the output multi-channel representation of the spatial audio signal is generated using the intermediate representation of the spatial audio signal.
