MULTI-MODE AUDIO SIGNAL DECODER, MULTI-MODE AUDIO SIGNAL ENCODER, METHODS AND COMPUTER PROGRAM USING A LINEAR-PREDICTION-CODING BASED NOISE SHAPING
    81.
    发明申请
    MULTI-MODE AUDIO SIGNAL DECODER, MULTI-MODE AUDIO SIGNAL ENCODER, METHODS AND COMPUTER PROGRAM USING A LINEAR-PREDICTION-CODING BASED NOISE SHAPING 有权
    多模式音频信号解码器,多模式音频信号编码器,使用基于线性预测编码的噪声形状的方法和计算机程序

    公开(公告)号:US20120245947A1

    公开(公告)日:2012-09-27

    申请号:US13441469

    申请日:2012-04-06

    IPC分类号: G10L19/00

    CPC分类号: G10L19/20 G10L19/022

    摘要: A multi-mode audio signal decoder has a spectral value determinator to obtain sets of decoded spectral coefficients for a plurality of portions of an audio content and a spectrum processor configured to apply a spectral shaping to a set of spectral coefficients in dependence on a set of linear-prediction-domain parameters for a portion of the audio content encoded in a linear-prediction mode, and in dependence on a set of scale factor parameters for a portion of the audio content encoded in a frequency-domain mode. The audio signal decoder has a frequency-domain-to-time-domain converter configured to obtain a time-domain audio representation on the basis of a spectrally-shaped set of decoded spectral coefficients for a portion of the audio content encoded in the linear-prediction mode and for a portion of the audio content encoded in the frequency domain mode. An audio signal encoder is also described.

    摘要翻译: 多模式音频信号解码器具有频谱值确定器,以获得用于音频内容的多个部分的解码频谱系数集合;以及频谱处理器,被配置为将频谱整形应用于一组频谱系数 用于以线性预测模式编码的音频内容的一部分的线性预测域参数,并且依赖于以频域模式编码的音频内容的一部分的一组比例因子参数。 音频信号解码器具有频域 - 时域转换器,其被配置为基于频谱形式的解码频谱系数集来获得时域音频表示,所述解码频谱系数被编码在线性 - 预测模式和频域模式中编码的音频内容的一部分。 还描述了音频信号编码器。

    Apparatus and Method for Synchronizing Additional Data and Base Data
    82.
    发明申请
    Apparatus and Method for Synchronizing Additional Data and Base Data 审中-公开
    用于同步附加数据和基本数据的装置和方法

    公开(公告)号:US20110282471A1

    公开(公告)日:2011-11-17

    申请号:US13190221

    申请日:2011-07-25

    IPC分类号: G06F17/00

    摘要: For adding additional data, such as multi-channel extension data, to base data, such as conventional stereo data, a test fingerprint of test data relating to a test time instant of the test data is provided. The test data equals the additional data or the base data or depends on the additional data or the base data in parametric manner. Using the test fingerprint, reference time instant information is determined, which depends on a reference time instant in reference data, the reference data being the conventional stereo data. Finally, the additional data or the base data is manipulated, namely using the reference time instant information and the test time instant information, to obtain manipulated data, by which synchronous reproduction of the data information can be performed. Thus, a robust and flexible possibility for synchronous, especially late extension of base data by additional data is obtained.

    摘要翻译: 为了将诸如多通道扩展数据的附加数据添加到诸如常规立体数据的基础数据,提供了与测试数据的测试时刻有关的测试数据的测试指纹。 测试数据等于附加数据或基本数据,或者以参数方式取决于附加数据或基本数据。 使用测试指纹,确定参考时刻信息,其取决于参考数据中的参考时刻,参考数据是常规立体声数据。 最后,使用参考时刻信息和测试时刻信息来操纵附加数据或基本数据,以获得可以执行数据信息的同步再现的操纵数据。 因此,获得了通过附加数据同步,特别是延迟基础数据的鲁棒且灵活的可能性。

    Frequency-based coding of channels in parametric multi-channel coding systems
    83.
    发明授权
    Frequency-based coding of channels in parametric multi-channel coding systems 有权
    参数化多通道编码系统中频道的频率编码

    公开(公告)号:US07805313B2

    公开(公告)日:2010-09-28

    申请号:US10827900

    申请日:2004-04-20

    IPC分类号: G10L19/00 H04R5/00

    摘要: For a multi-channel audio signal, parametric coding is applied to different subsets of audio input channels for different frequency regions. For example, for a 5.1 surround sound signal having five regular channels and one low-frequency (LFE) channel, binaural cue coding (BCC) can be applied to all six audio channels for sub-bands at or below a specified cut-off frequency, but to only five audio channels (excluding the LFE channel) for sub-bands above the cut-off frequency. Such frequency-based coding of channels can reduce the encoding and decoding processing loads and/or size of the encoded audio bitstream relative to parametric coding techniques that are applied to all input channels over the entire frequency range.

    摘要翻译: 对于多声道音频信号,参数编码被应用于不同频率区域的音频输入通道的不同子集。 例如,对于具有五个常规频道和一个低频(LFE)频道的5.1环绕声信号,可以将双耳提示编码(BCC)应用于所有六个音频通道,用于等于或小于指定截止频率的子频带 ,但对于截止频率以上的子频带,只有五个音频通道(不包括LFE通道)。 通道的这种基于频率的编码可以相对于在整个频率范围上应用于所有输入通道的参数编码技术来减少编码和解码处理负载和/或编码音频比特流的大小。

    Method and Apparatus for Conversion Between Multi-Channel Audio Formats
    84.
    发明申请
    Method and Apparatus for Conversion Between Multi-Channel Audio Formats 有权
    多声道音频格式转换的方法和装置

    公开(公告)号:US20100166191A1

    公开(公告)日:2010-07-01

    申请号:US12530645

    申请日:2008-02-01

    IPC分类号: H04R5/00

    摘要: An input multi-channel representation is converted into a different output multi-channel representation of a spatial audio signal, in that an intermediate representation of the spatial audio signal is derived, the intermediate representation having direction parameters indicating a direction of origin of a portion of the spatial audio signal; and in that the output multi-channel representation of the spatial audio signal is generated using the intermediate representation of the spatial audio signal.

    摘要翻译: 输入多声道表示被转换成空间音频信号的不同输出多声道表示,因为导出空间音频信号的中间表示,中间表示具有指示一部分的原点方向的方向参数 空间音频信号; 并且使用空间音频信号的中间表示来生成空间音频信号的输出多声道表示。

    APPARATUS FOR ENCODING AND DECODING
    85.
    发明申请
    APPARATUS FOR ENCODING AND DECODING 审中-公开
    编码和解码的装置

    公开(公告)号:US20100027625A1

    公开(公告)日:2010-02-04

    申请号:US12514629

    申请日:2007-11-16

    IPC分类号: H04N11/04

    CPC分类号: G10L19/002

    摘要: An apparatus for encoding a sequence of samples of an audio signal, with each sample within the sequence having an original position, includes a sorter for sorting the samples depending on their sizes, in order to obtain a sorted sequence of samples, with each sample having a sorting position within the sorted sequence. Furthermore, the apparatus has an encoder for encoding the sorted samples and information on a relation between the original and sorting positions of the samples.

    摘要翻译: 一种用于对音频信号的样本序列进行编码的装置,其中序列内的每个样本具有原始位置,包括用于根据其大小对样本进行分类的分类器,以便获得样本的排序顺序,每个样品具有 排序顺序中的排序位置。 此外,该装置具有用于对分类样本进行编码的编码器和关于样本的原始和分类位置之间的关系的信息。

    Compatible Multi-Channel Coding/Decoding
    86.
    发明申请
    Compatible Multi-Channel Coding/Decoding 有权
    兼容的多通道编码/解码

    公开(公告)号:US20090003612A1

    公开(公告)日:2009-01-01

    申请号:US12206778

    申请日:2008-09-09

    IPC分类号: H04R5/00

    摘要: In processing a multi-channel audio signal having at least three original channels, a first downmix channel and a second downmix channel are provided, which are derived from the original channels. For a selected original channel of the original channels, channel side information are calculated such that a downmix channel or a combined downmix channel including the first and the second downmix channels, when weighted using the channel side information, results in an approximation of the selected original channel. The channel side information and the first and second downmix channels form output data to be transmitted to a decoder, which, in case of a low level decoder only decodes the first and second downmix channels or, in case of a high level decoder provides a full multi-channel audio signal based on the downmix channels and the channel side information. Since the channel side information only occupy a low number of bits, and since the decoder does not use dematrixing, an efficient and high quality multi-channel extension for stereo players and enhanced multi-channel players is obtained.

    摘要翻译: 在处理具有至少三个原始信道的多声道音频信号时,提供从原始信道导出的第一下混通道和第二下混通道。 对于原始频道的所选择的原始频道,计算频道侧信息,使得当使用频道侧信息加权时,包括第一和第二下混频道的下混频道或组合缩混频道导致所选原稿的近似 渠道。 信道侧信息以及第一和第二下混频道形成要发送到解码器的输出数据,其在低电平解码器的情况下仅解码第一和第二下混通道,或者在高电平解码器提供满 基于下混频道的多声道音频信号和频道侧信息。 由于信道侧信息仅占用少量的比特,并且由于解码器不使用解矩阵,因此可以获得用于立体声播放器和增强型多声道播放器的有效且高质量的多声道扩展。

    Apparatus and method for processing a multi-channel signal
    87.
    发明授权
    Apparatus and method for processing a multi-channel signal 有权
    用于处理多声道信号的装置和方法

    公开(公告)号:US07340391B2

    公开(公告)日:2008-03-04

    申请号:US11464315

    申请日:2006-08-14

    IPC分类号: G10L19/04

    CPC分类号: G10L19/03 G10L19/008

    摘要: An apparatus for processing a multi-channel signal includes a means for determining a similarity between a first one of two channels and a second one of the two channels. Furthermore, a means for performing a prediction filtering of the spectral coefficients is provided, which is formed to perform a prediction filtering with only a single prediction filter for both channels in case of high similarity between the first and the second channel, and to perform a prediction filtering with two separate prediction filters in case of a dissimilarity between the first and the second channel. With this, an introduction of stereo artifacts and a deterioration of the coding gain in stereo coding techniques are avoided.

    摘要翻译: 用于处理多信道信号的装置包括用于确定两个信道中的第一个信道和两个信道中的第二信道之间的相似性的装置。 此外,提供了一种用于执行频谱系数的预测滤波的装置,其被形成为在第一和第二信道之间具有高相似性的情况下仅对两个信道执行预测滤波,并且执行 在第一和第二信道之间具有不相似性的情况下,使用两个单独的预测滤波器进行预测滤波。 由此,避免立体声伪影的引入和立体声编码技术中的编码增益的恶化。

    Device and Method for Analyzing an Information Signal
    88.
    发明申请
    Device and Method for Analyzing an Information Signal 有权
    用于分析信息信号的装置和方法

    公开(公告)号:US20070127717A1

    公开(公告)日:2007-06-07

    申请号:US11557023

    申请日:2006-11-06

    IPC分类号: H04N7/167

    CPC分类号: G10L25/48

    摘要: For analyzing an information signal having a sequence of blocks of information units, wherein a plurality of consecutive blocks of the sequence of blocks represents an information entity, using a sequence of fingerprints for the sequence of blocks, identification results are provided for consecutive fingerprints, wherein an identification result represents an association of a block of information units with a predetermined information entity. Then at least two hypotheses are formed from the identification results for the consecutive fingerprints, wherein a first hypothesis is an assumption for the association of the sequence of blocks with a first information entity, and wherein the second hypothesis is an assumption for the association of the sequence of blocks with the second information entity. Then various hypotheses are examined to obtain an examination result on the basis of which there is then made a statement on the information signal. This achieves a meaningful and reliable time-continuous analysis of an information signal.

    摘要翻译: 为了分析具有信息单元块序列的信息信号,其中块序列的多个连续块表示信息实体,使用块序列的指纹序列,为连续指纹提供识别结果,其中 识别结果表示信息单元块与预定信息实体的关联。 然后,从连续指纹的识别结果形成至少两个假设,其中第一假设是块的序列与第一信息实体的关联的假设,并且其中第二假设是关于 具有第二信息实体的块序列。 然后检查各种假设以获得检查结果,然后根据该结果对信息信号进行声明。 这实现了对信息信号的有意义和可靠的时间连续分析。

    Method and device for error concealment in an encoded audio-signal and method and device for decoding an encoded audio signal
    89.
    发明授权
    Method and device for error concealment in an encoded audio-signal and method and device for decoding an encoded audio signal 有权
    用于编码音频信号中的错误隐藏的方法和装置,以及用于对编码的音频信号进行解码的方法和装置

    公开(公告)号:US07003448B1

    公开(公告)日:2006-02-21

    申请号:US09980534

    申请日:2000-04-12

    IPC分类号: G10L19/00

    CPC分类号: G10L19/005

    摘要: In a method for concealing an error in an encoded audio signal a set of spectral coefficients is subdivided into at least two sub-bands (14), whereupon the sub-bands are subjected to a re-verse transform (16). A specific prediction is performed (18) for each quasi time signal of a sub-band to obtain an estimated temporal representation for a sub-band of a set of spectral coefficients following the current set. A forward transform (20) of the time signal of each sub-band provides estimated spectral coefficients which can be used (28) instead of erroneous spectral coefficients of a following set of spectral coefficients, e.g. in order to conceal transmission errors. Transforming at the sub-band level provides independence from transform characteristics such as block length, window type and MDCT algorithm while at the same time preserving spectral processing for error concealment. Thus the spectral characteristics of audio signals can also be taken into account during error concealment.

    摘要翻译: 在用于隐藏编码音频信号中的错误的方法中,一组频谱系数被细分为至少两个子带(14),于是子带经受逆变换(16)。 对于子带的每个准时间信号执行特定的预测(18),以获得在当前集合之后的一组频谱系数的子带的估计时间表示。 每个子带的时间信号的正向变换(20)提供可以使用的估计的频谱系数(28),而不是以下的频谱系数集合的错误频谱系数,例如。 以掩盖传输错误。 在子带级变换提供独立于诸如块长度,窗口类型和MDCT算法的变换特征,同时保留用于错误隐藏的频谱处理。 因此,在错误隐藏期间也可以考虑音频信号的频谱特性。

    Method and device for embedding watermark information and method and device for extracting embedded watermark information
    90.
    发明申请
    Method and device for embedding watermark information and method and device for extracting embedded watermark information 有权
    用于嵌入水印信息的方法和装置以及用于提取嵌入水印信息的方法和装置

    公开(公告)号:US20050105726A1

    公开(公告)日:2005-05-19

    申请号:US10502622

    申请日:2003-02-25

    IPC分类号: G06T1/00 H04N7/167

    摘要: For embedding watermark information into an information signal including audio and/or video information, first of all a synchronization sequence with a plurality of synchronization sequence units and a data sequence with a plurality of data sequence units are provided, wherein between the data sequence and the synchronization sequence a time shift is present and wherein a degree of shifting depends on the watermark information. A combination means generates a combination sequence having a plurality of combination sequence units from the synchronization sequence and the data sequence shifted with regard to the synchronization sequence, wherein the combination sequence units are derived from synchronization sequence units and shifted data sequence units. The combination sequence is combined with the information signal in order to embed the watermark information into the information signal. A watermark extractor receives a synchronization sequence correlation peak for every data sequence correlation peak associated with the same and therefore determines the watermark information on the basis of the time interval between the synchronization sequence correlation peak and the data sequence correlation peak in a secure and robust way. The concept is robust, provides a high data rate and is simultaneously flexible with regard to the weighting of synchronization energy and data energy and with regard to the robustness on the one hand and data rate on the other hand, respectively.

    摘要翻译: 为了将水印信息嵌入到包括音频和/或视频信息的信息信号中,首先提供具有多个同步序列单元的同步序列和具有多个数据序列单元的数据序列,其中在数据序列和 同步序列存在时移,其中移位程度取决于水印信息。 组合装置根据同步序列产生具有多个组合序列单元的组合序列和相对于同步序列移位的数据序列,其中组合序列单元从同步序列单元和移位数据序列单元导出。 将组合序列与信息信号组合,以将水印信息嵌入到信息信号中。 水印提取器接收与其相关联的每个数据序列相关峰值的同步序列相关峰值,因此以安全和鲁棒的方式基于同步序列相关峰值与数据序列相关峰值之间的时间间隔来确定水印信息 。 该概念是鲁棒的,提供高数据速率,并且在同步能量和数据能量的加权方面以及另一方面一方面的鲁棒性和数据速率方面同时是灵活的。