Apparatus and method for processing a multi-channel signal
    91.
    发明授权
    Apparatus and method for processing a multi-channel signal 有权
    用于处理多声道信号的装置和方法

    公开(公告)号:US07340391B2

    公开(公告)日:2008-03-04

    申请号:US11464315

    申请日:2006-08-14

    IPC分类号: G10L19/04

    CPC分类号: G10L19/03 G10L19/008

    摘要: An apparatus for processing a multi-channel signal includes a means for determining a similarity between a first one of two channels and a second one of the two channels. Furthermore, a means for performing a prediction filtering of the spectral coefficients is provided, which is formed to perform a prediction filtering with only a single prediction filter for both channels in case of high similarity between the first and the second channel, and to perform a prediction filtering with two separate prediction filters in case of a dissimilarity between the first and the second channel. With this, an introduction of stereo artifacts and a deterioration of the coding gain in stereo coding techniques are avoided.

    摘要翻译: 用于处理多信道信号的装置包括用于确定两个信道中的第一个信道和两个信道中的第二信道之间的相似性的装置。 此外,提供了一种用于执行频谱系数的预测滤波的装置,其被形成为在第一和第二信道之间具有高相似性的情况下仅对两个信道执行预测滤波,并且执行 在第一和第二信道之间具有不相似性的情况下,使用两个单独的预测滤波器进行预测滤波。 由此,避免立体声伪影的引入和立体声编码技术中的编码增益的恶化。

    Device and Method for Analyzing an Information Signal
    92.
    发明申请
    Device and Method for Analyzing an Information Signal 有权
    用于分析信息信号的装置和方法

    公开(公告)号:US20070127717A1

    公开(公告)日:2007-06-07

    申请号:US11557023

    申请日:2006-11-06

    IPC分类号: H04N7/167

    CPC分类号: G10L25/48

    摘要: For analyzing an information signal having a sequence of blocks of information units, wherein a plurality of consecutive blocks of the sequence of blocks represents an information entity, using a sequence of fingerprints for the sequence of blocks, identification results are provided for consecutive fingerprints, wherein an identification result represents an association of a block of information units with a predetermined information entity. Then at least two hypotheses are formed from the identification results for the consecutive fingerprints, wherein a first hypothesis is an assumption for the association of the sequence of blocks with a first information entity, and wherein the second hypothesis is an assumption for the association of the sequence of blocks with the second information entity. Then various hypotheses are examined to obtain an examination result on the basis of which there is then made a statement on the information signal. This achieves a meaningful and reliable time-continuous analysis of an information signal.

    摘要翻译: 为了分析具有信息单元块序列的信息信号,其中块序列的多个连续块表示信息实体,使用块序列的指纹序列,为连续指纹提供识别结果,其中 识别结果表示信息单元块与预定信息实体的关联。 然后,从连续指纹的识别结果形成至少两个假设,其中第一假设是块的序列与第一信息实体的关联的假设,并且其中第二假设是关于 具有第二信息实体的块序列。 然后检查各种假设以获得检查结果,然后根据该结果对信息信号进行声明。 这实现了对信息信号的有意义和可靠的时间连续分析。

    Diffuse sound shaping for BCC schemes and the like
    93.
    发明申请
    Diffuse sound shaping for BCC schemes and the like 有权
    BCC方案的漫射声音整形等

    公开(公告)号:US20060085200A1

    公开(公告)日:2006-04-20

    申请号:US11006492

    申请日:2004-12-07

    IPC分类号: G10L21/00

    CPC分类号: G10L19/008 H04S3/02

    摘要: An input audio signal having an input temporal envelope is converted into an output audio signal having an output temporal envelope. The input temporal envelope of the input audio signal is characterized. The input audio signal is processed to generate a processed audio signal, wherein the processing de-correlates the input audio signal. The processed audio signal is adjusted based on the characterized input temporal envelope to generate the output audio signal, wherein the output temporal envelope substantially matches the input temporal envelope.

    摘要翻译: 具有输入时间包络的输入音频信号被转换成具有输出时间包络的输出音频信号。 表征输入音频信号的输入时间包络。 输入音频信号被处理以产生经处理的音频信号,其中该处理使输入音频信号去相关。 经处理的音频信号基于表征的输入时间包络被调整以产生输出音频信号,其中输出时间包络基本上与输入的时间包络相匹配。

    Individual channel shaping for BCC schemes and the like
    94.
    发明申请
    Individual channel shaping for BCC schemes and the like 有权
    BCC方案的单个通道整形等

    公开(公告)号:US20060083385A1

    公开(公告)日:2006-04-20

    申请号:US11006482

    申请日:2004-12-07

    IPC分类号: H04R5/00

    CPC分类号: G10L19/008

    摘要: At an audio encoder, cue codes are generated for one or more audio channels, wherein an envelope cue code is generated by characterizing a temporal envelope in an audio channel. At an audio decoder, E transmitted audio channel(s) are decoded to generate C playback audio channels, where C>E≧1. Received cue codes include an envelope cue code corresponding to a characterized temporal envelope of an audio channel corresponding to the transmitted channel(s). One or more transmitted channel(s) are upmixed to generate one or more upmixed channels. One or more playback channels are synthesized by applying the cue codes to the one or more upmixed channels, wherein the envelope cue code is applied to an upmixed channel or a synthesized signal to adjust a temporal envelope of the synthesized signal based on the characterized temporal envelope such that the adjusted temporal envelope substantially matches the characterized temporal envelope.

    摘要翻译: 在音频编码器中,为一个或多个音频通道生成提示码,其中通过表征音频通道中的时间包络来产生包络线索码。 在音频解码器处,E个发送的音频信道被解码以产生C个播放音频信道,其中C> E> = 1。 接收的提示码包括与对应于所发送的频道的音频信道的特征化时间包络对应的信封提示码。 一个或多个传输的信道被混合以产生一个或多个上混频道。 通过将提示码应用于一个或多个上混合通道来合成一个或多个回放通道,其中,将包络提示码应用于上混合通道或合成信号,以基于表征的时间包络线来调整合成信号的时间包络 使得调整的时间包络基本上与所表征的时间包络相匹配。

    Method and device for error concealment in an encoded audio-signal and method and device for decoding an encoded audio signal
    95.
    发明授权
    Method and device for error concealment in an encoded audio-signal and method and device for decoding an encoded audio signal 有权
    用于编码音频信号中的错误隐藏的方法和装置,以及用于对编码的音频信号进行解码的方法和装置

    公开(公告)号:US07003448B1

    公开(公告)日:2006-02-21

    申请号:US09980534

    申请日:2000-04-12

    IPC分类号: G10L19/00

    CPC分类号: G10L19/005

    摘要: In a method for concealing an error in an encoded audio signal a set of spectral coefficients is subdivided into at least two sub-bands (14), whereupon the sub-bands are subjected to a re-verse transform (16). A specific prediction is performed (18) for each quasi time signal of a sub-band to obtain an estimated temporal representation for a sub-band of a set of spectral coefficients following the current set. A forward transform (20) of the time signal of each sub-band provides estimated spectral coefficients which can be used (28) instead of erroneous spectral coefficients of a following set of spectral coefficients, e.g. in order to conceal transmission errors. Transforming at the sub-band level provides independence from transform characteristics such as block length, window type and MDCT algorithm while at the same time preserving spectral processing for error concealment. Thus the spectral characteristics of audio signals can also be taken into account during error concealment.

    摘要翻译: 在用于隐藏编码音频信号中的错误的方法中,一组频谱系数被细分为至少两个子带(14),于是子带经受逆变换(16)。 对于子带的每个准时间信号执行特定的预测(18),以获得在当前集合之后的一组频谱系数的子带的估计时间表示。 每个子带的时间信号的正向变换(20)提供可以使用的估计的频谱系数(28),而不是以下的频谱系数集合的错误频谱系数,例如。 以掩盖传输错误。 在子带级变换提供独立于诸如块长度,窗口类型和MDCT算法的变换特征,同时保留用于错误隐藏的频谱处理。 因此,在错误隐藏期间也可以考虑音频信号的频谱特性。

    Method and device for embedding watermark information and method and device for extracting embedded watermark information
    96.
    发明申请
    Method and device for embedding watermark information and method and device for extracting embedded watermark information 有权
    用于嵌入水印信息的方法和装置以及用于提取嵌入水印信息的方法和装置

    公开(公告)号:US20050105726A1

    公开(公告)日:2005-05-19

    申请号:US10502622

    申请日:2003-02-25

    IPC分类号: G06T1/00 H04N7/167

    摘要: For embedding watermark information into an information signal including audio and/or video information, first of all a synchronization sequence with a plurality of synchronization sequence units and a data sequence with a plurality of data sequence units are provided, wherein between the data sequence and the synchronization sequence a time shift is present and wherein a degree of shifting depends on the watermark information. A combination means generates a combination sequence having a plurality of combination sequence units from the synchronization sequence and the data sequence shifted with regard to the synchronization sequence, wherein the combination sequence units are derived from synchronization sequence units and shifted data sequence units. The combination sequence is combined with the information signal in order to embed the watermark information into the information signal. A watermark extractor receives a synchronization sequence correlation peak for every data sequence correlation peak associated with the same and therefore determines the watermark information on the basis of the time interval between the synchronization sequence correlation peak and the data sequence correlation peak in a secure and robust way. The concept is robust, provides a high data rate and is simultaneously flexible with regard to the weighting of synchronization energy and data energy and with regard to the robustness on the one hand and data rate on the other hand, respectively.

    摘要翻译: 为了将水印信息嵌入到包括音频和/或视频信息的信息信号中,首先提供具有多个同步序列单元的同步序列和具有多个数据序列单元的数据序列,其中在数据序列和 同步序列存在时移,其中移位程度取决于水印信息。 组合装置根据同步序列产生具有多个组合序列单元的组合序列和相对于同步序列移位的数据序列,其中组合序列单元从同步序列单元和移位数据序列单元导出。 将组合序列与信息信号组合,以将水印信息嵌入到信息信号中。 水印提取器接收与其相关联的每个数据序列相关峰值的同步序列相关峰值,因此以安全和鲁棒的方式基于同步序列相关峰值与数据序列相关峰值之间的时间间隔来确定水印信息 。 该概念是鲁棒的,提供高数据速率,并且在同步能量和数据能量的加权方面以及另一方面一方面的鲁棒性和数据速率方面同时是灵活的。

    Temporal and spatial shaping of multi-channel audio signal
    97.
    发明授权
    Temporal and spatial shaping of multi-channel audio signal 有权
    多声道音频信号的时空整形

    公开(公告)号:US09361896B2

    公开(公告)日:2016-06-07

    申请号:US14151152

    申请日:2014-01-09

    IPC分类号: G10L19/008 H04S3/00

    CPC分类号: G10L19/008 H04S3/008

    摘要: A selected channel of a multi-channel signal which is represented by frames composed from sampling values having a high time resolution can be encoded with higher quality when a wave form parameter representation representing a wave form of an intermediate resolution representation of the selected channel is derived, the wave form parameter representation including a sequence of intermediate wave form parameters having a time resolution lower than the high time resolution of the sampling values and higher than a time resolution defined by a frame repetition rate. The wave form parameter representation with the intermediate resolution can be used to shape a reconstructed channel to retrieve a channel having a signal envelope close to that one of the selected original channel. The time scale on which the shaping is performed is shorter than the time scale of a framewise processing, thus enhancing the quality of the reconstructed channel. On the other hand, the shaping time scale is larger than the time scale of the sampling values, significantly reducing the amount of data needed by the wave form parameter representation.

    摘要翻译: 当表示所选频道的中间分辨率表示的波形的波形参数表示被导出时,由具有高时间分辨率的采样值组成的帧表示的多信道信号的选择信道可以被更高质量地编码 波形参数表示,包括具有低于采样值的高时间分辨率的时间分辨率并且高于由帧重复率定义的时间分辨率的中间波形参数序列。 具有中间分辨率的波形参数表示可用于对重建的信道进行整形以检索具有接近所选择的原始信道中的那一个的信号包络的信道。 进行整形的时间标度比框架处理的时间标度短,从而提高重构信道的质量。 另一方面,成形时间尺度大于采样值的时间尺度,显着减少波形参数表示所需的数据量。

    Apparatus, method and computer program for providing one or more adjusted parameters for provision of an upmix signal representation on the basis of a downmix signal representation and a parametric side information associated with the downmix signal representation, using an average value
    98.
    发明授权
    Apparatus, method and computer program for providing one or more adjusted parameters for provision of an upmix signal representation on the basis of a downmix signal representation and a parametric side information associated with the downmix signal representation, using an average value 有权
    装置,方法和计算机程序,用于提供一个或多个经调整的参数,用于根据降混信号表示和与缩混信号表示相关联的参数侧信息来提供上混合信号表示,使用平均值

    公开(公告)号:US09245530B2

    公开(公告)日:2016-01-26

    申请号:US13446747

    申请日:2012-04-13

    IPC分类号: H04R5/00 G10L19/008

    CPC分类号: G10L19/008

    摘要: An apparatus for providing one or more adjusted parameters for a provision of an upmix signal representation on the basis of a downmix signal representation and a parametric side information associated with the downmix signal representation has a parameter adjuster. The parameter adjuster is configured to receive one or more parameters and to provide, on the basis thereof, one or more adjusted parameters. The parameter adjuster is configured to provide the one or more adjusted parameters in dependence on an average value of a plurality of parameter values, such that a distortion of the upmix signal representation caused by the use of non-optimal parameters is reduced at least for parameters deviating from optimal parameters by more than a predetermined deviation.

    摘要翻译: 基于降混信号表示和与下混合信号表示相关联的参数侧信息来提供用于提供上混合信号表示的一个或多个调整参数的装置具有参数调整器。 参数调整器被配置为接收一个或多个参数,并且在其基础上提供一个或多个调整参数。 参数调整器被配置为根据多个参数值的平均值来提供一个或多个经调整的参数,使得至少由于使用非最佳参数引起的上混信号表示的失真至少对于参数 偏离最佳参数大于预定偏差。

    Audio format transcoder
    99.
    发明授权
    Audio format transcoder 有权
    音频格式转码器

    公开(公告)号:US08891797B2

    公开(公告)日:2014-11-18

    申请号:US13289252

    申请日:2011-11-04

    CPC分类号: G10L21/0272 G10L19/008

    摘要: An audio format transcoder for transcoding an input audio signal, the input audio signal having at least two directional audio components. The audio format transcoder including a converter for converting the input audio signal into a converted signal, the converted signal having a converted signal representation and a converted signal direction of arrival. The audio format transcoder further includes a position provider for providing at least two spatial positions of at least two spatial audio sources and a processor for processing the converted signal representation based on the at least two spatial positions to obtain at least two separated audio source measures.

    摘要翻译: 一种用于对输入音频信号进行代码转换的音频格式转码器,所述输入音频信号具有至少两个方向音频分量。 音频格式转码器包括用于将输入音频信号转换成转换信号的转换器,转换后的信号具有转换的信号表示和转换的信号到达方向。 音频格式转码器还包括用于提供至少两个空间音频源的至少两个空间位置的位置提供器和用于基于至少两个空间位置处理转换的信号表示的处理器,以获得至少两个分离的音频源测量。