Device and Method for Graduated Encoding of a Multichannel Audio Signal Based on a Principal Component Analysis
    1.
    发明申请
    Device and Method for Graduated Encoding of a Multichannel Audio Signal Based on a Principal Component Analysis 有权
    基于主成分分析的多通道音频信号分级编码的装置和方法

    公开(公告)号:US20090083045A1

    公开(公告)日:2009-03-26

    申请号:US12293072

    申请日:2007-03-08

    CPC classification number: G10L19/008 G10L19/24

    Abstract: A system and a method for the scalable coding of a multi-channel audio signal comprising a principal component analysis (PCA) transformation of at least two channels (L, R) of the audio signal into a principal component (CP) and at least one residual sub-component (r) by rotation defined by a transformation parameter (θ), comprising the following steps: formation of a frequency subband-based residual structure (Sfr) on the basis of the at least one residual sub-component (r), and definition of a coded audio signal (SC) comprising the principal component (CP), at least one residual structure (Sfr) of a frequency subband and the transformation parameter (θ).

    Abstract translation: 一种用于多声道音频信号的可缩放编码的系统和方法,包括将音频信号的至少两个声道(L,R)的主成分分析(PCA)变换成主要分量(CP)和至少一个 通过由变换参数(θ)定义的旋转的残余子分量(r),包括以下步骤:基于所述至少一个残余子分量(r)形成基于频率子带的残差结构(Sfr) 以及包括主成分(CP)的编码音频信号(SC),频率子带的至少一个残留结构(Sfr)和变换参数(theta)的定义。

    Device and Method for Encoding by Principal Component Analysis a Multichannel Audio Signal
    2.
    发明申请
    Device and Method for Encoding by Principal Component Analysis a Multichannel Audio Signal 有权
    通过主成分分析来编码多通道音频信号的装置和方法

    公开(公告)号:US20090083044A1

    公开(公告)日:2009-03-26

    申请号:US12293041

    申请日:2007-03-08

    CPC classification number: G10L19/008

    Abstract: A system and a method for coding by principal component analysis (PCA) of a multi-channel audio signal comprising the following steps: decomposing at least two channels (L, R) of said audio signal into a plurality of frequency sub-bands (1(b1), . . . , 1(bN), r(b1), . . . , r(bN)), calculating at least one transformation parameter (θ(b1), . . . , θ(bN)) as a function of at least some of said plurality of frequency sub-bands, transforming at least some of said plurality of frequency sub-bands into a plurality of frequency sub-components as a function of said at least one transformation parameter (θ(b1), . . . , θ(bN)), said plurality of frequency sub-components comprising principal frequency sub-components (CP(b1), . . . , CP(bN)), combining at least some of said principal frequency sub-components (CP(b1), . . . , CP(bN)) in order to form a principal component (CP), and defining a coded audio signal (SC) representing said multi-channel audio signal (C1, . . . ,CM), said coded audio signal (SC) comprising said principal component (CP) and said at least one transformation parameter (θ(b1), . . . , θ(bN)).

    Abstract translation: 一种用于通过多声道音频信号的主成分分析(PCA)进行编码的系统和方法,包括以下步骤:将所述音频信号的至少两个声道(L,R)分解成多个频率子带(1 (b1),...,1(bN),r(b1),...,r(bN)),计算至少一个变换参数(theta(b1),...,theta(bN) 所述多个频率子带中的至少一些的功能是将所述多个频率子带中的至少一些变换为多个频率子分量作为所述至少一个变换参数(θ(b1))的函数, ,...,theta(bN)),所述多个频率子分量包括主频分量分量(CP(b1),...,CP(bN)),将至少一些所述主频分量 组件(CP(b1),...,CP(bN)),以形成主要组件(CP),并且定义表示所述多声道音频信号(C1,..., CM)说 包括所述主成分(CP)和所述至少一个变换参数(theta(b1))的编码音频信号(SC)。 。 。 ,theta(bN))。

    Device and method for encoding by principal component analysis a multichannel audio signal
    3.
    发明授权
    Device and method for encoding by principal component analysis a multichannel audio signal 有权
    通过主成分分析来编码多通道音频信号的装置和方法

    公开(公告)号:US08370134B2

    公开(公告)日:2013-02-05

    申请号:US12293041

    申请日:2007-03-08

    CPC classification number: G10L19/008

    Abstract: A system and a method for coding by principal component analysis (PCA) of a multi-channel audio signal comprising the following steps: decomposing at least two channels (L, R) of said audio signal into a plurality of frequency sub-bands (I(b1), . . . , I(bN), r(b1), . . . , r(bN)), calculating at least one transformation parameter (θ(b1), . . . , θ(bN)) as a function of at least some of said plurality of frequency sub-bands, transforming at least some of said plurality of frequency sub-bands into a plurality of frequency sub-components as a function of said at least one transformation parameter (θ(b1), . . . , θ(bN)), said plurality of frequency sub-components comprising principal frequency sub-components (CP(b1), . . . , CP(bN)), combining at least some of said principal frequency sub-components (CP(b1), . . . , CP(bN)) in order to form a principal component (CP), and defining a coded audio signal (SC) representing said multi-channel audio signal (C1, . . . , CM), said coded audio signal (SC) comprising said principal component (CP) and said at least one transformation parameter (θ(b1), . . . , θ(bN)).

    Abstract translation: 一种用于通过主要分量分析(PCA)编码多声道音频信号的系统和方法,包括以下步骤:将所述音频信号的至少两个声道(L,R)分解成多个频率子带(I (b1),...,I(bN),r(b1),...,r(bN)),计算至少一个变换参数(&Thetas;(b1),...,&thetas;(bN) )作为所述多个频率子带中的至少一些的函数,将所述多个频率子带中的至少一些频率子带变换为多个频率子分量,作为所述至少一个变换参数的函数(“ (b1),...,...,...(bN)),所述多个频率子分量包括主频分量分量(CP(b1),...,CP(bN)), 主要频率子分量(CP(b1),...,CP(bN)),以形成主分量(CP),并且定义表示所述多声道音频信号(C1, ... ,CM),所述编码音频信号(SC)包括所述主分量(CP)和所述至少一个变换参数(“The”;(b1),...,&thetas;(bN))。

    Device and method for graduated encoding of a multichannel audio signal based on a principal component analysis
    4.
    发明授权
    Device and method for graduated encoding of a multichannel audio signal based on a principal component analysis 有权
    基于主成分分析的多通道音频信号的分级编码的装置和方法

    公开(公告)号:US08359194B2

    公开(公告)日:2013-01-22

    申请号:US12293072

    申请日:2007-03-08

    CPC classification number: G10L19/008 G10L19/24

    Abstract: A system and a method for the scalable coding of a multi-channel audio signal comprising a principal component analysis (PCA) transformation of at least two channels (L, R) of the audio signal into a principal component (CP) and at least one residual sub-component (r) by rotation defined by a transformation parameter (θ), comprising the following steps: formation of a frequency subband-based residual structure (Sfr) on the basis of the at least one residual sub-component (r), and definition of a coded audio signal (SC) comprising the principal component (CP), at least one residual structure (Sfr) of a frequency subband and the transformation parameter (θ).

    Abstract translation: 一种用于多声道音频信号的可缩放编码的系统和方法,包括将音频信号的至少两个声道(L,R)的主成分分析(PCA)变换为主成分(CP)和至少一个 包括以下步骤:基于所述至少一个残余子分量(r)来形成基于频率子带的残余结构(Sfr),所述残余子分量(r)由转换参数(& )和包括主成分(CP)的编码音频信号(SC)的定义,频率子带的至少一个残留结构(Sfr)和变换参数(“thetas”)。

    Binaural spatialization of compression-encoded sound data utilizing phase shift and delay applied to each subband
    6.
    发明授权
    Binaural spatialization of compression-encoded sound data utilizing phase shift and delay applied to each subband 有权
    使用相移和延迟的压缩编码声音数据的双耳空间化应用于每个子带

    公开(公告)号:US08880413B2

    公开(公告)日:2014-11-04

    申请号:US12309074

    申请日:2007-06-19

    Abstract: The invention is aimed at improving the quality of the filtering by transfer functions of HRTF type of signals (L, R) compressed in a transformed domain, for binaural playing on two channels (L-BIN, R-BIN), using a combination of HRTF filters (hL,L, hL,R) including a decorrelated version (HRTF-C*, HRTF-E*) of a few of these filters. For this purpose, a decorrelation cue is given with spatialization parameters (SPAT) accompanying the compressed signals (L, R). The Decorrelation comprises applying a different phase shift to each subband of the input signal combined with addition of an overall delay. The invention makes it possible to improve the broadening in the binaural rendition of audio scenes initially in a multi-channel format.

    Abstract translation: 本发明旨在通过在变换域中压缩的信号(L,R)的HRTF类型的传递函数来提高滤波质量,用于双通道(L-BIN,R-BIN)上的双耳播放,使用 HRTF滤波器(hL,L,hL,R)包括几个滤波器的去相关版本(HRTF-C *,HRTF-E *)。 为此,给出了伴随压缩信号(L,R)的空间化参数(SPAT)的去相关提示。 解相关包括对输入信号的每个子带应用不同的相移以及总延迟的加法。 本发明可以改善最初以多声道格式的音频场景的双耳再现的扩展。

    Method for updating an encoder by filter interpolation
    7.
    发明授权
    Method for updating an encoder by filter interpolation 有权
    通过滤波器插值更新编码器的方法

    公开(公告)号:US08788555B2

    公开(公告)日:2014-07-22

    申请号:US13056154

    申请日:2009-07-03

    CPC classification number: G10L19/0212 H03H17/0264

    Abstract: A method for updating the processing capacity of an encoder or decoder to use a modulated transform having a size greater than a predetermined initial size is provided, particularly, where the encoders or decoders are for storing an initial prototype filter defined by an ordered set of initial size coefficients. A step is provided for constructing a prototype filter of a size greater than the initial size to implement the modulated transform of the greater size by inserting at least one coefficient between two consecutive coefficients of the initial prototype filter.

    Abstract translation: 提供了一种用于更新编码器或解码器以使用尺寸大于预定初始尺寸的调制变换的处理能力的方法,特别地,其中编码器或解码器用于存储由初始化的有序集合定义的初始原型滤波器 尺寸系数。 提供了一个步骤,用于构建大于初始尺寸的原型滤波器,以通过在初始原型滤波器的两个连续系数之间插入至少一个系数来实现较大尺寸的调制变换。

    IMPROVED CODING/DECODING OF DIGITAL AUDIO SIGNALS
    8.
    发明申请
    IMPROVED CODING/DECODING OF DIGITAL AUDIO SIGNALS 有权
    改进数字音频信号的编码/解码

    公开(公告)号:US20120185255A1

    公开(公告)日:2012-07-19

    申请号:US13382786

    申请日:2010-06-25

    CPC classification number: G10L19/002 G10L19/0212 G10L19/038 G10L19/24

    Abstract: A method of hierarchical coding of a digital audio frequency input signal into several frequency sub-bands, including a core coding of the input signal according to a first throughput and at least one enhancement coding of higher throughput, of a residual signal. The core coding uses a binary allocation according to an energy criterion. The method includes for the enhancement coding: calculating a frequency-based masking threshold for at least part of the frequency bands processed by the enhancement coding; determining a perceptual importance per frequency sub-band as a function of the masking threshold and as a function of the number of bits allocated for the core coding; binary allocation of bits in the frequency sub-bands processed by the enhancement coding, as a function of the perceptual importance determined; and coding the residual signal according to the bit allocation. Also provided are a decoding method, a coder and a decoder.

    Abstract translation: 一种将数字音频输入信号分层编码成若干频率子带的方法,包括根据第一吞吐量的输入信号的核心编码和较高吞吐量的残余信号的至少一个增强编码。 核心编码根据能量标准使用二进制分配。 该方法包括用于增强编码:计算由增强编码处理的至少部分频带的基于频率的掩蔽阈值; 确定每个频率子带的感知重要性作为掩蔽阈值的函数,并且作为分配给核心编码的比特数的函数; 由增强编码处理的频率子带中的位的二进制分配作为确定的感知重要性的函数; 并根据比特分配对残差信号进行编码。 还提供了解码方法,编码器和解码器。

    RECONSTRUCTION OF MULTI-CHANNEL AUDIO DATA
    9.
    发明申请
    RECONSTRUCTION OF MULTI-CHANNEL AUDIO DATA 有权
    重建多通道音频数据

    公开(公告)号:US20110129092A1

    公开(公告)日:2011-06-02

    申请号:US13056169

    申请日:2009-07-03

    CPC classification number: G10L19/008 G10L19/005 H04R2420/03 H04S3/02

    Abstract: A method for processing sound data is provided for the reconstruction of multi-channel audio data on the basis at least of data on a reduced number of channels and of spatialization data. A test is carried out to determine whether the spatialization data received are valid. If the test is positive, a spatialization value is predicted according to a per respective model of a plurality of models. A prediction model is chosen on the basis of the spatialization values thus predicted and on the basis of the spatialization data received, to permit, in case of subsequent reception of defective spatialization data, a prediction according to this chosen model of a spatialization value and to use this predicted spatialization value for the reconstruction of the multi-channel audio data.

    Abstract translation: 提供用于处理声音数据的方法,用于至少基于减少数量的信道和空间数据的数据的重建多声道音频数据。 进行测试以确定接收到的空间化数据是否有效。 如果测试是正的,则根据多个模型的每个相应模型来预测空间化值。 基于如此预测的空间化值,并且基于所接收的空间化数据来选择预测模型,以便在随后接收不合格的空间数据的情况下,根据该空间化值的所选择的模型进行预测,并且 使用该预测的空间化值来重建多声道音频数据。

    HIERARCHICAL ENCODING/DECODING DEVICE
    10.
    发明申请
    HIERARCHICAL ENCODING/DECODING DEVICE 有权
    分层编码/解码设备

    公开(公告)号:US20090326931A1

    公开(公告)日:2009-12-31

    申请号:US11988758

    申请日:2006-07-07

    CPC classification number: G10L19/24

    Abstract: A system for coding a hierarchical audio signal, comprising, at least, a core layer using parametric coding by analysis by synthesis in a first frequency band, a band extension layer for widening said first frequency band into a second frequency band, or wideband. The system also comprises a wideband audio coding quality enhancement layer based on transform coding using a spectral parameter obtained from said band extension layer. Application to transmitting speech and/or audio signals over packet networks.

    Abstract translation: 一种用于对分层音频信号进行编码的系统,至少包括使用第一频带中的合成分析的参数编码的核心层,用于将所述第一频带扩展成第二频带的带扩展层或宽带。 该系统还包括基于使用从所述频带扩展层获得的频谱参数的变换编码的宽带音频编码质量增强层。 应用于通过分组网络传输语音和/或音频信号。

Patent Agency Ranking