Device and Method for Graduated Encoding of a Multichannel Audio Signal Based on a Principal Component Analysis
    1.
    发明申请
    Device and Method for Graduated Encoding of a Multichannel Audio Signal Based on a Principal Component Analysis 有权
    基于主成分分析的多通道音频信号分级编码的装置和方法

    公开(公告)号:US20090083045A1

    公开(公告)日:2009-03-26

    申请号:US12293072

    申请日:2007-03-08

    CPC classification number: G10L19/008 G10L19/24

    Abstract: A system and a method for the scalable coding of a multi-channel audio signal comprising a principal component analysis (PCA) transformation of at least two channels (L, R) of the audio signal into a principal component (CP) and at least one residual sub-component (r) by rotation defined by a transformation parameter (θ), comprising the following steps: formation of a frequency subband-based residual structure (Sfr) on the basis of the at least one residual sub-component (r), and definition of a coded audio signal (SC) comprising the principal component (CP), at least one residual structure (Sfr) of a frequency subband and the transformation parameter (θ).

    Abstract translation: 一种用于多声道音频信号的可缩放编码的系统和方法,包括将音频信号的至少两个声道(L,R)的主成分分析(PCA)变换成主要分量(CP)和至少一个 通过由变换参数(θ)定义的旋转的残余子分量(r),包括以下步骤:基于所述至少一个残余子分量(r)形成基于频率子带的残差结构(Sfr) 以及包括主成分(CP)的编码音频信号(SC),频率子带的至少一个残留结构(Sfr)和变换参数(theta)的定义。

    Device and Method for Encoding by Principal Component Analysis a Multichannel Audio Signal
    2.
    发明申请
    Device and Method for Encoding by Principal Component Analysis a Multichannel Audio Signal 有权
    通过主成分分析来编码多通道音频信号的装置和方法

    公开(公告)号:US20090083044A1

    公开(公告)日:2009-03-26

    申请号:US12293041

    申请日:2007-03-08

    CPC classification number: G10L19/008

    Abstract: A system and a method for coding by principal component analysis (PCA) of a multi-channel audio signal comprising the following steps: decomposing at least two channels (L, R) of said audio signal into a plurality of frequency sub-bands (1(b1), . . . , 1(bN), r(b1), . . . , r(bN)), calculating at least one transformation parameter (θ(b1), . . . , θ(bN)) as a function of at least some of said plurality of frequency sub-bands, transforming at least some of said plurality of frequency sub-bands into a plurality of frequency sub-components as a function of said at least one transformation parameter (θ(b1), . . . , θ(bN)), said plurality of frequency sub-components comprising principal frequency sub-components (CP(b1), . . . , CP(bN)), combining at least some of said principal frequency sub-components (CP(b1), . . . , CP(bN)) in order to form a principal component (CP), and defining a coded audio signal (SC) representing said multi-channel audio signal (C1, . . . ,CM), said coded audio signal (SC) comprising said principal component (CP) and said at least one transformation parameter (θ(b1), . . . , θ(bN)).

    Abstract translation: 一种用于通过多声道音频信号的主成分分析(PCA)进行编码的系统和方法,包括以下步骤:将所述音频信号的至少两个声道(L,R)分解成多个频率子带(1 (b1),...,1(bN),r(b1),...,r(bN)),计算至少一个变换参数(theta(b1),...,theta(bN) 所述多个频率子带中的至少一些的功能是将所述多个频率子带中的至少一些变换为多个频率子分量作为所述至少一个变换参数(θ(b1))的函数, ,...,theta(bN)),所述多个频率子分量包括主频分量分量(CP(b1),...,CP(bN)),将至少一些所述主频分量 组件(CP(b1),...,CP(bN)),以形成主要组件(CP),并且定义表示所述多声道音频信号(C1,..., CM)说 包括所述主成分(CP)和所述至少一个变换参数(theta(b1))的编码音频信号(SC)。 。 。 ,theta(bN))。

    Device and method for encoding by principal component analysis a multichannel audio signal
    3.
    发明授权
    Device and method for encoding by principal component analysis a multichannel audio signal 有权
    通过主成分分析来编码多通道音频信号的装置和方法

    公开(公告)号:US08370134B2

    公开(公告)日:2013-02-05

    申请号:US12293041

    申请日:2007-03-08

    CPC classification number: G10L19/008

    Abstract: A system and a method for coding by principal component analysis (PCA) of a multi-channel audio signal comprising the following steps: decomposing at least two channels (L, R) of said audio signal into a plurality of frequency sub-bands (I(b1), . . . , I(bN), r(b1), . . . , r(bN)), calculating at least one transformation parameter (θ(b1), . . . , θ(bN)) as a function of at least some of said plurality of frequency sub-bands, transforming at least some of said plurality of frequency sub-bands into a plurality of frequency sub-components as a function of said at least one transformation parameter (θ(b1), . . . , θ(bN)), said plurality of frequency sub-components comprising principal frequency sub-components (CP(b1), . . . , CP(bN)), combining at least some of said principal frequency sub-components (CP(b1), . . . , CP(bN)) in order to form a principal component (CP), and defining a coded audio signal (SC) representing said multi-channel audio signal (C1, . . . , CM), said coded audio signal (SC) comprising said principal component (CP) and said at least one transformation parameter (θ(b1), . . . , θ(bN)).

    Abstract translation: 一种用于通过主要分量分析(PCA)编码多声道音频信号的系统和方法,包括以下步骤:将所述音频信号的至少两个声道(L,R)分解成多个频率子带(I (b1),...,I(bN),r(b1),...,r(bN)),计算至少一个变换参数(&Thetas;(b1),...,&thetas;(bN) )作为所述多个频率子带中的至少一些的函数,将所述多个频率子带中的至少一些频率子带变换为多个频率子分量,作为所述至少一个变换参数的函数(“ (b1),...,...,...(bN)),所述多个频率子分量包括主频分量分量(CP(b1),...,CP(bN)), 主要频率子分量(CP(b1),...,CP(bN)),以形成主分量(CP),并且定义表示所述多声道音频信号(C1, ... ,CM),所述编码音频信号(SC)包括所述主分量(CP)和所述至少一个变换参数(“The”;(b1),...,&thetas;(bN))。

    Device and method for graduated encoding of a multichannel audio signal based on a principal component analysis
    4.
    发明授权
    Device and method for graduated encoding of a multichannel audio signal based on a principal component analysis 有权
    基于主成分分析的多通道音频信号的分级编码的装置和方法

    公开(公告)号:US08359194B2

    公开(公告)日:2013-01-22

    申请号:US12293072

    申请日:2007-03-08

    CPC classification number: G10L19/008 G10L19/24

    Abstract: A system and a method for the scalable coding of a multi-channel audio signal comprising a principal component analysis (PCA) transformation of at least two channels (L, R) of the audio signal into a principal component (CP) and at least one residual sub-component (r) by rotation defined by a transformation parameter (θ), comprising the following steps: formation of a frequency subband-based residual structure (Sfr) on the basis of the at least one residual sub-component (r), and definition of a coded audio signal (SC) comprising the principal component (CP), at least one residual structure (Sfr) of a frequency subband and the transformation parameter (θ).

    Abstract translation: 一种用于多声道音频信号的可缩放编码的系统和方法,包括将音频信号的至少两个声道(L,R)的主成分分析(PCA)变换为主成分(CP)和至少一个 包括以下步骤:基于所述至少一个残余子分量(r)来形成基于频率子带的残余结构(Sfr),所述残余子分量(r)由转换参数(& )和包括主成分(CP)的编码音频信号(SC)的定义,频率子带的至少一个残留结构(Sfr)和变换参数(“thetas”)。

    DETERMINING THE INTER-CHANNEL TIME DIFFERENCE OF A MULTI-CHANNEL AUDIO SIGNAL
    5.
    发明申请
    DETERMINING THE INTER-CHANNEL TIME DIFFERENCE OF A MULTI-CHANNEL AUDIO SIGNAL 有权
    确定多通道音频信号的通道时间差

    公开(公告)号:US20130301835A1

    公开(公告)日:2013-11-14

    申请号:US13980427

    申请日:2011-04-07

    Abstract: There is provided a method and device for determining an inter-channel time difference of a multi-channel audio signal having at least two channels. A determination is made, at a number of consecutive time instances, of inter-channel correlation based on a cross-correlation function involving at least two different channels of the multi-channel audio signal. Each value of the inter-channel correlation is associated with a corresponding value of the inter-channel time difference. An adaptive inter-channel correlation threshold is adaptively determined based on adaptive smoothing of the inter-channel correlation in time. A current value of the inter-channel correlation is then evaluated in relation to the adaptive inter-channel correlation threshold to determine whether the corresponding current value of the inter-channel time difference is relevant. Based on the result of this evaluation, an updated value of the inter-channel time difference is determined.

    Abstract translation: 提供了一种用于确定具有至少两个信道的多声道音频信号的信道间时间差的方法和装置。 在多个连续时间实例中,基于涉及多声道音频信号的至少两个不同信道的互相关函数,确定信道间相关。 信道间相关的每个值与信道间时间差的对应值相关联。 基于时间上的信道间相关的自适应平滑自适应地确定自适应信道间相关阈值。 然后,关于自适应信道间相关阈值来评估信道间相关的当前值,以确定信道间时间差的对应当前值是否相关。 根据该评价结果,确定通道间时差的更新值。

    Method and device for noise filling
    6.
    发明授权
    Method and device for noise filling 有权
    噪声填充的方法和装置

    公开(公告)号:US08370133B2

    公开(公告)日:2013-02-05

    申请号:US12675290

    申请日:2008-08-26

    CPC classification number: G10L19/028 G10L19/035 G10L21/0364

    Abstract: A method for perceptual spectral decoding comprises decoding of spectral coefficients recovered from a binary flux into decoded spectral coefficients of an initial set of spectral coefficients. The initial set of spectral coefficients are spectrum filled. The spectrum filling comprises noise filling of spectral holes by setting spectral coefficients in the initial set of spectral coefficients not being decoded from the binary flux equal to elements derived from the decoded spectral coefficients. The set of reconstructed spectral coefficients of a frequency domain formed by the spectrum filling is converted into an audio signal of a time domain. A perceptual spectral decoder comprises a noise filler, operating according to the method for perceptual spectral decoding.

    Abstract translation: 用于感知频谱解码的方法包括将从二进制通量恢复的频谱系数解码为初始频谱系数集合的解码频谱系数。 频谱系数的初始集合是频谱填充的。 频谱填充包括通过将频谱系数的初始集合中的频谱系数设置为不等于从解码频谱系数导出的元素的二进制通量来解码频谱孔的噪声填充。 通过频谱填充形成的频域的重构频谱系数的集合被转换成时域的音频信号。 感知频谱解码器包括根据用于感知频谱解码的方法操作的噪声填充器。

    Method for determining an audio data spatial encoding mode
    7.
    发明授权
    Method for determining an audio data spatial encoding mode 有权
    用于确定音频数据空间编码模式的方法

    公开(公告)号:US08600530B2

    公开(公告)日:2013-12-03

    申请号:US12087321

    申请日:2006-12-13

    CPC classification number: H04L67/303 G10L19/008 H04L65/1069 H04L69/24

    Abstract: A method of determining a spatial coding mode for audio data sent by a sender entity (20) to a receiver terminal (10) adapted to receive said audio data in one or more sound reproduction formats. The method comprises the steps of the receiver terminal (10) indicating (1) to the sender entity (20) said reproduction format(s) in a given order of preference and the sender entity (20) determining a mode of spatial coding of the audio data compatible with a reproduction format taken in the order of preference and indicating (2) said reproduction format to the receiver terminal (10). Application to Voice over IP services.

    Abstract translation: 确定由发送器实体(20)发送到适于以一种或多种声音再现格式接收所述音频数据的接收器终端(10)的音频数据的空间编码模式的方法。 该方法包括接收终端(10)的步骤,其以给定的优先顺序指示(1)发送者实体(20)所述再现格式,发送方实体(20)确定 音频数据与按照偏好顺序拍摄的再现格式兼容,并且向接收机终端(10)指示(2)所述再现格式。 应用于IP语音业务。

    Determining the inter-channel time difference of a multi-channel audio signal

    公开(公告)号:US10002614B2

    公开(公告)日:2018-06-19

    申请号:US13981035

    申请日:2011-04-07

    CPC classification number: G10L19/008 G10L25/06

    Abstract: There is provided a method and device for determining an inter-channel time difference of a multi-channel audio signal having at least two channels. A set of local maxima of a cross-correlation function involving at least two different channels of the multi-channel audio signal is determined (S1) for positive and negative time-lags, where each local maximum is associated with a corresponding time-lag. From the set of local maxima, a local maximum for positive time-lags is selected as a so-called positive time-lag inter-channel correlation candidate and a local maximum for negative time-lags is selected as a so-called negative time-lag inter-channel correlation candidate (S2). When the absolute value of a difference in amplitude between the inter-channel correlation candidates is smaller than a first threshold, it is evaluated whether there is an energy-dominant channel (S3). When there is an energy-dominant-channel, the sign of the inter-channel time difference is identified and a current value of the inter-channel time difference is extracted based on either the time-lag corresponding to the positive time-lag inter-channel con-elation candidate or the time-lag corresponding to the negative time-lag inter-channel correlation candidate (S4).

    Determining the inter-channel time difference of a multi-channel audio signal
    9.
    发明授权
    Determining the inter-channel time difference of a multi-channel audio signal 有权
    确定多声道音频信号的通道间时差

    公开(公告)号:US09424852B2

    公开(公告)日:2016-08-23

    申请号:US13980427

    申请日:2011-04-07

    Abstract: There is provided a method and device for determining an inter-channel time difference of a multi-channel audio signal having at least two channels. A determination is made, at a number of consecutive time instances, of inter-channel correlation based on a cross-correlation function involving at least two different channels of the multi-channel audio signal. Each value of the inter-channel correlation is associated with a corresponding value of the inter-channel time difference. An adaptive inter-channel correlation threshold is adaptively determined based on adaptive smoothing of the inter-channel correlation in time. A current value of the inter-channel correlation is then evaluated in relation to the adaptive inter-channel correlation threshold to determine whether the corresponding current value of the inter-channel time difference is relevant. Based on the result of this evaluation, an updated value of the inter-channel time difference is determined.

    Abstract translation: 提供了一种用于确定具有至少两个信道的多声道音频信号的信道间时间差的方法和装置。 在多个连续时间实例中,基于涉及多声道音频信号的至少两个不同信道的互相关函数,确定信道间相关。 信道间相关的每个值与信道间时间差的对应值相关联。 基于时间上的信道间相关的自适应平滑自适应地确定自适应信道间相关阈值。 然后,关于自适应信道间相关阈值评估信道间相关的当前值,以确定信道间时间差的相应当前值是否相关。 根据该评价结果,确定通道间时差的更新值。

    Determining the Inter-Channel Time Difference of a Multi-Channel Audio Signal
    10.
    发明申请
    Determining the Inter-Channel Time Difference of a Multi-Channel Audio Signal 有权
    确定多通道音频信号的通道间差

    公开(公告)号:US20130304481A1

    公开(公告)日:2013-11-14

    申请号:US13981035

    申请日:2011-04-07

    CPC classification number: G10L19/008 G10L25/06

    Abstract: There is provided a method and device for determining an inter-channel time difference of a multi-channel audio signal having at least two channels. A set of local maxima of a cross-correlation function involving at least two different channels of the multi-channel audio signal is determined (S1) for positive and negative time-lags, where each local maximum is associated with a corresponding time-lag. From the set of local maxima, a local maximum for positive time-lags is selected as a so-called positive time-lag inter-channel correlation candidate and a local maximum for negative time-lags is selected as a so-called negative time-lag inter-channel correlation candidate (S2). When the absolute value of a difference in amplitude between the inter-channel correlation candidates is smaller than a first threshold, it is evaluated whether there is an energy-dominant channel (S3). When there is an energy-dominant-channel, the sign of the inter-channel time difference is identified and a current value of the inter-channel time difference is extracted based on either the time-lag corresponding to the positive time-lag inter-channel con-elation candidate or the time-lag corresponding to the negative time-lag inter-channel correlation candidate (S4).

    Abstract translation: 提供了一种用于确定具有至少两个信道的多声道音频信号的信道间时间差的方法和装置。 确定涉及多通道音频信号的至少两个不同通道的互相关函数的一组局部最大值,用于正和负时间滞后,其中每个局部最大值与相应的时间滞后相关联。 从本地最大值集合中,选择正时滞的局部最大值作为所谓的正时滞信道间相关候选,并且选择负时间滞后的局部最大值作为所谓的负时间 - 滞后信道间相关候选(S2)。 当信道间相关候选之间的幅度差的绝对值小于第一阈值时,评估是否存在能量占优通道(S3)。 当存在能量占优通道时,识别通道间时差的符号,并且基于对应于正时滞时间差的时间延迟来提取通道间时间差的当前值, 信道相关候选或对应于负时间延迟信道间相关候选的时间延迟(S4)。

Patent Agency Ranking