Phase-Amplitude 3-D Stereo Encoder and Decoder
    1.
    发明申请
    Phase-Amplitude 3-D Stereo Encoder and Decoder 有权
    相位振幅三维立体声编码器和解码器

    公开(公告)号:US20090092259A1

    公开(公告)日:2009-04-09

    申请号:US12246491

    申请日:2008-10-06

    IPC分类号: H04R5/00

    CPC分类号: H04S3/02 G10L19/008

    摘要: A two-channel phase-amplitude stereo encoding and decoding scheme enabling flexible and spatially accurate interactive 3-D audio reproduction via standard audio-only two-channel transmission. The encoding scheme allows associating a 2-D or 3-D positional localization to each of a plurality of sound sources by use of frequency independent inter-channel phase and amplitude differences. The decoder is based on frequency-domain spatial analysis of 2-D or 3-D directional cues in a two-channel stereo signal and re-synthesis of these cues using any preferred spatialization technique, thereby allowing faithful reproduction of positional audio cues and reverberation or ambient cues over arbitrary multi-channel loudspeaker reproduction formats or over headphones, while preserving source separation despite the intermediate encoding over only two audio channels.

    摘要翻译: 双通道相位立体声编码和解码方案,通过标准的仅音频双通道传输实现灵活和空间准确的交互式3-D音频再现。 编码方案允许通过使用频率无关的信道间相位和幅度差来将多维声源中的每一个与二维或三维位置定位相关联。 解码器基于双声道立体声信号中的2-D或3-D方向提示的频域空间分析,并且使用任何优选的空间化技术重新合成这些线索,从而允许忠实地再现位置音频线索和混响 或环境提示超过任意多声道扬声器再现格式或通过耳机,同时保留源分离,尽管中间编码只有两个音频通道。

    Phase-amplitude 3-D stereo encoder and decoder
    2.
    发明授权
    Phase-amplitude 3-D stereo encoder and decoder 有权
    相位振幅三维立体声编码器和解码器

    公开(公告)号:US08712061B2

    公开(公告)日:2014-04-29

    申请号:US12246491

    申请日:2008-10-06

    IPC分类号: H04R5/00

    CPC分类号: H04S3/02 G10L19/008

    摘要: A two-channel phase-amplitude stereo encoding and decoding scheme enabling flexible and spatially accurate interactive 3-D audio reproduction via standard audio-only two-channel transmission. The encoding scheme allows associating a 2-D or 3-D positional localization to each of a plurality of sound sources by use of frequency independent inter-channel phase and amplitude differences. The decoder is based on frequency-domain spatial analysis of 2-D or 3-D directional cues in a two-channel stereo signal and re-synthesis of these cues using any preferred spatialization technique, thereby allowing faithful reproduction of positional audio cues and reverberation or ambient cues over arbitrary multi-channel loudspeaker reproduction formats or over headphones, while preserving source separation despite the intermediate encoding over only two audio channels.

    摘要翻译: 双通道相位立体声编码和解码方案,通过标准的仅音频双通道传输实现灵活和空间准确的交互式3-D音频再现。 编码方案允许通过使用频率无关的信道间相位和幅度差来将多维声源中的每一个与二维或三维位置定位相关联。 解码器基于双声道立体声信号中的2-D或3-D方向提示的频域空间分析,并且使用任何优选的空间化技术重新合成这些线索,从而允许忠实地再现位置音频线索和混响 或环境提示超过任意多声道扬声器再现格式或通过耳机,同时保留源分离,尽管中间编码只有两个音频通道。

    Correlation-based method for ambience extraction from two-channel audio signals
    3.
    发明授权
    Correlation-based method for ambience extraction from two-channel audio signals 有权
    从双声道音频信号提取气氛的相关方法

    公开(公告)号:US08107631B2

    公开(公告)日:2012-01-31

    申请号:US12196239

    申请日:2008-08-21

    IPC分类号: H04R5/00

    CPC分类号: H04S7/30 G10L19/008 H04S3/002

    摘要: A method of ambience extraction includes analyzing an input signal to determine the time-dependent and frequency-dependent amount of ambience in the input signal, wherein the amount of ambience is determined based on a signal model and correlation quantities computed from the input signals and wherein the ambience is extracted using a multiplicative time-frequency mask. Another method of ambience extraction includes compensating a bias in the estimation of a short-term cross-correlation coefficient. In addition, systems having various modules for implementing the above methods are disclosed.

    摘要翻译: 一种环境提取方法包括分析输入信号以确定输入信号中的时间依赖性和与频率相关的环境量,其中基于从输入信号计算的信号模型和相关量来确定环境量,并且其中 使用乘法时频掩码提取环境。 另一种环境提取方法包括补偿短期互相关系数估计中的偏差。 此外,公开了具有用于实现上述方法的各种模块的系统。

    Spatial audio analysis and synthesis for binaural reproduction and format conversion
    6.
    发明授权
    Spatial audio analysis and synthesis for binaural reproduction and format conversion 有权
    双耳再现和格式转换的空间音频分析和综合

    公开(公告)号:US08374365B2

    公开(公告)日:2013-02-12

    申请号:US12243963

    申请日:2008-10-01

    IPC分类号: H04R5/02

    摘要: A frequency-domain method for format conversion or reproduction of 2-channel or multi-channel audio signals such as recordings is described. The reproduction is based on spatial analysis of directional cues in the input audio signal and conversion of these cues into audio output signal cues for two or more channels in the frequency domain.

    摘要翻译: 描述用于格式转换或再现诸如记录的2声道或多声道音频信号的频域方法。 再现是基于输入音频信号中的方向提示的空间分析,并且将这些提示转换成频域中两个或更多个频道的音频输出信号提示。

    CORRELATION-BASED METHOD FOR AMBIENCE EXTRACTION FROM TWO-CHANNEL AUDIO SIGNALS
    7.
    发明申请
    CORRELATION-BASED METHOD FOR AMBIENCE EXTRACTION FROM TWO-CHANNEL AUDIO SIGNALS 有权
    用于从两声道音频信号中提取的相关性方法

    公开(公告)号:US20090092258A1

    公开(公告)日:2009-04-09

    申请号:US12196239

    申请日:2008-08-21

    IPC分类号: H04R5/00

    CPC分类号: H04S7/30 G10L19/008 H04S3/002

    摘要: A method of ambience extraction includes analyzing an input signal to determine the time-dependent and frequency-dependent amount of ambience in the input signal, wherein the amount of ambience is determined based on a signal model and correlation quantities computed from the input signals and wherein the ambience is extracted using a multiplicative time-frequency mask. Another method of ambience extraction includes compensating a bias in the estimation of a short-term cross-correlation coefficient. In addition, systems having various modules for implementing the above methods are disclosed.

    摘要翻译: 一种环境提取方法包括分析输入信号以确定输入信号中的时间依赖性和与频率相关的环境量,其中基于从输入信号计算的信号模型和相关量来确定环境量,其中 使用乘法时频掩码提取环境。 另一种环境提取方法包括补偿短期互相关系数估计中的偏差。 此外,公开了具有用于实现上述方法的各种模块的系统。

    Method for segmenting audio signals
    9.
    发明授权
    Method for segmenting audio signals 有权
    分割音频信号的方法

    公开(公告)号:US08521529B2

    公开(公告)日:2013-08-27

    申请号:US10907851

    申请日:2005-04-18

    IPC分类号: G10L15/06

    摘要: An input signal is converted to a feature-space representation. The feature-space representation is projected onto a discriminant subspace using a linear discriminant analysis transform to enhance the separation of feature clusters. Dynamic programming is used to find global changes to derive optimal cluster boundaries. The cluster boundaries are used to identify the segments of the audio signal.

    摘要翻译: 输入信号被转换为特征空间表示。 使用线性判别分析变换将特征空间表示投影到判别子空间上,以增强特征簇的分离。 动态编程用于查找全局变化以导出最佳集群边界。 簇边界用于识别音频信号的段。

    Adaptive primary-ambient decomposition of audio signals
    10.
    发明授权
    Adaptive primary-ambient decomposition of audio signals 有权
    音频信号的自适应初级环境分解

    公开(公告)号:US08204237B2

    公开(公告)日:2012-06-19

    申请号:US12416099

    申请日:2009-03-31

    IPC分类号: H04R29/00

    CPC分类号: H04S3/008 G10L19/008

    摘要: A stereo audio signal is processed to determine primary and ambient components by transforming the signal into vectors corresponding to subband signals, and decomposing the left and right channel vectors into ambient and primary components by matrix and vector operations. Principal component analysis is used to determine a primary component unit vector, and ambience components are determined according to a correlation-based cross-fade or an orthogonal basis derivation.

    摘要翻译: 处理立体声音频信号以通过将信号变换成对应于子带信号的矢量来确定主要和环境分量,并且通过矩阵和矢量操作将左和右声道矢量分解为环境和主要分量。 主分量分析用于确定主要分量单位向量,并且根据基于相关的交叉淡入淡出或正交基础推导来确定氛围分量。