Spatial audio analysis and synthesis for binaural reproduction and format conversion
    1.
    发明授权
    Spatial audio analysis and synthesis for binaural reproduction and format conversion 有权
    双耳再现和格式转换的空间音频分析和综合

    公开(公告)号:US08374365B2

    公开(公告)日:2013-02-12

    申请号:US12243963

    申请日:2008-10-01

    IPC分类号: H04R5/02

    摘要: A frequency-domain method for format conversion or reproduction of 2-channel or multi-channel audio signals such as recordings is described. The reproduction is based on spatial analysis of directional cues in the input audio signal and conversion of these cues into audio output signal cues for two or more channels in the frequency domain.

    摘要翻译: 描述用于格式转换或再现诸如记录的2声道或多声道音频信号的频域方法。 再现是基于输入音频信号中的方向提示的空间分析,并且将这些提示转换成频域中两个或更多个频道的音频输出信号提示。

    SPATIAL AUDIO ANALYSIS AND SYNTHESIS FOR BINAURAL REPRODUCTION AND FORMAT CONVERSION
    2.
    发明申请
    SPATIAL AUDIO ANALYSIS AND SYNTHESIS FOR BINAURAL REPRODUCTION AND FORMAT CONVERSION 有权
    空间音频分析和合成生物复制和格式转换

    公开(公告)号:US20090252356A1

    公开(公告)日:2009-10-08

    申请号:US12243963

    申请日:2008-10-01

    IPC分类号: H04R5/02

    摘要: A frequency-domain method for format conversion or reproduction of 2-channel or multi-channel audio signals such as recordings is described. The reproduction is based on spatial analysis of directional cues in the input audio signal and conversion of these cues into audio output signal cues for two or more channels in the frequency domain.

    摘要翻译: 描述用于格式转换或再现诸如记录的2声道或多声道音频信号的频域方法。 再现是基于输入音频信号中的方向提示的空间分析,并且将这些提示转换成频域中两个或更多个频道的音频输出信号提示。

    CORRELATION-BASED METHOD FOR AMBIENCE EXTRACTION FROM TWO-CHANNEL AUDIO SIGNALS
    4.
    发明申请
    CORRELATION-BASED METHOD FOR AMBIENCE EXTRACTION FROM TWO-CHANNEL AUDIO SIGNALS 有权
    用于从两声道音频信号中提取的相关性方法

    公开(公告)号:US20090092258A1

    公开(公告)日:2009-04-09

    申请号:US12196239

    申请日:2008-08-21

    IPC分类号: H04R5/00

    CPC分类号: H04S7/30 G10L19/008 H04S3/002

    摘要: A method of ambience extraction includes analyzing an input signal to determine the time-dependent and frequency-dependent amount of ambience in the input signal, wherein the amount of ambience is determined based on a signal model and correlation quantities computed from the input signals and wherein the ambience is extracted using a multiplicative time-frequency mask. Another method of ambience extraction includes compensating a bias in the estimation of a short-term cross-correlation coefficient. In addition, systems having various modules for implementing the above methods are disclosed.

    摘要翻译: 一种环境提取方法包括分析输入信号以确定输入信号中的时间依赖性和与频率相关的环境量,其中基于从输入信号计算的信号模型和相关量来确定环境量,其中 使用乘法时频掩码提取环境。 另一种环境提取方法包括补偿短期互相关系数估计中的偏差。 此外,公开了具有用于实现上述方法的各种模块的系统。

    Correlation-based method for ambience extraction from two-channel audio signals
    6.
    发明授权
    Correlation-based method for ambience extraction from two-channel audio signals 有权
    从双声道音频信号提取气氛的相关方法

    公开(公告)号:US08107631B2

    公开(公告)日:2012-01-31

    申请号:US12196239

    申请日:2008-08-21

    IPC分类号: H04R5/00

    CPC分类号: H04S7/30 G10L19/008 H04S3/002

    摘要: A method of ambience extraction includes analyzing an input signal to determine the time-dependent and frequency-dependent amount of ambience in the input signal, wherein the amount of ambience is determined based on a signal model and correlation quantities computed from the input signals and wherein the ambience is extracted using a multiplicative time-frequency mask. Another method of ambience extraction includes compensating a bias in the estimation of a short-term cross-correlation coefficient. In addition, systems having various modules for implementing the above methods are disclosed.

    摘要翻译: 一种环境提取方法包括分析输入信号以确定输入信号中的时间依赖性和与频率相关的环境量,其中基于从输入信号计算的信号模型和相关量来确定环境量,并且其中 使用乘法时频掩码提取环境。 另一种环境提取方法包括补偿短期互相关系数估计中的偏差。 此外,公开了具有用于实现上述方法的各种模块的系统。

    Phase-amplitude 3-D stereo encoder and decoder
    8.
    发明授权
    Phase-amplitude 3-D stereo encoder and decoder 有权
    相位振幅三维立体声编码器和解码器

    公开(公告)号:US08712061B2

    公开(公告)日:2014-04-29

    申请号:US12246491

    申请日:2008-10-06

    IPC分类号: H04R5/00

    CPC分类号: H04S3/02 G10L19/008

    摘要: A two-channel phase-amplitude stereo encoding and decoding scheme enabling flexible and spatially accurate interactive 3-D audio reproduction via standard audio-only two-channel transmission. The encoding scheme allows associating a 2-D or 3-D positional localization to each of a plurality of sound sources by use of frequency independent inter-channel phase and amplitude differences. The decoder is based on frequency-domain spatial analysis of 2-D or 3-D directional cues in a two-channel stereo signal and re-synthesis of these cues using any preferred spatialization technique, thereby allowing faithful reproduction of positional audio cues and reverberation or ambient cues over arbitrary multi-channel loudspeaker reproduction formats or over headphones, while preserving source separation despite the intermediate encoding over only two audio channels.

    摘要翻译: 双通道相位立体声编码和解码方案,通过标准的仅音频双通道传输实现灵活和空间准确的交互式3-D音频再现。 编码方案允许通过使用频率无关的信道间相位和幅度差来将多维声源中的每一个与二维或三维位置定位相关联。 解码器基于双声道立体声信号中的2-D或3-D方向提示的频域空间分析,并且使用任何优选的空间化技术重新合成这些线索,从而允许忠实地再现位置音频线索和混响 或环境提示超过任意多声道扬声器再现格式或通过耳机,同时保留源分离,尽管中间编码只有两个音频通道。

    SPATIAL AUDIO ENCODING AND REPRODUCTION OF DIFFUSE SOUND
    9.
    发明申请
    SPATIAL AUDIO ENCODING AND REPRODUCTION OF DIFFUSE SOUND 有权
    空间音频编码和复制声音

    公开(公告)号:US20120082319A1

    公开(公告)日:2012-04-05

    申请号:US13228336

    申请日:2011-09-08

    IPC分类号: H03G3/00 G10L19/00

    摘要: A method and apparatus processes multi-channel audio by encoding, transmitting or recording “dry” audio tracks or “stems” in synchronous relationship with time-variable metadata controlled by a content producer and representing a desired degree and quality of diffusion. Audio tracks are compressed and transmitted in connection with synchronized metadata representing diffusion and preferably also mix and delay parameters. The separation of audio stems from diffusion metadata facilitates the customization of playback at the receiver, taking into account the characteristics of local playback environment.

    摘要翻译: 一种方法和装置通过编码,发送或记录与内容制作者控制的时变元数据同步关系并表示所需的扩散程度和质量的“干”音轨或“干”来处理多声道音频。 音频轨道被压缩并且与表示扩散的同步元数据一起发送,并且优选地也是混合和延迟参数。 音频分离源于扩散元数据有助于在接收机上定制播放,同时考虑到本地播放环境的特性。