Apparatus and method for separating sound source
    11.
    发明授权
    Apparatus and method for separating sound source 有权
    声源分离装置及方法

    公开(公告)号:US09049532B2

    公开(公告)日:2015-06-02

    申请号:US13276974

    申请日:2011-10-19

    CPC classification number: H04S7/30 G10H2210/056 G10L19/008

    Abstract: Disclosed are an apparatus and a method for separating sound sources capable of learning distributions of corresponding sound sources based on the assumption that specific sound sources have specific distributions based on interchannel correlation parameter in audio signals providing space perception through a plurality of channels to separate an amount corresponding to energy contribution of the corresponding sound sources from mixture signals. Exemplary embodiments of the present invention can more precisely predict the channel distributions of the specific sound sources included in the input mixture signals and more accurately separate sound sources than a method for separating a sound source based on the channel according to the related art, under conditions that general channel distribution information of the specific sound sources are approximately modeled.

    Abstract translation: 公开了一种用于分离能够学习相应声源的分布的声源的装置和方法,所述设备和方法基于以下假设:特定声源具有基于通过多个声道提供空间感知的音频信号中的通道间相关参数的特定分布,以分离数量 对应于来自混合信号的相应声源的能量贡献。 本发明的示例性实施例可以根据现有技术更准确地预测包括在输入混合信号中的特定声源的信道分布和更准确地分离声源,而不是根据现有技术在信道上分离声源的方法 特定声源的通用通道分配信息被大致建模。

    APPARATUS AND METHOD FOR SEPARATING SOUND SOURCE
    12.
    发明申请
    APPARATUS AND METHOD FOR SEPARATING SOUND SOURCE 有权
    用于分离声源的装置和方法

    公开(公告)号:US20120093341A1

    公开(公告)日:2012-04-19

    申请号:US13276974

    申请日:2011-10-19

    CPC classification number: H04S7/30 G10H2210/056 G10L19/008

    Abstract: Disclosed are an apparatus and a method for separating sound sources capable of learning distributions of corresponding sound sources based on the assumption that specific sound sources have specific distributions based on interchannel correlation parameter in audio signals providing space perception through a plurality of channels to separate an amount corresponding to energy contribution of the corresponding sound sources from mixture signals. Exemplary embodiments of the present invention can more precisely predict the channel distributions of the specific sound sources included in the input mixture signals and more accurately separate sound sources than a method for separating a sound source based on the channel according to the related art, under conditions that general channel distribution information of the specific sound sources are approximately modeled.

    Abstract translation: 公开了一种用于分离能够学习相应声源的分布的声源的装置和方法,所述设备和方法基于以下假设:特定声源具有基于通过多个声道提供空间感知的音频信号中的通道间相关参数的特定分布,以分离数量 对应于来自混合信号的相应声源的能量贡献。 本发明的示例性实施例可以根据现有技术更准确地预测包括在输入混合信号中的特定声源的信道分布和更准确地分离声源,而不是根据现有技术在信道上分离声源的方法 特定声源的通用通道分配信息被大致建模。

    Convolutive blind source separation using relative optimization
    15.
    发明授权
    Convolutive blind source separation using relative optimization 失效
    卷积盲源分离使用相对优化

    公开(公告)号:US07738574B2

    公开(公告)日:2010-06-15

    申请号:US11478212

    申请日:2006-06-29

    CPC classification number: G06K9/6243 G06K9/6245

    Abstract: A method and apparatus for separating a multi-channel mixed signal are provided. The method includes the steps of: a) transforming a temporal domain to a frequency domain by performing a discrete Fourier transform onto at least one of mixed signals inputted from an external device through multi-channel; b) estimating multi-decorrelation by calculating a plurality of cross power spectra for the mixed signal in the transformed frequency domain; c) estimating a separation coefficient of the mixed signal based on relative optimization in order to decorrelate the calculated cross power spectra, where the separation coefficient is serially updated; d) transforming the frequency domain to the temporal domain by performing an inverse discrete Fourier transform on the estimated separation coefficient in the temporal domain; and e) separating an original signal from the mixed signal by filtering the mixed signal using the separation coefficient of the transformed temporal domain.

    Abstract translation: 提供了一种用于分离多通道混合信号的方法和装置。 该方法包括以下步骤:a)通过对通过多信道从外部设备输入的混合信号中的至少一个进行离散付里叶变换,将时域转换为频域; b)通过计算变换频域中的混合信号的多个交叉功率谱来估计多重相关; c)基于相对优化来估计混合信号的分离系数,以便将所计算的交叉功率谱解相关,其中分离系数被连续更新; d)通过对时域中估计的分离系数执行逆离散傅立叶变换,将频域变换到时域; 以及e)使用所述变换的时域的分离系数对所述混合信号进行滤波,从所述混合信号中分离原始信号。

    METHOD FOR CREATING, EDITING, AND REPRODUCING MULTI-OBJECT AUDIO CONTENTS FILES FOR OBJECT-BASED AUDIO SERVICE, AND METHOD FOR CREATING AUDIO PRESETS
    16.
    发明申请
    METHOD FOR CREATING, EDITING, AND REPRODUCING MULTI-OBJECT AUDIO CONTENTS FILES FOR OBJECT-BASED AUDIO SERVICE, AND METHOD FOR CREATING AUDIO PRESETS 有权
    用于创建,编辑和复制用于基于对象的音频服务的多目标音频内容文件的方法以及用于创建音频预设的方法

    公开(公告)号:US20100076577A1

    公开(公告)日:2010-03-25

    申请号:US12527330

    申请日:2008-02-18

    CPC classification number: G11B27/034 G10L19/008 G11B27/3027

    Abstract: Provided are a method for creating, editing and reproducing a multi-object audio content file for an object-based audio service and a method for creating audio presets. The multi-object audio content file creating method includes creating a plurality of frames for each audio object forming an audio content; and creating a multi-object audio content file by grouping and storing the frames according to each reproduction time. This invention can enhance functions of the object-based audio service and make it easy to access to each audio object of an audio content file.

    Abstract translation: 提供了一种用于创建,编辑和再现用于基于对象的音频服务的多对象音频内容文件的方法和用于创建音频预设的方法。 多对象音频内容文件创建方法包括为形成音频内容的每个音频对象创建多个帧; 以及通过根据每个再现时间对帧进行分组和存储来创建多对象音频内容文件。 本发明可以增强基于对象的音频服务的功能,并且可以容易地访问音频内容文件的每个音频对象。

    OBJECT-BASED 3-DIMENSIONAL AUDIO SERVICE SYSTEM USING PRESET AUDIO SCENES
    17.
    发明申请
    OBJECT-BASED 3-DIMENSIONAL AUDIO SERVICE SYSTEM USING PRESET AUDIO SCENES 有权
    使用预置音频场景的基于对象的三维音频服务系统

    公开(公告)号:US20090147961A1

    公开(公告)日:2009-06-11

    申请号:US12300720

    申请日:2007-05-16

    Abstract: Provided are an object-based three dimensional (3-D) audio service system using preset audio scenes and a method thereof. The system and the method are suggested for enabling a user to easily and conveniently watch and listen an object based 3-D audio service by eliminating inconvenience that requires a user to control each of object audio signals of sound sources. The system includes: audio input means for inputting an audio signal; preset audio scene generating means for extracting object audio signals from the audio signal inputted through the audio input means and generating more than one of 3-D audio scene information by arranging the extracted object audio signals in a 3-D space and editing features of each object; and encoding means for encoding and multiplexing the audio signal and the 3-D audio scene information for each object audio signal.

    Abstract translation: 提供了使用预设音频场景的基于对象的三维(3-D)音频服务系统及其方法。 建议使用该系统和方法,以便用户能够通过消除需要用户控制声源的每个对象音频信号的不便,来容易且方便地观看和收听基于对象的3-D音频服务。 该系统包括:用于输入音频信号的音频输入装置; 预设音频场景产生装置,用于从通过音频输入装置输入的音频信号中提取对象音频信号,并通过将提取的对象音频信号排列在3-D空间中并产生多个3-D音频场景信息,并且编辑每个 目的; 以及用于对每个对象音频信号对音频信号和3-D音频场景信息进行编码和多路复用的编码装置。

    Method And Apparatus For Encoding And Decoding Multi-Channel Audio Signal Using Virtual Source Location Information
    19.
    发明申请
    Method And Apparatus For Encoding And Decoding Multi-Channel Audio Signal Using Virtual Source Location Information 有权
    用于使用虚拟源位置信息编码和解码多声道音频信号的方法和装置

    公开(公告)号:US20080167880A1

    公开(公告)日:2008-07-10

    申请号:US11631009

    申请日:2005-07-08

    CPC classification number: G10L19/008 H04S3/002 H04S2420/03

    Abstract: Provided is a method and apparatus for encoding/decoding a multi-channel audio signal. The apparatus for encoding a multi-channel audio signal includes a frame converter for converting the multi-channel audio signal into a framed audio signal; means for downmixing the framed audio signal; means for encoding the downmixed audio signal; a source location information estimator for estimating source location information from the framed multi-channel audio signal; means for quantizing the estimated source location information; and means for multiplexing the encoded audio signal and the quantized source location information, to generate an encoded multi-channel audio signal.

    Abstract translation: 提供了一种用于对多声道音频信号进行编码/解码的方法和装置。 用于编码多声道音频信号的装置包括用于将多声道音频信号转换为成帧音频信号的帧转换器; 用于降低框架音频信号的装置; 用于对下混合音频信号进行编码的装置; 源位置信息估计器,用于从成帧的多声道音频信号估计源位置信息; 用于量化估计的源位置信息的装置; 以及用于复用经编码的音频信号和量化源位置信息的装置,以产生编码的多声道音频信号。

    Convolutive blind source separation using relative optimization
    20.
    发明申请
    Convolutive blind source separation using relative optimization 失效
    卷积盲源分离使用相对优化

    公开(公告)号:US20070058737A1

    公开(公告)日:2007-03-15

    申请号:US11478212

    申请日:2006-06-29

    CPC classification number: G06K9/6243 G06K9/6245

    Abstract: A method and apparatus for separating a multi-channel mixed signal are provided. The method includes the steps of: a) transforming a temporal domain to a frequency domain by performing a discrete Fourier transform onto at least one of mixed signals inputted from an external device through multi-channel; b) estimating multi-decorrelation by calculating a plurality of cross power spectra for the mixed signal in the transformed frequency domain; c) estimating a separation coefficient of the mixed signal based on relative optimization in order to decorrelate the calculated cross power spectra, where the separation coefficient is serially updated; d) transforming the frequency domain to the temporal domain by performing an inverse discrete Fourier transform on the estimated separation coefficient in the temporal domain; and e) separating an original signal from the mixed signal by filtering the mixed signal using the separation coefficient of the transformed temporal domain.

    Abstract translation: 提供了一种用于分离多通道混合信号的方法和装置。 该方法包括以下步骤:a)通过对通过多信道从外部设备输入的混合信号中的至少一个进行离散付里叶变换,将时域转换为频域; b)通过计算变换频域中的混合信号的多个交叉功率谱来估计多重相关; c)基于相对优化来估计混合信号的分离系数,以便将所计算的交叉功率谱解相关,其中分离系数被连续更新; d)通过对时域中估计的分离系数执行逆离散傅立叶变换,将频域变换到时域; 以及e)使用所述变换的时域的分离系数对所述混合信号进行滤波,从所述混合信号中分离原始信号。

Patent Agency Ranking