MULTIPLE INPUT MULTIPLE OUTPUT (MIMO) AUDIO SIGNAL PROCESSING FOR SPEECH DE-REVERBERATION

    公开(公告)号:WO2018119467A1

    公开(公告)日:2018-06-28

    申请号:PCT/US2017/068358

    申请日:2017-12-22

    Abstract: Audio signal processing for adaptive de-reverberation uses a least mean squares (LMS) filter that has improved convergence over conventional LMS filters, making embodiments practical for reducing the effects of reverberation for use in many portable and embedded devices, such as smartphones, tablets, laptops, and hearing aids, for applications such as speech recognition and audio communication in general. The LMS filter employs a frequency-dependent adaptive step size to speed up the convergence of the predictive filter process, requiring fewer computational steps compared to a conventional LMS filter applied to the same inputs. The improved convergence is achieved at low memory consumption cost. Controlling the updates of the prediction filter in a high non-stationary condition of the acoustic channel improves the performance under such conditions. The techniques are suitable for single or multiple channels and are applicable to microphone array processing.

    TECHNIQUES FOR EMPIRICAL MODE DECOMPOSITION (EMD)-BASED SIGNAL DE-NOISING USING STATISTICAL PROPERTIES OF INTRINSIC MODE FUNCTIONS (IMFS)
    2.
    发明申请
    TECHNIQUES FOR EMPIRICAL MODE DECOMPOSITION (EMD)-BASED SIGNAL DE-NOISING USING STATISTICAL PROPERTIES OF INTRINSIC MODE FUNCTIONS (IMFS) 审中-公开
    利用本征模函数(IMFS)的统计特性实现经验模式分解(EMD)信号去噪的技术

    公开(公告)号:WO2017205382A1

    公开(公告)日:2017-11-30

    申请号:PCT/US2017/034017

    申请日:2017-05-23

    Abstract: Techniques for EMD-based signal de-noising are disclosed that use statistical characteristics of IMFs to identify information-carrying IMFs for the purposes of partially reconstructing the identified relevant IMFs into a de-noised signal. The present disclosure has identified that the statistical characteristics of IMFs with noise tend to follow a generalized Gaussian distribution (GGD) versus only a Gaussian or Laplace distribution. Accordingly, a framework for relevant IMF selection is disclosed that includes, in part, performing a null hypothesis test against a distribution of each IMF derived from the use of a generalized probability density function (PDF). IMFs that contribute more noise than signal may thus be identified through the null hypothesis test. Conversely, the aspects and embodiments disclosed herein enable the determination of which IMFs have a contribution of more signal than noise. Thus, a signal may be partially reconstructed based on the predominately information-carrying IMFs to result in de-noised output signal.

    Abstract translation: 公开了用于基于EMD的信号去噪的技术,其使用IMF的统计特性来识别信息承载的IMF,用于将所识别的相关IMF部分重构为去噪信号。 本公开已经识别出具有噪声的IMF的统计特性倾向于遵循广义高斯分布(GGD)而不是高斯或拉普拉斯分布。 因此,公开了用于相关IMF选择的框架,其部分地包括对由使用广义概率密度函数(PDF)导出的每个IMF的分布执行零假设测试。 因此可以通过零假设检验来识别比信号贡献更多噪声的IMF。 相反,本文公开的方面和实施例使得能够确定哪些IMF具有比噪声更多的信号的贡献。 因此,可以基于主要携带信息的IMF来部分重构信号以导致降噪输出信号。

    信号処理装置、信号処理方法、および信号処理プログラム
    3.
    发明申请
    信号処理装置、信号処理方法、および信号処理プログラム 审中-公开
    信号处理装置,信号处理方法和信号处理程序

    公开(公告)号:WO2017002525A1

    公开(公告)日:2017-01-05

    申请号:PCT/JP2016/066481

    申请日:2016-06-02

    Abstract: 拡散性の妨害音を精度よく推定するため、目的音と妨害音とが混在する環境において収集された音を処理する信号処理装置であって、目的音と妨害音とが混在する環境において入力した第1入力音に基づいて生成された第1入力信号と、前記環境において入力した第2入力音に基づいて生成された第2入力信号と、の位相差を出力する位相差出力手段と、位相差と第1入力信号とに基づいて、推定妨害音信号を生成する生成手段と、を備えたことを特徴とする。

    Abstract translation: 为了准确估计可扩散干扰声音,该信号处理装置处理在目标声音与可扩散干扰声音混合的环境中收集的声音,其特征在于包括相位差输出装置,其输出第一输入 基于从目标声音与干扰声音混合的环境输入的第一输入声音产生的信号,以及基于从环境输入的第二输入声音而生成的第二输入信号,以及生成装置, 基于相位差和第一输入信号的估计干扰声信号。

    AUDIO SOURCE SEPARATION WITH SOURCE DIRECTION DETERMINATION BASED ON ITERATIVE WEIGHTING
    4.
    发明申请
    AUDIO SOURCE SEPARATION WITH SOURCE DIRECTION DETERMINATION BASED ON ITERATIVE WEIGHTING 审中-公开
    基于迭代加权的源路确定的音频源分离

    公开(公告)号:WO2016183367A1

    公开(公告)日:2016-11-17

    申请号:PCT/US2016/032189

    申请日:2016-05-12

    Abstract: Example embodiments disclosed herein relate to audio source separation with source direction determined based on iterative weighted component analysis. A method of separating audio sources in audio content is disclosed. The audio content includes a plurality of channels. The method includes obtaining multiple data samples from multiple time-frequency tiles of the audio content. The method also includes analyzing the data samples to generate multiple components in a plurality of iterations, wherein each of the components indicates a direction with a variance of the data samples, and wherein in each of the plurality of iterations, each of the data samples is weighted with a weight that is determined based on a selected component from the multiple components. The method further includes determining a source direction of the audio content based on the selected component for separating an audio source from the audio content. Corresponding system and computer program product of separating audio sources in audio content are also disclosed.

    Abstract translation: 本文公开的示例实施例涉及基于迭代加权分量分析确定的源方向的音源分离。 公开了一种分离音频内容中的音频源的方法。 音频内容包括多个频道。 该方法包括从音频内容的多个时频瓦片中获取多个数据样本。 该方法还包括分析数据样本以在多个迭代中产生多个分量,其中每个分量指示具有数据样本的方差的方向,并且其中在多个迭代中的每一个中,每个数据样本是 以基于来自多个分量的所选择的分量确定的权重加权。 该方法还包括基于所选择的组件来确定音频内容的源方向,用于将音频源与音频内容分开。 还公开了音频内容中分离音频源的相应系统和计算机程序产品。

    AUDIO SIGNAL LEVEL ESTIMATION IN CAMERAS
    5.
    发明申请
    AUDIO SIGNAL LEVEL ESTIMATION IN CAMERAS 审中-公开
    摄像机中的音频信号电平估计

    公开(公告)号:WO2016081039A1

    公开(公告)日:2016-05-26

    申请号:PCT/US2015/047269

    申请日:2015-08-27

    Applicant: GOPRO, INC.

    Abstract: A camera system includes a first microphone, a second microphone, and a microphone controller. The first microphone and the second microphone are configured to capture audio over a time interval to produce a first captured audio signal and a second captured audio signal, respectively. The second captured audio signal is dampened relative to the first captured audio signal by a dampening factor. The microphone controller is configured to store the first captured audio signal in response to a determination that the first captured audio signal does not clip. In response to a determination that the first captured audio signal clips, the microphone controller is configured to identify a gain between the first captured audio signal and the second captured audio signal representative of the dampening factor, amplify the second captured audio signal based on the identified gain, and store the amplified second captured audio signal.

    Abstract translation: 相机系统包括第一麦克风,第二麦克风和麦克风控制器。 第一麦克风和第二麦克风被配置为在一段时间间隔内捕获音频以分别产生第一捕获音频信号和第二捕获音频信号。 第二捕获的音频信号相对于第一捕获的音频信号被阻尼因子衰减。 麦克风控制器被配置为响应于确定第一捕获的音频信号不被剪切而存储第一捕获的音频信号。 响应于确定第一捕获的音频信号剪辑,麦克风控制器被配置为识别代表衰减因子的第一捕获音频信号和第二捕获音频信号之间的增益,基于所识别的音频信号放大第二捕获音频信号 增益并存储放大的第二捕获音频信号。

    BINAURALLY INTEGRATED CROSS-CORRELATION AUTO-CORRELATION MECHANISM
    6.
    发明申请
    BINAURALLY INTEGRATED CROSS-CORRELATION AUTO-CORRELATION MECHANISM 审中-公开
    双向综合交叉相关自动关联机制

    公开(公告)号:WO2016025812A1

    公开(公告)日:2016-02-18

    申请号:PCT/US2015/045239

    申请日:2015-08-14

    Inventor: BRAASCH, Jonas

    Abstract: A sound processing system, method and program product for estimating parameters from binaural audio data. A system is provided having: a system for inputting binaural audio; and a binaural signal analyzer (BICAM) that: performs autocorrelation on both the first channel and second channel to generate a pair of autocorrelation functions; performs a first layer cross-correlation between the first channel and second channel to generate a first layer cross-correlation function; removes the center peak from the first layer cross-correlation function and a selected autocorrelation function to create a modified pair; performs a second layer cross-correlation between the modified pair to determine a temporal mismatch; generates a resulting function by replacing the first layer cross correlation function with the selected autocorrelation function using the temporal mismatch; and utilizes the resulting function to determine ITD parameters and interaural level difference ILD parameters of the direct sound components and reflected sound components.

    Abstract translation: 一种用于从双耳音频数据估计参数的声音处理系统,方法和程序产品。 提供一种具有:用于输入双耳音频的系统的系统; 以及双向信号分析器(BICAM),其在第一通道和第二通道两者上执行自相关以产生一对自相关函数; 在第一信道和第二信道之间执行第一层互相关,以产生第一层互相关函数; 从第一层互相关函数和选择的自相关函数中去除中心峰以创建修改的对; 在所述修改的对之间执行第二层互相关以确定时间不匹配; 通过使用时间不匹配将所选择的自相关函数替换为第一层互相关函数来产生结果函数; 并利用所得到的函数来确定直接声音分量和反射声音分量的ITD参数和眶内水平差ILD参数。

    信号処理装置、信号処理方法、および信号処理プログラム
    7.
    发明申请
    信号処理装置、信号処理方法、および信号処理プログラム 审中-公开
    信号处理装置,信号处理方法和信号处理程序

    公开(公告)号:WO2015141103A1

    公开(公告)日:2015-09-24

    申请号:PCT/JP2014/084617

    申请日:2014-12-26

    Abstract:  所望の信号成分を除去せずに、雑音成分だけを除去するため、少なくとも2つのチャンネルから、所望信号と雑音信号が混在する少なくとも2つの入力信号を入力し、少なくとも2つの入力信号の間で相関を有する雑音信号を除去する相関雑音除去手段と、相関雑音除去手段の出力信号と少なくとも2つの入力信号に含まれる少なくとも1つの入力信号との位相差に基づいて、相関雑音除去手段の出力信号に含まれる残留雑音を除去する残留雑音除去手段と、を備えた信号処理装置。

    Abstract translation: 为了仅去除噪声分量而不去除期望的信号分量,该信号处理装置包括:用于去除相关噪声的装置,其从至少两个信道接收具有期望信号和混合在其中的噪声信号的至少两个输入信号, 并且去除在所述至少两个输入信号之间相关的噪声信号; 以及用于去除残留噪声的装置,其基于从用于消除相关噪声的装置输出的信号与包括在相关噪声中的至少一个输入信号之间的相位差,去除从用于消除相关噪声的装置输出的信号中包括的残留噪声 至少两个输入信号。

    信号処理装置、方法及びプログラム
    8.
    发明申请
    信号処理装置、方法及びプログラム 审中-公开
    信号处理设备,方法和程序

    公开(公告)号:WO2015129760A1

    公开(公告)日:2015-09-03

    申请号:PCT/JP2015/055442

    申请日:2015-02-25

    Abstract:  雑音抑制性能を従来よりも向上させた信号処理技術を提供することを目的とする。第一成分抽出部14は、ターゲットエリアのパワースペクトル密度^φ S (ω,τ)から、ターゲットエリアから到来する音に由来する非定常成分^φ S (A) (ω,τ)及びインコヒーレントな雑音に由来する定常成分^φ S (B) (ω,τ)を時間平均処理により抽出する。第二成分抽出部15は、雑音エリアのパワースペクトル密度^φ N (ω,τ)から、干渉雑音に由来する非定常成分^φ N (A) (ω,τ)及びインコヒーレントな雑音に由来する定常成分^φ N (B) (ω,τ)を抽出する。

    Abstract translation: 本发明的目的是提供一种信号处理技术,其相对于现有的这种技术显示出改善的降噪性能。 第一分量提取单元(14)使用时间平均处理从目标区域从功率谱密度(φφS(ω,τ))中提取非平稳分量(φφ(A)(ω, τ))和与非相干噪声相关联的静止分量(^φS(B)(ω,τ))。 第二分量提取单元(15)从噪声区域的功率谱密度(^φN(ω,τ))中提取与干扰噪声相关联的非平稳分量(^φN(A)(ω,τ)) 以及与非相干噪声相关联的固定分量(^φN(B)(ω,τ))。

    SYSTEM FOR AUDIO ANALYSIS AND PERCEPTION ENHANCEMENT
    9.
    发明申请
    SYSTEM FOR AUDIO ANALYSIS AND PERCEPTION ENHANCEMENT 审中-公开
    音频分析和听觉增强系统

    公开(公告)号:WO2015122785A1

    公开(公告)日:2015-08-20

    申请号:PCT/NZ2015/050014

    申请日:2015-02-13

    Abstract: An audio perception system is described, comprising a capture module configured to capture acoustic speech signal information; a feature extraction module configured to extract features that identify a candidate unvoiced portion in an acoustic signal; a classification module configured to identify if the acoustic signal is or contains an unvoiced portion based on the extracted features; and a control module configured to generate a control signal to a sensory stimulation actuator for generating an aero- tactile stimulation to be delivered to a listener, the control signal based at least in part on a signal representing the identified unvoiced portion. Related methods are also described.

    Abstract translation: 描述了一种音频感知系统,包括被配置为捕获声学语音信号信息的捕获模块; 特征提取模块,被配置为提取识别声信号中的候选清音部分的特征; 分类模块,被配置为基于所提取的特征来识别声信号是否包含无声部分; 以及控制模块,被配置为产生到感觉刺激致动器的控制信号,用于产生要传送到收听者的航空触觉刺激,所述控制信号至少部分地基于表示所识别的无声部分的信号。 还描述了相关方法。

    LONG TERM MONITORING OF TRANSMISSION AND VOICE ACTIVITY PATTERNS FOR REGULATING GAIN CONTROL
    10.
    发明申请
    LONG TERM MONITORING OF TRANSMISSION AND VOICE ACTIVITY PATTERNS FOR REGULATING GAIN CONTROL 审中-公开
    用于调节增益控制的传输和语音活动模式的长期监测

    公开(公告)号:WO2014043024A1

    公开(公告)日:2014-03-20

    申请号:PCT/US2013/058735

    申请日:2013-09-09

    Abstract: The present document relates to audio communication systems. In particular, the present document relates to the control of the level of audio signals within audio communication systems. A method for leveling a near-end audio signal (211) using a leveling gain (214) is described. The near-end audio signal (211) comprises a sequence of segments, wherein the sequence of segments comprises a current segment and one or more preceding segments. The method comprises determining a nuisance measure (416) which is indicative of an amount of aberrant voice activity within the sequence of segments of the near-end audio signal (211); and determining the leveling gain (214) for the current segment of the near-end audio signal (211), at least based on the leveling gain (214) for the one or more preceding segments of the near-end audio signal (211), and by taking into account - according to a variable degree - an estimate of the level of the current segment of the near-end audio signal (211); wherein the variable degree is dependent on the nuisance measure (416).

    Abstract translation: 本文件涉及音频通信系统。 特别地,本文件涉及音频通信系统中的音频信号的级别的控制。 描述了使用调平增益(214)来调平近端音频信号(211)的方法。 近端音频信号(211)包括段序列,其中片段序列包括当前片段和一个或多个先前片段。 该方法包括确定指示近端音频信号(211)的段的序列内的异常语音活动量的扰动度量(416); 以及至少基于近端音频信号(211)的一个或多个先前段的调平增益(214)确定近端音频信号(211)的当前段的调平增益(214) 并且根据可变程度考虑近端音频信号(211)的当前段的电平的估计; 其中所述可变度取决于所述妨扰措施(416)。

Patent Agency Ranking