MULTI-MICROPHONE METHOD FOR ESTIMATION OF TARGET AND NOISE SPECTRAL VARIANCES FOR SPEECH DEGRADED BY REVERBERATION AND OPTIONALLY ADDITIVE NOISE
    6.
    发明申请
    MULTI-MICROPHONE METHOD FOR ESTIMATION OF TARGET AND NOISE SPECTRAL VARIANCES FOR SPEECH DEGRADED BY REVERBERATION AND OPTIONALLY ADDITIVE NOISE 有权
    用于估计通过反演和可选添加噪声降低语音的目标和噪声频谱变量的多麦克风方法

    公开(公告)号:US20150256956A1

    公开(公告)日:2015-09-10

    申请号:US14640664

    申请日:2015-03-06

    Applicant: Oticon A/S

    Abstract: The application relates to an audio processing system and a method of processing a noisy (e.g. reverberant) signal comprising first (v) and optionally second (w) noise signal components and a target signal component (x), the method comprising a) Providing or receiving a time-frequency representation Yi(k,m) of a noisy audio signal yi at an ith input unit, i=1, 2, . . . , M, where M≧2; b) Providing (e.g. predefined spatial) characteristics of said target signal component and said noise signal component(s); and c) Estimating spectral variances or scaled versions thereof λV, λX of said first noise signal component v (representing reverberation) and said target signal component x, respectively, said estimates of λV and λX being jointly optimal in maximum likelihood sense, based on the statistical assumptions that a) the time-frequency representations Yi(k,m), Xi(k,m), and Vi(k,m) (and Wi(k,m)) of respective signals yi(n), and signal components xi and vi (and wi) are zero-mean, complex-valued Gaussian distributed, b) that each of them are statistically independent across time m and frequency k, and c) that Xi(k,m) and Vi(k,m) (and Wi(k,m)) are uncorrelated. An advantage of the invention is that it provides the basis for an improved intelligibility of an input speech signal. The invention may e.g. be used for hearing assistance devices, e.g. hearing aids.

    Abstract translation: 该应用涉及音频处理系统和处理包括第一(v)和可选的第二(w)个噪声信号分量和目标信号分量(x)的噪声(例如混响)信号的方法,该方法包括:a)提供或 在第i个输入单元接收噪声音频信号yi的时间频率表示Yi(k,m),i = 1,2。 。 。 ,M,其中M≥2; b)提供所述目标信号分量和所述噪声信号分量的(例如预定义的空间)特性; 以及c)分别估计所述第一噪声信号分量v(表示混响)和所述目标信号分量x的λV,λX,λV和λX的估计在最大似然意义上是共同最优的,基于 统计假设a)各个信号yi(n)的时间频率表示Yi(k,m),Xi(k,m)和Vi(k,m)(和Wi(k,m))和信号 组分xi和vi(和wi)是零均值,复值高斯分布,b)它们中的每一个在时间m和频率k之间是统计学独立的,以及c)Xi(k,m)和Vi(k, m)(和Wi(k,m))是不相关的。 本发明的优点在于它提供了输入语音信号的改进的可懂度的基础。 本发明可以例如 用于听力辅助装置,例如 助听器。

Patent Agency Ranking