Adaptive interchannel discriminative rescaling filter

    Publication number: US10013997B2

    Publication date: 2018-07-03

    Application number: US14938816

    Filing date: 2015-11-11

    CPC classification number: G10L21/0232 G10L21/0208 G10L25/84 G10L2021/02165

    Abstract: A method for adjusting a degree of filtering applied to an audio signal includes modeling a probability density function (PDF) of a fast Fourier transform (FFT) coefficient of a primary channel and of a reference channel of the audio signal, and maximizing at least one of the PDFs to provide a discriminative relevance difference (DRD) between a noise magnitude estimate of the reference channel and a noise magnitude estimate of the primary channel. The method further includes emphasizing the primary channel when the spectral magnitude of the primary channel is stronger than the spectral magnitude of the reference channel, and deemphasizing the primary channel when the spectral magnitude of the reference channel is stronger than the spectral magnitude of the primary channel. The emphasizing and deemphasizing include computing a multiplicative rescaling factor and applying the multiplicative rescaling factor to a gain computed in a prior stage of a speech enhancement filter chain when there is a prior stage, and directly applying a gain when there is no prior stage.
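
The emphasize/deemphasize step in the abstract above can be illustrated with a minimal sketch. It is not the patented method: the PDF modeling and the DRD are not implemented, and a clamped magnitude ratio is swapped in as a stand-in for the rescaling factor. The function names and the `slope`, `floor`, and `ceiling` parameters are hypothetical, chosen only for illustration.

```python
import math

def rescaling_factor(primary_mag, reference_mag, slope=1.0, floor=0.25, ceiling=2.0):
    """Hypothetical per-bin multiplicative factor (assumption, not from the patent).

    Returns a value above 1 when the primary magnitude is stronger than the
    reference magnitude, and below 1 when the reference is stronger.
    """
    eps = 1e-12  # avoids division by zero and log(0) on silent bins
    log_ratio = math.log((primary_mag + eps) / (reference_mag + eps))
    factor = math.exp(slope * log_ratio)
    # Clamp so a single bin can never be boosted or cut without bound.
    return min(max(factor, floor), ceiling)

def apply_rescaling(primary_mags, reference_mags, prior_gains=None):
    """Rescale prior-stage gains, or use the factor as the gain if none exist."""
    factors = [rescaling_factor(p, r) for p, r in zip(primary_mags, reference_mags)]
    if prior_gains is None:
        # No prior stage: the factor is applied directly as the gain.
        return factors
    # Prior stage present: multiply its per-bin gain by the factor.
    return [g * f for g, f in zip(prior_gains, factors)]
```

Calling `apply_rescaling(primary, reference, prior_gains)` with per-bin spectral magnitudes returns one gain per bin. The clamp is a design choice of this sketch, made to keep the output bounded.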

    Multi-aural MMSE analysis techniques for clarifying audio signals

    Publication number: US10149047B2

    Publication date: 2018-12-04

    Application number: US14308541

    Filing date: 2014-06-18

    Abstract: Techniques for processing audio signals include removing noise from the audio signals or otherwise clarifying the audio signals prior to outputting the audio signals. The disclosed techniques may employ minimum mean squared error (MMSE) analyses on audio signals received from a primary microphone and at least one reference microphone, and may use the MMSE analyses to reduce or eliminate noise from the audio signals received by the primary microphone. Optionally, confidence intervals may be assigned to different frequency bands of an audio signal, with each confidence interval corresponding to a likelihood that its respective frequency band includes targeted audio, and each confidence interval representing a contribution of its respective frequency band in a reconstructed audio signal from which noise has been removed.
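
The per-band weighting described in the abstract above can be illustrated with a minimal sketch. It is an assumption-laden stand-in, not the patented analysis: a Wiener-style weight `1 - noise/primary` is used in place of the MMSE analysis, the reference-microphone power is treated as the noise estimate, and a single scalar weight per band stands in for the abstract's "confidence interval". All function names are hypothetical.

```python
def band_confidences(primary_power, reference_power):
    """Hypothetical per-band weights in [0, 1] (assumption, not from the patent).

    The reference-microphone power is treated as the noise estimate, so a band
    where the primary is much stronger than the reference gets a weight near 1.
    """
    confidences = []
    for p, r in zip(primary_power, reference_power):
        if p <= 0.0:
            # An empty band carries no targeted audio.
            confidences.append(0.0)
        else:
            confidences.append(min(1.0, max(0.0, 1.0 - r / p)))
    return confidences

def reconstruct(primary_bands, confidences):
    """Scale each band of the primary signal by its weight."""
    return [c * x for c, x in zip(confidences, primary_bands)]
```

Here each weight sets how much its band contributes to the reconstructed signal, which mirrors the role the abstract assigns to the confidence intervals.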
