摘要:
For purposes of noise suppression, spectral subtraction filtering is performed in sample-wise fashion in the time domain using a time-domain representation of a spectral subtraction gain function computed in block-wise fashion in the frequency domain. By continuously performing time-domain filtering on a sample by sample basis, the disclosed methods and apparatus avoid block-processing delays associated with frequency-domain based spectral subtraction systems. Consequently, the disclosed methods and apparatus are particularly well suited for applications requiring very short processing delays. In applications where only stationary, low-energy background noise is present, computational complexity is reduced by generating a number of separate spectral subtraction gain functions during an initialization period, each gain function being suitable for one of several predefined classes of input signal (e.g., for one of several predetermined signal energy ranges), and thereafter fixing the several gain functions until the input signal characteristics change.
摘要:
Methods and apparatus for providing speech enhancement in noise reduction systems include spectral subtraction algorithms using linear convolution, causal filtering and/or spectrum dependent exponential averaging of the spectral subtraction gain function. According to exemplary embodiments, successive blocks of a spectral subtraction gain function are averaged based on a discrepancy between an estimate of a spectral density of a noisy speech signal and an averaged estimate of a spectral density of a noise component of the noisy speech signal. The successive gain function blocks are averaged, for example, using controlled exponential averaging. Control is provided, for example, by making a memory of the exponential averaging inversely proportional to the discrepancy. Alternatively, the averaging memory can be made to increase in direct proportion with decreases in the discrepancy, while exponentially decaying with increases in the discrepancy to prevent audible voice shadows.