摘要:
Speech enhancement is provided in dual microphone noise reduction systems by including spectral subtraction algorithms using linear convolution, causal filtering and/or spectrum dependent exponential averaging of the spectral subtraction gain function. According to exemplary embodiments, when a far-mouth microphone is used in conjunction with a near-mouth microphone, it is possible to handle non-stationary background noise as long as the noise spectrum can continuously be estimated from a single block of input samples. The far-mouth microphone, in addition to picking up the background noise, also picks up the speaker's voice, albeit at a lower level than the near-mouth microphone. To enhance the noise estimate, a spectral subtraction stage is used to suppress the speech in the far-mouth microphone signal. To be able to enhance the noise estimate, a rough speech estimate is formed with another spectral subtraction stage from the near-mouth signal. Finally, a third spectral subtraction function is used to enhance the near-mouth signal by suppressing the background noise using the enhanced background noise estimate. A controller dynamically determines any or all of a first, second, and third subtraction factor for each of the first, second, and third spectral subtraction stages, respectively.
摘要:
Methods and apparatus for providing speech enhancement in noise reduction systems include spectral subtraction algorithms using linear convolution, causal filtering and/or spectrum dependent exponential averaging of the spectral subtraction gain function. According to exemplary embodiments, low order spectrum estimates are developed which have less frequency resolution and reduced variance as compared to spectrum estimates in conventional spectral subtraction systems. The low order spectra are used to form a gain function having a desired low variance which in turn reduces musical tones in the spectral subtraction output signal. Advantageously, the gain function can be further smoothed across blocks using input spectrum dependent exponential averaging. Additionally, the low order of the gain function permits a phase to be added during interpolation so that the spectral subtraction gain filter is causal and prevents discontinuities between blocks.
摘要:
For purposes of noise suppression, spectral subtraction filtering is performed in sample-wise fashion in the time domain using a time-domain representation of a spectral subtraction gain function computed in block-wise fashion in the frequency domain. By continuously performing time-domain filtering on a sample by sample basis, the disclosed methods and apparatus avoid block-processing delays associated with frequency-domain based spectral subtraction systems. Consequently, the disclosed methods and apparatus are particularly well suited for applications requiring very short processing delays. Moreover, since the spectral subtraction gain function is computed in a block-wise fashion in the frequency domain, high quality performance in terms of reduced tonal artifacts and low signal distortion is retained.
摘要:
Speech enhancement is provided in dual microphone noise reduction systems by including spectral subtraction algorithms using linear convolution, causal filtering and/or spectrum dependent exponential averaging of the spectral subtraction gain function. According to exemplary embodiments, when a far-mouth microphone is used in conjunction with a near-mouth microphone, it is possible to handle non-stationary background noise as long as the noise spectrum can continuously be estimated from a single block of input samples. The far-mouth microphone, in addition to picking up the background noise, also picks up the speaker's voice, albeit at a lower level than the near-mouth microphone. To enhance the noise estimate, a spectral subtraction stage is used to suppress the speech in the far-mouth microphone signal. To be able to enhance the noise estimate, a rough speech estimate is formed with another spectral subtraction stage from the near-mouth signal. Finally, a third spectral subtraction function is used to enhance the near-mouth signal by suppressing the background noise using the enhanced background noise estimate.
摘要:
For purposes of noise suppression, spectral subtraction filtering is performed in sample-wise fashion in the time domain using a time-domain representation of a spectral subtraction gain function computed in block-wise fashion in the frequency domain. By continuously performing time-domain filtering on a sample by sample basis, the disclosed methods and apparatus avoid block-processing delays associated with frequency-domain based spectral subtraction systems. Consequently, the disclosed methods and apparatus are particularly well suited for applications requiring very short processing delays. In applications where only stationary, low-energy background noise is present, computational complexity is reduced by generating a number of separate spectral subtraction gain functions during an initialization period, each gain function being suitable for one of several predefined classes of input signal (e.g., for one of several predetermined signal energy ranges), and thereafter fixing the several gain functions until the input signal characteristics change.
摘要:
Methods and apparatus for providing speech enhancement in noise reduction systems include spectral subtraction algorithms using linear convolution, causal filtering and/or spectrum dependent exponential averaging of the spectral subtraction gain function. According to exemplary embodiments, successive blocks of a spectral subtraction gain function are averaged based on a discrepancy between an estimate of a spectral density of a noisy speech signal and an averaged estimate of a spectral density of a noise component of the noisy speech signal. The successive gain function blocks are averaged, for example, using controlled exponential averaging. Control is provided, for example, by making a memory of the exponential averaging inversely proportional to the discrepancy. Alternatively, the averaging memory can be made to increase in direct proportion with decreases in the discrepancy, while exponentially decaying with increases in the discrepancy to prevent audible voice shadows.
摘要:
A beamformer is calibrated for use as an acoustic echo canceler in a hands-free communications environment having a loudspeaker and a plurality of microphones. To perform the calibration, a number of adaptive filters are provided in correspondence with each of the microphones, and each of the adaptive filters is trained to model echo properties of the environment as experienced by the corresponding one of the microphones. A target source is activated, thereby generating an acoustic signal that is received by the microphones. The trained adaptive filters are then used to generate jammer signals by, for example, having each one filter a pseudo noise signal. Respective ones of the jammer signals are then combined with corresponding signals supplied by the microphones, thereby generating combination signals. The combination signals are then used to adapt the beamformer to cancel the jammer signals. In another aspect of the invention, the adaptive filters may be utilized during normal operation by having them perform an echo cancellation operation on each of the signals that is to be supplied to the calibrated beamformer.
摘要:
An apparatus for sound enhancement has at least two microphones (9) that provide a directional microphone array which is arranged to be pointed in the direction of a sound source. The directional microphone array thereby receives sound emitted by the sound source and generates sound signals. A processor (20) processors the sound signals generated by the microphone array to enhance the sound received by the directional microphone array from the sound source relative to other sound received by the directional microphone array. The processor (20) generates a corresponding enhanced signal (ES). Loud speakers (22) reproduce the enhanced signal as audible sound. Furthermore, sound suppression devices (7, 7a) are provided to suppress ambient should from reaching the eardrums of the user. This sound suppression acts in conjunction with the directional microphone array and the processor (20) which enhance the SOI to provide a listening environment in which the SOI is enhanced.