摘要:
A system and method are disclosed for noise level/spectrum estimation and speech activity detection. Some embodiments include a probabilistic model to estimate noise level and subsequently detect the presence of speech. These embodiments outperform standard voice activity detectors (VADs), producing improved detection in a variety of noisy environments.
摘要:
A method for echo cancellation and noise suppression is disclosed. Linear echo cancellation (LEC) is performed for a primary microphone channel on an entire frequency band or in a range of frequencies where echo is audible. LEC is performed on one or more secondary microphone channels only on a lower frequency range over which spatial processing is effective. The microphone channels are spatially processed over the lower frequency range after LEC. Non-linear noise suppression post-processing is performed on the entire frequency band. Echo post-processing is performed on the entire frequency band.
摘要:
A method for echo cancellation and noise suppression is disclosed. Linear echo cancellation (LEC) is performed for a primary microphone channel on an entire frequency band or in a range of frequencies where echo is audible. LEC is performed on one or more secondary microphone channels only on a lower frequency range over which spatial processing is effective. The microphone channels are spatially processed over the lower frequency range after LEC. Non-linear noise suppression post-processing is performed on the entire frequency band. Echo post-processing is performed on the entire frequency band.
摘要:
A system and method are disclosed for noise level/spectrum estimation and speech activity detection. Some embodiments include a probabilistic model to estimate noise level and subsequently detect the presence of speech. These embodiments outperform standard voice activity detectors (VADs), producing improved detection in a variety of noisy environments.
摘要:
Sound signal reception is improved by utilizing a plurality of microphones to capture sound signals which are then weighed to dynamically adjust signal quality. A first sound signal and a second sound signal are obtained from first and second microphones, respectively, where the first and second sound signals originate from one or more sound sources. A first signal characteristic (e.g., signal power, signal signal-to-noise ratio, etc.) is obtained for the first sound signal and a second signal characteristic is obtained for the second sound signal. The first and second sound signals are weighed or scaled based on their respective first and second signal characteristics. The weighed first and second sound signals are then combined to obtain an output sound signal.
摘要:
An enhanced blind source separation technique is provided to improve separation of highly correlated signal mixtures. A beamforming algorithm is used to precondition correlated first and second input signals in order to avoid indeterminacy problems typically associated with blind source separation. The beamforming algorithm may apply spatial filters to the first signal and second signal in order to amplify signals from a first direction while attenuating signals from other directions. Such directionality may serve to amplify a desired speech signal in the first signal and attenuate the desired speech signal from the second signal. Blind source separation is then performed on the beamformer output signals to separate the desired speech signal and the ambient noise and reconstruct an estimate of the desired speech signal. To enhance the operation of the beamformer and/or blind source separation, calibration may be performed at one or more stages.
摘要:
A method for audio signal processing is described. The method includes decomposing a recorded auditory scene into a first category of localizable sources and a second category of ambient sound. The method also includes recording an indication of the directions of each of the localizable sources. The method may be performed with a device having a microphone array.
摘要:
An electronic device for suppressing noise in an audio signal is described. The electronic device includes a processor and instructions stored in memory. The electronic device receives an input audio signal and computes an overall noise estimate based on a stationary noise estimate, a non-stationary noise estimate and an excess noise estimate. The electronic device also computes an adaptive factor based on an input Signal-to-Noise Ratio (SNR) and one or more SNR limits. A set of gains is also computed using a spectral expansion gain function. The spectral expansion gain function is based on the overall noise estimate and the adaptive factor. The electronic device also applies the set of gains to the input audio signal to produce a noise-suppressed audio signal and provides the noise-suppressed audio signal.
摘要:
Signal processing solutions take advantage of microphones located on different devices and improve the quality of transmitted voice signals in a communication system. With usage of various devices such as Bluetooth headsets, wired headsets and the like in conjunction with mobile handsets, multiple microphones located on different devices are exploited for improving performance and/or voice quality in a communication system. Audio signals are recorded by microphones on different devices and processed to produce various benefits, such as improved voice quality, background noise reduction, voice activity detection and the like.
摘要:
In a digital system with more than one clock source, lack of synchronization between the clock sources may cause overflow or underflow in sample buffers, also called sample slipping. Sample slipping may lead to undesirable artifacts in the processed signal due to discontinuities introduced by the addition or removal of extra samples. To smooth out discontinuities caused by sample slipping, samples are filtered to when a buffer overflow condition occurs, and the samples are interpolated to produce additional samples when a buffer underflow condition occurs. The interpolated samples may also be filtered. The filtering and interpolation operations can be readily implemented without adding significant burden to the computational complexity of a real-time digital system.