Abstract:
The invention includes a method, apparatus, and computer program to selectively suppress wind noise while preserving narrow-band signals in acoustic data. Sound from one or several microphones is digitized into binary data. A time-frequency transform is applied to the data to produce a series of spectra. The spectra are analyzed to detect the presence of wind noise and narrow-band signals. Wind noise is selectively suppressed while the narrow-band signals are preserved. The narrow-band signal is interpolated through the times and frequencies at which it is masked by the wind noise. A time series that can be listened to is then synthesized from the signal spectral estimate. This invention overcomes prior art limitations that require more than one microphone and an independent measurement of wind speed. Its application results in good-quality speech from data severely degraded by wind noise.
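The selective suppression step can be illustrated with a minimal sketch that operates on a single magnitude spectrum. The abstract does not say how wind noise or narrow-band signals are detected, so everything here is an assumption made for illustration: the function names `narrowband_mask` and `suppress_wind`, the local-median peak test, the `ratio` and `gain` values, and the idea that wind energy sits in the lowest `wind_bins` frequency bins. Interpolation through masked regions and resynthesis of the time series are not shown.

```python
from statistics import median
from typing import List


def narrowband_mask(mag: List[float], half_width: int = 3, ratio: float = 4.0) -> List[bool]:
    """Flag bins that stand well above their neighbours (assumed peak test)."""
    mask = []
    for i, m in enumerate(mag):
        lo = max(0, i - half_width)
        hi = min(len(mag), i + half_width + 1)
        neighbours = mag[lo:i] + mag[i + 1:hi]
        # A bin counts as narrow-band if it exceeds the local median by `ratio`.
        mask.append(bool(neighbours) and m > ratio * median(neighbours))
    return mask


def suppress_wind(mag: List[float], wind_bins: int, gain: float = 0.1) -> List[float]:
    """Attenuate the lowest `wind_bins` bins, except those flagged as narrow-band."""
    peaks = narrowband_mask(mag)
    return [
        m if (peaks[i] or i >= wind_bins) else m * gain
        for i, m in enumerate(mag)
    ]


# Hypothetical spectrum: broadband low-frequency energy with one narrow peak at bin 3.
spectrum = [10, 10, 10, 50, 10, 10, 1, 1, 1, 1]
cleaned = suppress_wind(spectrum, wind_bins=6)
```

In this toy input the peak at bin 3 is kept at full level while the surrounding low-frequency bins are scaled by `gain`; bins at or above `wind_bins` are left untouched.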
Abstract:
A system and method to identify a sound source among a group of sound sources. The invention matches the acoustic input to a number of signal models, one per source class, and produces a goodness-of-match number for each signal model. The sound source is declared to be of the same class as that of the signal model with the best goodness-of-match if that score is sufficiently high. The data are recorded with a microphone, digitized and transformed into the frequency domain. A signal detector is applied to the transient. A harmonic detection method can be used to determine if the sound source has harmonic characteristics. If at least some part of a transient contains a signal of interest, the spectrum of the signal after rescaling is compared to a set of signal models, and the input signal's parameters are fitted to the data. The average distortion is calculated to compare patterns with those of sources that were used in training the signal models. Before classification can occur, a source model is trained with signal data. Each signal model is built by creating templates from input signal spectrograms when they are significantly different from existing templates. If an existing template is found that resembles the input pattern, the template is averaged with the pattern in such a way that the resulting template is the average of all the spectra that matched that template in the past.
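The template training and matching described above can be sketched as follows. This is only an illustration under stated assumptions: the abstract does not define the distortion measure or the thresholds, so the mean squared difference in `distortion`, the `threshold` and `max_distortion` parameters, and the names `Template`, `train`, and `classify` are all hypothetical. Lowest distortion is treated here as the best goodness-of-match. The running-mean update is the part taken directly from the text: keeping a per-template count makes each template equal to the average of every spectrum that has matched it.

```python
from dataclasses import dataclass
from typing import Dict, List, Optional


@dataclass
class Template:
    spectrum: List[float]
    count: int = 1  # number of spectra averaged into this template


def distortion(a: List[float], b: List[float]) -> float:
    """Mean squared difference between two equal-length spectra (assumed measure)."""
    return sum((x - y) ** 2 for x, y in zip(a, b)) / len(a)


def train(templates: List[Template], pattern: List[float], threshold: float) -> None:
    """Average the pattern into the closest template, or start a new template."""
    best = min(templates, key=lambda t: distortion(t.spectrum, pattern), default=None)
    if best is None or distortion(best.spectrum, pattern) > threshold:
        # Significantly different from every existing template: create a new one.
        templates.append(Template(list(pattern)))
        return
    n = best.count
    # Running mean: the result is the average of all spectra matched so far.
    best.spectrum = [(t * n + p) / (n + 1) for t, p in zip(best.spectrum, pattern)]
    best.count = n + 1


def classify(models: Dict[str, List[Template]], pattern: List[float],
             max_distortion: float) -> Optional[str]:
    """Return the class whose closest template matches best, or None if no match is good enough."""
    scores = {
        name: min(distortion(t.spectrum, pattern) for t in temps)
        for name, temps in models.items() if temps
    }
    if not scores:
        return None
    best = min(scores, key=scores.get)
    return best if scores[best] <= max_distortion else None


# Hypothetical two-bin spectra, for illustration only.
templates: List[Template] = []
train(templates, [1.0, 2.0], threshold=0.5)
train(templates, [1.2, 2.2], threshold=0.5)  # close to the first: averaged in
train(templates, [5.0, 5.0], threshold=0.5)  # far from the first: new template
models = {"a": templates[:1], "b": templates[1:]}
label = classify(models, [5.1, 4.9], max_distortion=1.0)
```

Storing the count alongside each template avoids keeping every past spectrum in memory while still producing the exact average.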
Abstract:
A speech signal isolation system configured to isolate and reconstruct a speech signal transmitted in an environment in which frequency components of the speech signal are masked by background noise. The speech signal isolation system obtains a noisy speech signal from an audio source. The noisy speech signal may then be fed through a neural network that has been trained to isolate and reconstruct a clean speech signal from background noise. Once the noisy speech signal has been fed through the neural network, the speech signal isolation system generates an estimated speech signal with substantially reduced noise.
Abstract:
A voice enhancement logic improves the perceptual quality of a processed voice. The voice enhancement system includes a noise detector and a noise attenuator. The noise detector detects a wind buffet and a continuous noise by modeling the wind buffet. The noise attenuator dampens the wind buffet to improve the intelligibility of an unvoiced, a fully voiced, or a mixed voice segment.