Abstract:
A method for improving the quality of a speech signal extracted from a noisy acoustic environment is provided. In one approach, a signal separation process is associated with a voice activity detector. The voice activity detector is a two-channel detector, which enables particularly robust and accurate detection of voice activity. When speech is detected, the voice activity detector generates a control signal. The control signal is used to activate, adjust, or control signal separation processes or post-processing operations to improve the quality of the resulting speech signal. In another approach, a signal separation process is provided as a learning stage and an output stage. The learning stage adjusts aggressively to current acoustic conditions and passes coefficients to the output stage. The output stage adapts more slowly and generates a speech-content signal and a noise-dominant signal. When the learning stage becomes unstable, only the learning stage is reset, allowing the output stage to continue outputting a high-quality speech signal.
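To illustrate the two-channel idea, here is a minimal sketch in Python. It assumes the primary microphone sits closer to the talker than the secondary one, so speech shows up as a level difference between channels while diffuse noise does not. The function names, the 6 dB threshold, and the energy-ratio criterion are illustrative assumptions, not the detector the abstract describes.

```python
import math

def frame_energy(frame):
    """Mean squared amplitude of one frame of samples."""
    return sum(s * s for s in frame) / len(frame)

def two_channel_vad(primary, secondary, threshold_db=6.0, floor=1e-12):
    """Return True when the primary channel is louder than the secondary
    channel by at least threshold_db (hypothetical criterion)."""
    e1 = frame_energy(primary) + floor      # floor avoids log of zero
    e2 = frame_energy(secondary) + floor
    ratio_db = 10.0 * math.log10(e1 / e2)
    return ratio_db >= threshold_db

# Near-field speech: strong on the primary mic, weak on the secondary.
speech_primary = [0.5, -0.4, 0.6, -0.5]
speech_secondary = [0.1, -0.1, 0.1, -0.1]
# Diffuse noise: similar level on both microphones.
noise_primary = [0.1, -0.1, 0.1, -0.1]
noise_secondary = [0.1, -0.1, 0.1, -0.1]

# The boolean plays the role of the control signal.
control_speech = two_channel_vad(speech_primary, speech_secondary)
control_noise = two_channel_vad(noise_primary, noise_secondary)
```

In a fuller system the boolean would gate adaptation of the separation stage or steer post-processing; that wiring is left out here because the abstract does not specify it.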
Abstract:
Methods, apparatus, and systems for source separation include a converged plurality of coefficient values that is based on each of a plurality of M-channel signals. Each of the plurality of M-channel signals is based on signals produced by M transducers in response to at least one information source and at least one interference source. In some examples, the converged plurality of coefficient values is used to filter an M-channel signal to produce an information output signal and an interference output signal.
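The final filtering step can be sketched as follows, under a strong simplifying assumption: the converged coefficients form a memoryless (instantaneous) matrix with one row per output, whereas the abstract does not say what filter structure is used. The name `apply_unmixing`, the matrix `W`, and the toy mixture are all hypothetical.

```python
def apply_unmixing(coeffs, frames):
    """Apply a fixed coefficient matrix to an M-channel signal.

    coeffs: list of output rows, each holding M weights.
    frames: list of time samples, each a list of M channel values.
    Returns one output sequence per coefficient row.
    """
    outputs = [[] for _ in coeffs]
    for sample in frames:
        for row, out in zip(coeffs, outputs):
            out.append(sum(w * x for w, x in zip(row, sample)))
    return outputs

# Toy sources (M = 2): s is the information source, n the interference.
s = [4.0, 0.0, -4.0]
n = [0.0, 8.0, 8.0]
# Each transducer hears both sources: ch0 = s + 0.5 n, ch1 = 0.5 s + n.
mixture = [[si + 0.5 * ni, 0.5 * si + ni] for si, ni in zip(s, n)]

# Hypothetical converged coefficients; they undo the mixing up to a gain of 0.75.
W = [[1.0, -0.5],
     [-0.5, 1.0]]
info_out, interf_out = apply_unmixing(W, mixture)
```

With these values `info_out` is a scaled copy of `s` and `interf_out` a scaled copy of `n`, which is the most a separation filter can promise, since the overall gain of each output is generally not determined.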
Abstract:
Methods and apparatus for generating an anti-noise signal and equalizing a reproduced audio signal (e.g., a far-end telephone signal) are described, wherein the generating and the equalizing are both based on information from an acoustic error signal.
Abstract:
A system and method for spectral analysis use a set of spectral data. The spectral data is arranged according to a second dimension, such as time, temperature, position, or another condition. The arranged spectral data is used in a signal separation process, such as independent component analysis (ICA), which generates independent signals. The independent signals are then used to identify or quantify a target component.
Abstract:
A method for echo cancellation and noise suppression is disclosed. Linear echo cancellation (LEC) is performed for a primary microphone channel on an entire frequency band or in a range of frequencies where echo is audible. LEC is performed on one or more secondary microphone channels only on a lower frequency range over which spatial processing is effective. The microphone channels are spatially processed over the lower frequency range after LEC. Non-linear noise suppression post-processing is performed on the entire frequency band. Echo post-processing is performed on the entire frequency band.
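The division of work across frequency can be sketched as a per-stage bin plan. This assumes a frequency-domain implementation with uniformly spaced bins from 0 Hz to half the sample rate, and it picks the full-band option for the primary channel. The function `plan_bins`, the stage names, and the 4 kHz cutoff are hypothetical; the abstract gives no numeric values.

```python
def plan_bins(n_bins, sample_rate, spatial_cutoff_hz):
    """Return the range of frequency bins each stage operates on.

    n_bins bins are assumed to be spaced uniformly from 0 Hz to sample_rate / 2.
    """
    bin_hz = (sample_rate / 2) / (n_bins - 1)
    # Bins at or below the cutoff, where spatial processing is assumed effective.
    low_end = min(n_bins, int(spatial_cutoff_hz / bin_hz) + 1)
    full = range(0, n_bins)
    low = range(0, low_end)
    return {
        "lec_primary": full,    # echo cancellation on the primary microphone
        "lec_secondary": low,   # echo cancellation on secondary microphones
        "spatial": low,         # spatial processing after LEC
        "noise_post": full,     # non-linear noise suppression
        "echo_post": full,      # echo post-processing
    }

plan = plan_bins(n_bins=129, sample_rate=16000, spatial_cutoff_hz=4000)
```

Restricting the secondary-channel cancellers to the low range saves computation in exactly the bins where their output would not be used by the spatial stage.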
Abstract:
Systems, methods, and apparatus for spectral contrast enhancement of speech signals, based on information from a noise reference that is derived by a spatially selective processing filter from a multichannel sensed audio signal, are disclosed.