摘要:
The present invention provides a process for separating a good quality information signal from a noisy acoustic environment. The separation process uses a set of a least two spaced-apart transducers to capture noise and information components. The transducer signals, which have both a noise and information component, are received into a separation process. The separation process generates one channel that is substantially only noise, and another channel that is a combination of noise and information. An identification process is used to identify which channel has the information component. The noise signal is then used to set process characteristics that are applied to the combination signal to efficiently reduce or eliminate the noise component. In this way, the noise is effectively removed from the combination signal to generate a good qualify information signal. The information signal may be, for example, a speech signal, a seismic signal, a sonar signal, or other acoustic signal.
摘要:
Implementations and applications are disclosed for detection of a transition in a voice activity state of an audio signal, based on a change in energy that is consistent in time across a range of frequencies of the signal. For example, such detection may be based on a time derivative of energy for each of a number of different frequency components of the signal.
摘要:
A disclosed method selects a plurality of fewer than all of the channels of a multichannel signal, based on information relating to the direction of arrival of at least one frequency component of the multichannel signal.
摘要:
A mobile platform includes a microphone array and is capable of implementing beamforming to amplify or suppress audio information from a sound source. The sound source is indicated through a user input, such as pointing the mobile platform in the direction of the sound source or through a touch screen display interface. The mobile platform further includes orientation sensors capable of detecting movement of the mobile platform. When the mobile platform moves with respect to the sound source, the beamforming is adjusted based on the data from the orientation sensors so that beamforming is continuously implemented in the direction of the sound source. The audio information from the sound source may be included or suppressed from a telephone or video-telephony conversation. Images or video from a camera may be likewise controlled based on the data from the orientation sensors.
摘要:
A method for blind source separation based spatial filtering on an electronic device includes obtaining a first source audio signal and a second source audio signal. The method also includes applying a blind source separation filter set to the first source audio signal and to the second source audio signal to produce a spatially filtered first audio signal and a spatially filtered second audio signal. The method further includes playing the spatially filtered first audio signal over a first speaker to produce an acoustic spatially filtered first audio signal and playing the spatially filtered second audio signal over a second speaker to produce an acoustic spatially filtered second audio signal. The acoustic spatially filtered first audio signal and the acoustic spatially filtered second audio signal produce an isolated acoustic first source audio signal at a first position and an isolated acoustic second source audio signal at a second position.
摘要:
Systems, methods, apparatus, and machine-readable media for voice activity detection in a single-channel or multichannel audio signal are disclosed.
摘要:
A disclosed method selects a plurality of fewer than all of the channels of a multichannel signal, based on information relating to the direction of arrival of at least one frequency component of the multichannel signal.
摘要:
The present invention relates to blind source separation. More specifically it relates to the blind source separation using frequency domain processes.