Abstract:
Implementations and applications are disclosed for detection of a transition in a voice activity state of an audio signal, based on a change in energy that is consistent in time across a range of frequencies of the signal. For example, such detection may be based on a time derivative of energy for each of a number of different frequency components of the signal.
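The idea of an energy change that is consistent in time across frequencies can be illustrated with a minimal sketch. It assumes per-band energies have already been computed for each frame (for example, from FFT magnitudes), approximates the time derivative by a frame-to-frame difference of log energy, and flags a transition when enough bands move in the same direction at once. The function name `detect_transitions` and the thresholds `delta_db` and `min_fraction` are illustrative assumptions, not values from the abstract.

```python
import math


def detect_transitions(band_energies, delta_db=6.0, min_fraction=0.6):
    """Label each frame after the first as 'onset', 'offset', or None.

    band_energies: list of frames, each a list of per-band energies (> 0).
    delta_db, min_fraction: hypothetical thresholds chosen for illustration.
    """
    labels = []
    for prev, curr in zip(band_energies, band_energies[1:]):
        # First-order difference of log energy approximates the time derivative.
        deltas = [10.0 * math.log10(c / p) for p, c in zip(prev, curr)]
        # Fraction of bands whose energy rose (or fell) sharply in this frame.
        rising = sum(d >= delta_db for d in deltas) / len(deltas)
        falling = sum(d <= -delta_db for d in deltas) / len(deltas)
        if rising >= min_fraction:
            labels.append("onset")
        elif falling >= min_fraction:
            labels.append("offset")
        else:
            labels.append(None)
    return labels
```

A jump confined to a single band leaves the frame unlabeled in this sketch, because the change is not shared across the frequency range.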
Abstract:
Systems, methods, apparatus, and machine-readable media for voice activity detection in a single-channel or multichannel audio signal are disclosed.
Abstract:
Implementations and applications are disclosed for detection of a transition in a voice activity state of an audio signal, based on a change in energy that is consistent in time across a range of frequencies of the signal.
Abstract:
Based on phase differences between corresponding frequency components of different channels of a multichannel signal, a measure of directional coherency is calculated. Applications of such a measure to voice activity detection and noise reduction are also disclosed.
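One possible reading of such a measure, sketched under stated assumptions: for a single source, the phase difference between two channels grows linearly with frequency, so dividing it by 2πf gives a per-bin estimate of the time difference of arrival. The sketch below scores coherency as the fraction of bins whose estimate lies near a target delay. The function name `directional_coherency`, the tolerance `tol_s`, and the fraction-of-bins scoring are hypothetical choices, and the bins are assumed low enough in frequency that the phase does not wrap.

```python
import cmath
import math


def directional_coherency(spec_a, spec_b, freqs_hz, target_delay_s, tol_s):
    """Fraction of bins whose inter-channel delay estimate is near the target.

    spec_a, spec_b: complex spectra of the two channels (same length).
    freqs_hz: center frequency of each bin (> 0, no phase wrapping assumed).
    """
    hits = 0
    for a, b, f in zip(spec_a, spec_b, freqs_hz):
        # The phase of a * conj(b) is the phase difference between channels.
        dphi = cmath.phase(a * b.conjugate())
        # Phase difference divided by angular frequency estimates the delay
        # of channel b relative to channel a.
        delay = dphi / (2.0 * math.pi * f)
        if abs(delay - target_delay_s) <= tol_s:
            hits += 1
    return hits / len(freqs_hz)
```

A score near 1 suggests most frequency components arrive from the target direction, which is the kind of evidence a voice activity detector or noise reducer could act on.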
Abstract:
A method for audio signal processing is described. The method includes decomposing a recorded auditory scene into a first category of localizable sources and a second category of ambient sound. The method also includes recording an indication of the directions of each of the localizable sources. The method may be performed with a device having a microphone array.
Abstract:
A method of orientation-sensitive recording control includes indicating, within a portable device and at a first time, that the portable device has a first orientation relative to a gravitational axis and, based on the indication, selecting a first pair among at least three microphone channels of the portable device. This method also includes indicating, within the portable device and at a second time that is different than the first time, that the portable device has a second orientation relative to the gravitational axis that is different than the first orientation and, based on the indication, selecting a second pair among the at least three microphone channels that is different than the first pair. In this method, each of the at least three microphone channels is based on a signal produced by a corresponding one of at least three microphones of the portable device.
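The selection logic can be sketched as follows, assuming the orientation indication comes from a gravity vector in the device's own coordinate frame (for example, from an accelerometer). The two orientation labels, the channel indices, and the mapping `PAIR_FOR_ORIENTATION` are hypothetical; the abstract does not say which pair corresponds to which orientation.

```python
def classify_orientation(gravity_xyz):
    """Return 'portrait' or 'landscape' from a device-frame gravity vector."""
    gx, gy, _gz = gravity_xyz
    # Gravity mostly along the device's long (y) axis: device is held upright.
    return "portrait" if abs(gy) >= abs(gx) else "landscape"


# Hypothetical mapping from orientation to a pair of the three channels 0, 1, 2.
PAIR_FOR_ORIENTATION = {
    "portrait": (0, 1),
    "landscape": (1, 2),
}


def select_pair(gravity_xyz):
    """Select the microphone-channel pair for the current orientation."""
    return PAIR_FOR_ORIENTATION[classify_orientation(gravity_xyz)]
```

Calling `select_pair` at two different times with different gravity readings yields different pairs whenever the classified orientation has changed.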
Abstract:
A multi-microphone system performs location-selective processing of an acoustic signal, wherein source location is indicated by directions of arrival relative to microphone pairs at opposite sides of a midsagittal plane of a user's head.
Abstract:
A method for echo cancellation and noise suppression is disclosed. Linear echo cancellation (LEC) is performed for a primary microphone channel on an entire frequency band or in a range of frequencies where echo is audible. LEC is performed on one or more secondary microphone channels only on a lower frequency range over which spatial processing is effective. The microphone channels are spatially processed over the lower frequency range after LEC. Non-linear noise suppression post-processing is performed on the entire frequency band. Echo post-processing is performed on the entire frequency band.
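The division of work across frequency can be summarized with a small sketch that marks, for each frequency bin, which stages are active. It assumes a single cutoff frequency below which spatial processing is effective, and it shows only the full-band option for the primary channel. The function name `stage_masks`, the stage labels, and the parameter `spatial_cutoff_hz` are illustrative assumptions, and the sketch does not implement the filters themselves.

```python
def stage_masks(bin_freqs_hz, spatial_cutoff_hz):
    """Per-stage boolean masks over frequency bins (True = stage is active)."""
    low = [f <= spatial_cutoff_hz for f in bin_freqs_hz]
    full = [True] * len(bin_freqs_hz)
    return {
        "lec_primary": full,            # primary channel: entire band
        "lec_secondary": low,           # secondary channels: lower range only
        "spatial_processing": low,      # runs after LEC over the same range
        "noise_postprocessing": full,   # non-linear noise suppression
        "echo_postprocessing": full,    # echo post-processing
    }
```

Restricting the secondary-channel cancellers to the lower range avoids spending adaptive-filter effort on bins that the spatial stage does not use.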