-
公开(公告)号:US20170133041A1
公开(公告)日:2017-05-11
申请号:US15321743
申请日:2015-07-07
Applicant: Analog Devices Global
Inventor: Mikael M. MORTENSEN , Kim Spetzler BERTHELSEN , Robert ADAMS , Cyrill A. MARTIN , Andrew MILIA , Eric G. NESTLER
Abstract: Many processes for audio signal processing can benefit from voice activity detection, which aims to detect the presence of speech as opposed to silence or noise. The present disclosure describes, among other things, leveraging energy-based features of voice and insights on first and second formant frequencies of vowels to provide a low-complexity and low-power voice activity detector. A pair of two channels is provided whereby each channel is configured to detect voice activity in respective frequency bands of interest. Simultaneous activity detected in both channels can be a sufficient condition for determining that voice is present. More channels or pairs of channels can be used to detect different types of voices to improve detection and/or to detect voices present in different audio streams.
-
公开(公告)号:US20190355383A1
公开(公告)日:2019-11-21
申请号:US16515018
申请日:2019-07-17
Applicant: Analog Devices Global Unlimited Company
Inventor: Mikael MORTENSEN , Kim Spetzler BERTHELSEN , Robert Adams , Andrew MILIA
Abstract: Many processes for audio signal processing can benefit from voice activity detection, which aims to detect the presence of speech as opposed to silence or noise. The present disclosure describes, among other things, leveraging energy-based features of voice and insights on first and second formant frequencies of vowels to provide a low-complexity and low-power voice activity detector. A pair of two channels is provided whereby each channel is configured to detect voice activity in respective frequency bands of interest. Simultaneous activity detected in both channels can be a sufficient condition for determining that voice is present. More channels or pairs of channels can be used to detect different types of voices to improve detection and/or to detect voices present in different audio streams.
-