Invention Grant
- Patent Title: Multichannel voice detection in adverse environments
- Patent Title (中): 不利环境中的多声道语音检测
-
Application No.: US10231613Application Date: 2002-08-30
-
Publication No.: US07146315B2Publication Date: 2006-12-05
- Inventor: Radu Victor Balan , Justinian Rosca , Christophe Beaugeant
- Applicant: Radu Victor Balan , Justinian Rosca , Christophe Beaugeant
- Applicant Address: US NJ Princeton
- Assignee: Siemens Corporate Research, Inc.
- Current Assignee: Siemens Corporate Research, Inc.
- Current Assignee Address: US NJ Princeton
- Agency: F. Chau & Associates, LLC.
- Agent Donald B. Paschburg
- Main IPC: G10L15/20
- IPC: G10L15/20

Abstract:
A multichannel source activity detection system, e.g., a voice activity detection (VAD) system, and method that exploits spatial localization of a target audio source is provided. The method includes the steps of receiving a mixed sound signal by at least two microphones; Fast Fourier transforming each received mixed sound signal into the frequency domain; filtering the transformed signals to output a signal corresponding to a spatial signature of a source; summing an absolute value squared of the filtered signal over a predetermined range of frequencies; and comparing the sum to a threshold to determine if a voice is present. Additionally, the filtering step includes multiplying the transformed signals by an inverse of a noise spectral power matrix, a vector of channel transfer function ratios, and a source signal spectral power.
Public/Granted literature
- US20040042626A1 Multichannel voice detection in adverse environments Public/Granted day:2004-03-04
Information query