Invention Application
- Patent Title: VOICE ACTIVITY DETECTION SYSTEMS AND METHODS
-
Application No.: PCT/US2018/063937Application Date: 2018-12-04
-
Publication No.: WO2019113130A1Publication Date: 2019-06-13
- Inventor: KASKARI, Saeed Mosayyebpour , NESTA, Francesco
- Applicant: SYNAPTICS INCORPORATED
- Applicant Address: 1251 McKay Drive San Jose, California 95131 US
- Assignee: SYNAPTICS INCORPORATED
- Current Assignee: SYNAPTICS INCORPORATED
- Current Assignee Address: 1251 McKay Drive San Jose, California 95131 US
- Agency: GALLAGHER, Dennis R.
- Priority: US15/832,709 20171205
- Main IPC: G10L25/78
- IPC: G10L25/78 ; G10L19/02 ; G10L25/18
Abstract:
An audio processing device or method includes an audio transducer operable to receive audio input and generate an audio signal based on the audio input. The audio processing device or method also includes an audio signal processor operable to extract local features from the audio signal, such as Power-Normalized Coefficients (PNCC) of the audio signal. The audio signal processor also is operable to extract global features from the audio signal, such as chroma features and harmonicity features. A neural network is provided to determine a probability that a target audio is present in the audio signal based on the local and global features. In particular, the neural network is trained to output a value indicating whether the target audio is present and locally dominant in the audio signal.
Information query