Invention Grant
- Patent Title: Noise reduction and audio-visual speech activity detection
- Patent Title (中): 降噪和视听语音活动检测
-
Application No.: US10542869Application Date: 2004-01-09
-
Publication No.: US07684982B2Publication Date: 2010-03-23
- Inventor: Morio Taneda
- Applicant: Morio Taneda
- Applicant Address: SE Lund
- Assignee: Sony Ericsson Communications AB
- Current Assignee: Sony Ericsson Communications AB
- Current Assignee Address: SE Lund
- Agency: Myers Bigel Sibley & Sajovec
- Priority: EP03001637 20030124; EP03022561 20031002
- International Application: PCT/EP2004/000104 WO 20040109
- International Announcement: WO2004/066273 WO 20040805
- Main IPC: G10L21/02
- IPC: G10L21/02
Abstract:
A noise reduction system including an audio-visual user interface combines visual features extracted from a digital video sequence with audio features extracted from an analog audio sequence. The digital video sequence may show the face of a speaker, and the analog audio sequence may include background noise in an environment of said speaker. Audio sequence detection means are used to detect said analog audio sequence, and audio feature extraction and analysis means are used to analyze said analog audio sequence and extract said audio features therefrom. Video sequence detection means are used to detect said video sequence, and visual feature extraction and analysis means are used to analyze the detected video sequence and extract said visual features therefrom. A noise reduction circuit is configured to separate the speaker's voice from said background noise based on a combination of derived speech characteristics and output a speech activity indication signal. The speech activity indication signal includes a combination of speech activity estimates supplied by said audio feature extraction and analysis means and said visual feature extraction and analysis means. A multi-channel acoustic echo cancellation unit is configured to perform a near-end speaker detection and double-talk detection algorithm based on the speech characteristics derived by said audio feature extraction and analysis means and said visual feature extraction and analysis means.
Public/Granted literature
- US20060224382A1 Noise reduction and audio-visual speech activity detection Public/Granted day:2006-10-05
Information query