Invention Grant
- Patent Title: System and method for audio/video speaker detection
- Patent Title (中): 用于音频/视频扬声器检测的系统和方法
- Application No.: US10606061
- Application Date: 2003-06-25
- Publication No.: US07343289B2
- Publication Date: 2008-03-11
- Inventor: Ross Cutler, Ashish Kapoor
- Applicant: Ross Cutler, Ashish Kapoor
- Applicant Address: US WA Redmond
- Assignee: Microsoft Corp.
- Current Assignee: Microsoft Corp.
- Current Assignee Address: US WA Redmond
- Agency: Lyon & Harr, LLP
- Agent: Katrina A. Lyon
- Main IPC: G10L13/00
- IPC: G10L13/00

Abstract:
A system and method for detecting speech using audio and video inputs. In one aspect, the invention collects audio data generated by a microphone device. In another aspect, the invention collects video data and processes it to determine a mouth location for a given speaker. The audio and video data are input into a time-delay neural network, which processes them to determine which target is speaking. The network's decision is based on the correlation between mouth movement detected in the video data and sounds detected by the microphone.
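
The abstract's pipeline (per-frame audio and mouth-motion features fed to a time-delay neural network) can be illustrated with a minimal Python sketch. Everything specific here is an assumption made for illustration and is not taken from the patent: the two scalar features per frame (`audio_energy`, `mouth_motion`), the layer sizes, the three-frame delay window, and the random untrained weights. It only shows how a time-delay layer lets each unit see several consecutive frames, so that audio and mouth motion occurring together in time can influence the output.

```python
import math
import random


def tdnn_layer(frames, weights, biases, activation=math.tanh):
    """One time-delay layer: every unit sees `delay` consecutive frames.

    frames  -- list of T feature vectors (lists of D floats)
    weights -- weights[unit][k][d], with k indexing the delay taps
    biases  -- one float per unit
    Returns T - delay + 1 output vectors, one value per unit.
    """
    delay = len(weights[0])
    outputs = []
    for t in range(len(frames) - delay + 1):
        row = []
        for taps, bias in zip(weights, biases):
            total = bias
            for k in range(delay):
                total += sum(w * x for w, x in zip(taps[k], frames[t + k]))
            row.append(activation(total))
        outputs.append(row)
    return outputs


def random_weights(rng, units, delay, dim):
    """Untrained placeholder weights; a real system would learn these."""
    return [[[rng.uniform(-1, 1) for _ in range(dim)] for _ in range(delay)]
            for _ in range(units)]


def speaking_score(audio_energy, mouth_motion, params):
    """Score in (0, 1) for one candidate speaker over a short window."""
    # Pair the audio and video features frame by frame (hypothetical features).
    frames = [[a, m] for a, m in zip(audio_energy, mouth_motion)]
    hidden = tdnn_layer(frames, params["w1"], params["b1"])
    # Linear output layer, averaged over time, then squashed with a sigmoid.
    logits = tdnn_layer(hidden, params["w2"], params["b2"], activation=lambda v: v)
    mean_logit = sum(row[0] for row in logits) / len(logits)
    return 1.0 / (1.0 + math.exp(-mean_logit))


rng = random.Random(0)
params = {
    "w1": random_weights(rng, units=4, delay=3, dim=2),
    "b1": [0.0] * 4,
    "w2": random_weights(rng, units=1, delay=3, dim=4),
    "b2": [0.0],
}
audio_energy = [0.1, 0.8, 0.9, 0.7, 0.2, 0.1, 0.6, 0.9, 0.8, 0.3]
mouth_motion = [0.0, 0.7, 0.8, 0.6, 0.1, 0.0, 0.5, 0.9, 0.7, 0.2]
score = speaking_score(audio_energy, mouth_motion, params)
```

Because the weights are untrained, the score carries no meaning here. In a trained setup of this kind, one would compute a score per candidate mouth region and pick the highest-scoring candidate as the active speaker.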
Public/Granted literature
- US20040267521A1 System and method for audio/video speaker detection (Public/Granted day: 2004-12-30)