Invention Grant
US07343289B2 System and method for audio/video speaker detection (in force)

System and method for audio/video speaker detection
Abstract:
A system and method for detecting speech using audio and video inputs. In one aspect, the invention collects audio data generated by a microphone device. In another aspect, the invention collects video data and processes it to determine a mouth location for a given speaker. The audio and video data are input into a time-delay neural network, which processes them to determine which target is speaking. The neural network processing is based on a correlation between mouth movement detected in the video data and audio sounds detected by the microphone.
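The correlation idea in the abstract can be illustrated with a minimal sketch. This is not the patented method: it swaps the trained time-delay neural network for a plain Pearson correlation between per-frame audio energy and per-frame mouth motion, and picks the target with the highest score. The function names, feature definitions, and sample values below are all hypothetical and chosen only for illustration.

```python
from math import sqrt


def pearson(xs, ys):
    """Pearson correlation of two equal-length sequences (0.0 if either is constant)."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("sequences must have equal length >= 2")
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        return 0.0
    return cov / sqrt(var_x * var_y)


def pick_speaker(audio_energy, mouth_motion_by_target):
    """Return (best_target, scores): the target whose mouth motion
    correlates most strongly with the audio energy, plus all scores."""
    scores = {
        target: pearson(audio_energy, motion)
        for target, motion in mouth_motion_by_target.items()
    }
    return max(scores, key=scores.get), scores


# Hypothetical per-frame features (made-up numbers, not real data).
audio_energy = [0.1, 0.8, 0.9, 0.2, 0.7, 0.1]
mouth_motion = {
    "target_a": [0.0, 0.7, 0.8, 0.1, 0.6, 0.0],  # moves in step with the audio
    "target_b": [0.5, 0.4, 0.5, 0.5, 0.4, 0.5],  # nearly still
}
speaker, scores = pick_speaker(audio_energy, mouth_motion)
```

A time-delay neural network would instead learn this relationship from windows of consecutive frames, which lets it tolerate small lags between audio and video that a frame-by-frame correlation does not model.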