Invention Grant
US09165182B2 Method and apparatus for using face detection information to improve speaker segmentation (in force)

Method and apparatus for using face detection information to improve speaker segmentation
Abstract:
In one embodiment, a method includes obtaining media that includes a video stream and an audio stream. The method also includes detecting a number of faces visible in the video stream, and performing a speaker segmentation on the media. Performing the speaker segmentation on the media includes utilizing the number of faces visible in the video stream to augment the speaker segmentation.
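The abstract does not say how the face count augments the segmentation. As a minimal sketch of one conceivable reading, the Python below treats the largest number of faces seen in any video frame as an upper bound on the number of speaker clusters. Every name here (`estimate_max_speakers`, `segment_speakers`, `merge_threshold`) and the centroid-based agglomerative clustering itself are illustrative assumptions, not the patented method.

```python
from typing import List, Sequence


def estimate_max_speakers(face_counts: Sequence[int]) -> int:
    """Assumed heuristic: the most faces seen in any frame, and at least 1."""
    return max(1, max(face_counts, default=1))


def _centroid(vectors: List[List[float]]) -> List[float]:
    dim = len(vectors[0])
    return [sum(v[d] for v in vectors) / len(vectors) for d in range(dim)]


def _distance(a: List[float], b: List[float]) -> float:
    return sum((x - y) ** 2 for x, y in zip(a, b)) ** 0.5


def segment_speakers(
    features: List[List[float]],
    face_counts: Sequence[int],
    merge_threshold: float = 1.0,
) -> List[int]:
    """Cluster per-segment audio features into speakers.

    Merging stops only when the closest clusters are farther apart than
    merge_threshold AND the cluster count is within the face-derived cap.
    """
    max_speakers = estimate_max_speakers(face_counts)
    clusters = [[i] for i in range(len(features))]

    while len(clusters) > 1:
        # Find the closest pair of clusters by centroid distance.
        best = None
        for i in range(len(clusters)):
            for j in range(i + 1, len(clusters)):
                ci = _centroid([features[k] for k in clusters[i]])
                cj = _centroid([features[k] for k in clusters[j]])
                d = _distance(ci, cj)
                if best is None or d < best[0]:
                    best = (d, i, j)
        d, i, j = best
        if d > merge_threshold and len(clusters) <= max_speakers:
            break
        clusters[i].extend(clusters[j])
        del clusters[j]

    # Assign one speaker label per audio segment.
    labels = [0] * len(features)
    for label, members in enumerate(clusters):
        for k in members:
            labels[k] = label
    return labels


# Toy one-dimensional "audio features" for five segments.
features = [[0.0], [0.1], [5.0], [5.2], [10.0]]
capped = segment_speakers(features, face_counts=[1, 2, 2, 1])
uncapped = segment_speakers(features, face_counts=[3, 3, 3, 3])
```

In this toy input the audio features alone suggest three speakers, but with at most two faces visible the sketch merges the two nearest clusters so that only two speaker labels remain. A real system would presumably use actual face detection and acoustic embeddings in place of these hand-written numbers.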