SYSTEMS AND METHODS FOR DECOMPOSING A VIDEO STREAM INTO FACE STREAMS
摘要:
An audio/video stream may include an audio stream and a video stream. The video stream may be decomposed into a plurality of face streams. Each of the face streams may include a cropped version of the video stream and be focused on the face of one of the individuals captured in the video stream. Facial recognition may be used to associate each of the face streams with an identity of the individual captured in the respective face stream. Additionally, voice recognition may be used to recognize the identity of the active speaker in the audio stream. The face stream associated with an identity matching the active speaker's identity may be labeled as the face stream of the active speaker. In a “Room SplitView” mode, the face stream of the active speaker is rendered in a more prominent manner than the other face streams.
信息查询
0/0