Publication No.: US20250104318A1
Publication Date: 2025-03-27
Application No.: US18601097
Filing Date: 2024-03-11
Applicant: Samsung Electronics Co., Ltd.
Inventors: Liang Zhao, Siva Penke, Christopher Peri, Byeonghee Yu, Jisun Park
Abstract: In one embodiment, a method includes accessing an audio input that includes a mixture of vocal sounds and non-vocal sounds and separating, by a trained audio source separation model, the audio input into a first audio output representing the vocal sounds and a second audio output representing the non-vocal sounds. The method further includes determining, by one or more trained avatar animation models and by separately encoding the first audio output representing the vocal sounds and the second audio output representing the non-vocal sounds, an avatar animation temporally corresponding to the audio input; and rendering, in real time and temporally coincident with the audio input, the determined avatar animation.
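The data flow described in the abstract (separate, encode each stream on its own, animate, render frame by frame) can be illustrated with a minimal sketch. Everything below is a hypothetical stand-in and not the claimed implementation: the names `separate`, `encode_vocal`, `encode_non_vocal`, `animate`, `animate_stream`, and `AvatarPose` are invented for illustration, a three-sample moving average takes the place of the trained audio source separation model, and RMS energy takes the place of the learned encoders and avatar animation models.

```python
import math
from dataclasses import dataclass
from typing import Iterable, Iterator, List, Tuple

Frame = List[float]  # one short frame of audio samples


@dataclass
class AvatarPose:
    mouth_open: float  # driven only by the vocal stream
    body_sway: float   # driven only by the non-vocal stream


def separate(frame: Frame) -> Tuple[Frame, Frame]:
    """Toy stand-in for the trained separation model (assumption).

    A causal 3-sample moving average is labelled "non-vocal" and the
    residual is labelled "vocal", so the two outputs sum to the input.
    """
    non_vocal = []
    for i in range(len(frame)):
        window = frame[max(0, i - 2): i + 1]
        non_vocal.append(sum(window) / len(window))
    vocal = [s - n for s, n in zip(frame, non_vocal)]
    return vocal, non_vocal


def _rms(stream: Frame) -> float:
    """Root-mean-square energy; 0.0 for an empty stream."""
    if not stream:
        return 0.0
    return math.sqrt(sum(s * s for s in stream) / len(stream))


# Two separate encoders, one per stream, mirroring the "separately
# encoding" step. Both are placeholders for learned encoders.
def encode_vocal(vocal: Frame) -> float:
    return _rms(vocal)


def encode_non_vocal(non_vocal: Frame) -> float:
    return _rms(non_vocal)


def animate(vocal_code: float, non_vocal_code: float) -> AvatarPose:
    """Placeholder for the avatar animation model(s): clamp codes to [0, 1]."""
    return AvatarPose(
        mouth_open=min(1.0, vocal_code),
        body_sway=min(1.0, non_vocal_code),
    )


def animate_stream(frames: Iterable[Frame]) -> Iterator[AvatarPose]:
    """Yield one pose per incoming frame, keeping animation aligned with audio."""
    for frame in frames:
        vocal, non_vocal = separate(frame)
        yield animate(encode_vocal(vocal), encode_non_vocal(non_vocal))
```

Because `animate_stream` is a generator that emits exactly one pose per input frame, a renderer consuming it can stay in step with the audio as it arrives, which is the property the abstract calls "temporally coincident". A real system would replace each placeholder with a trained model and would also need to bound per-frame latency.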