Publication No.: US20240346735A1
Publication Date: 2024-10-17
Application No.: US18633750
Filing Date: 2024-04-12
Inventors: Luchuan Song, Chenliang Xu
CPC Classification: G06T13/40, G06T3/18, G06T7/20, G06T7/70, G06T17/20, G06V10/44, G06V40/174, H04N21/816
Abstract: Features described herein pertain to generative machine learning and, more particularly, to machine learning techniques for generating virtual characters. A video that depicts a first subject and includes an audio component corresponding to speech spoken by the first subject, together with an image that depicts a second subject, is provided to one or more machine learning models, which use these inputs to generate a video that depicts the second subject. In the generated video, the second subject can blink and exhibit emotional characteristics and reactions that are responsive to the speech spoken by the first subject and/or to a characteristic of the first subject, such as a facial expression and/or head pose motion. The generated video can be displayed and/or stored for later retrieval.