Publication No.: US12254548B1
Publication Date: 2025-03-18
Application No.: US18082709
Filing Date: 2022-12-16
Applicant: Amazon Technologies, Inc.
Inventor: Gourav Datta, Vivek Yadav, Yue Wu, Ayush Jaiswal, Rajiv M Reddy, Prateek Singhal, Karthik Ramakrishnan, Premkumar Natarajan
Abstract: A system is configured to perform style-aware listener animation. By representing different listening styles (e.g., facial expressions) in an embedding space, a single model can be trained to generate unique facial animations for a number of distinct listeners. Individual listening styles can thus be associated with a listener identifier, enabling the system to (i) animate a plurality of different listeners, each with unique nonverbal behavior, and/or (ii) select a particular listener identifier or desired type of listener style with which to animate. This enables the model to generalize to new listeners and generate additional listener facial responses without needing training data for each new listener. The model may process a listener representation style or listener identifier, along with input data corresponding to a speaker talking, to generate unique facial animation responsive to the speech.
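
The idea in the abstract, one shared model conditioned on a per-listener style embedding, can be illustrated with a minimal sketch. Everything below is an assumption made for illustration and is not taken from the patent: the class name `StyleAwareListenerModel`, the dimensions, the random (untrained) weights, the single linear layer, and the `blend_styles` helper are all hypothetical. The sketch only shows how a listener identifier could select a style vector, and how a new point in the style space could drive the same model without per-listener training data.

```python
import random
from typing import Dict, List

STYLE_DIM = 4    # hypothetical size of a listening-style embedding
SPEAKER_DIM = 3  # hypothetical size of one frame of speaker features
FACE_DIM = 5     # hypothetical number of facial animation parameters

Vector = List[float]


class StyleAwareListenerModel:
    """Toy stand-in for a single model shared by all listeners."""

    def __init__(self, listener_ids: List[str], seed: int = 0) -> None:
        rng = random.Random(seed)
        # One style embedding per listener identifier. These are random
        # placeholders; a real system would learn them during training.
        self.style_table: Dict[str, Vector] = {
            lid: [rng.uniform(-1.0, 1.0) for _ in range(STYLE_DIM)]
            for lid in listener_ids
        }
        # Shared weights mapping [speaker features ; style] to face parameters.
        in_dim = SPEAKER_DIM + STYLE_DIM
        self.weights = [
            [rng.uniform(-1.0, 1.0) for _ in range(in_dim)]
            for _ in range(FACE_DIM)
        ]

    def animate(self, speaker_frames: List[Vector], style: Vector) -> List[Vector]:
        """Produce one facial-parameter vector per speaker frame."""
        frames = []
        for frame in speaker_frames:
            x = list(frame) + list(style)  # condition the input on the style
            frames.append([sum(w * v for w, v in zip(row, x)) for row in self.weights])
        return frames

    def animate_as(self, speaker_frames: List[Vector], listener_id: str) -> List[Vector]:
        """Look up the style for a known listener identifier, then animate."""
        return self.animate(speaker_frames, self.style_table[listener_id])


def blend_styles(a: Vector, b: Vector, t: float) -> Vector:
    """Linearly interpolate two style embeddings (t=0 gives a, t=1 gives b)."""
    return [(1.0 - t) * x + t * y for x, y in zip(a, b)]


model = StyleAwareListenerModel(["listener_a", "listener_b"])
speech = [[0.1, 0.2, 0.3], [0.0, -0.5, 0.4]]  # two frames of made-up speaker features

out_a = model.animate_as(speech, "listener_a")
out_b = model.animate_as(speech, "listener_b")

# A "new listener" with no training data of its own: a point between two
# known styles in the embedding space, fed to the same shared model.
new_style = blend_styles(model.style_table["listener_a"],
                         model.style_table["listener_b"], 0.5)
out_new = model.animate(speech, new_style)
```

The same speaker input yields different facial parameters for `listener_a` and `listener_b` because only the style vector changes; the weights are shared. Interpolation is just one simple way to pick a new style and is used here purely as an example.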