-
Publication No.: US20240346731A1
Publication Date: 2024-10-17
Application No.: US18484586
Filing Date: 2023-10-11
Applicant: Metaphysic.AI
Inventors: Thomas Graham, Chris Ume, Jo Plaete, Martin Adams
CPC Classification: G06T13/40, G06N20/00, G06T13/205, G06T19/006, G10L15/26
Abstract: Prompting a trained artificial intelligence (AI) model(s) to output photoreal synthetic content in real-time is described. In some examples, one or more AI models are trained using sequential video frames as training data to obtain one or more trained AI models configured to generate temporally-coherent output data. In an example process, user-provided prompt data representing a prompt provided by a user is received, output data representing synthetic content is generated using the trained AI model(s) based at least in part on the user-provided prompt data, and video content featuring the synthetic content is caused to be displayed on a display based at least in part on the output data. In some examples, the output data is provided to the trained AI model(s) as part of a feedback loop to generate further output data as part of a real-time, iterative prompting system.
-
Publication No.: US20240212249A1
Publication Date: 2024-06-27
Application No.: US18089487
Filing Date: 2022-12-27
Applicant: Metaphysic.AI
Inventors: Chris Ume, Jo Plaete, Martin Adams, Thomas Graham
IPC Classification: G06T13/40, G06N20/00, G06T13/20, G06T19/00, G10L13/033
CPC Classification: G06T13/40, G06N20/00, G06T13/205, G06T19/006, G10L13/033
Abstract: Using latent space manipulation and neural animation to generate hyperreal synthetic faces is described. A machine learning model(s) may be trained to generate a synthetic face of a subject featured in unaltered video content based at least in part on video data of an actor making a mouth-generated sound or a three-dimensional (3D) model of a face of the subject that has been animated in accordance with the mouth-generated sound. Latent space manipulation and neural animation may be used with the trained machine learning model(s) to generate instances of the synthetic face, and the instances of the synthetic face can be used to create altered video content featuring the subject with the synthetic face making the mouth-generated sound.
-