专利检索 ap:("Metaphysic.AI") AND inv:"Thomas Graham" 第 1 页

1.

发明公开
LIVE MODEL PROMPTING AND REAL-TIME OUTPUT OF PHOTOREAL SYNTHETIC CONTENT 审中-公开

公开(公告)号：US20240346731A1

公开(公告)日：2024-10-17

申请号：US18484586

申请日：2023-10-11

申请人： Metaphysic.AI

发明人： Thomas Graham , Chris Ume , Jo Plaete , Martin Adams

IPC分类号： G06T13/40 , G06N20/00 , G06T13/20 , G06T19/00

CPC分类号： G06T13/40 , G06N20/00 , G06T13/205 , G06T19/006 , G10L15/26

摘要： Prompting a trained artificial intelligence (AI) model(s) to output photoreal synthetic content in real-time is described. In some examples, one or more AI models are trained using sequential video frames as training data to obtain one or more trained AI models configured to generate temporally-coherent output data. In an example process, user-provided prompt data representing a prompt provided by a user is received, output data representing synthetic content is generated using the trained AI model(s) based at least in part on the user-provided prompt data, and video content featuring the synthetic content is caused to be displayed on a display based at least in part on the output data. In some examples, the output data is provided to the trained AI model(s) as part of a feedback loop to generate further output data as part of a real-time, iterative prompting system.

2.

发明公开
LATENT SPACE EDITING AND NEURAL ANIMATION TO GENERATE HYPERREAL SYNTHETIC FACES 审中-公开

公开(公告)号：US20240212249A1

公开(公告)日：2024-06-27

申请号：US18089487

申请日：2022-12-27

申请人： Metaphysic.AI

发明人： Chris Ume , Jo Plaete , Martin Adams , Thomas Graham

IPC分类号： G06T13/40 , G06N20/00 , G06T13/20 , G06T19/00 , G10L13/033

CPC分类号： G06T13/40 , G06N20/00 , G06T13/205 , G06T19/006 , G10L13/033

摘要： Using latent space manipulation and neural animation to generate hyperreal synthetic faces is described. A machine learning model(s) may be trained to generate a synthetic face of a subject featured in unaltered video content based at least in part on video data of an actor making a mouth-generated sound or a three-dimensional (3D) model of a face of the subject that has been animated in accordance with the mouth-generated sound. Latent space manipulation and neural animation may be used with the trained machine learning model(s) to generate instances of the synthetic face, and the instances of the synthetic face can be used to create altered video content featuring the subject with the synthetic face making the mouth-generated sound.