-
公开(公告)号:US20240087179A1
公开(公告)日:2024-03-14
申请号:US18462703
申请日:2023-09-07
Applicant: NEC Laboratories America, Inc.
Inventor: Renqiang Min , Kai Li , Hans Peter Graf , Haomiao Ni
CPC classification number: G06T11/00 , G06T3/0093 , G06V20/46
Abstract: Methods and systems for training a model include training an encoder in an unsupervised fashion based on a backward latent flow between a reference frame and a driving frame taken from a same video. A diffusion model is trained that generates a video sequence responsive to an input image and a text condition, using the trained encoder to determine a latent flow sequence and occlusion map sequence of a labeled training video.