- Patent title: PRIOR GUIDED LATENT DIFFUSION
- Application number: US18301671
- Filing date: 2023-04-17
- Publication number: US20240346629A1
- Publication date: 2024-10-17
- Inventors: Midhun Harikumar, Venkata Naveen Kumar Yadav Marri, Ajinkya Gorakhnath Kale, Pranav Vineet Aggarwal, Vinh Ngoc Khuc
- Applicant: ADOBE INC.
- Applicant address: US CA SAN JOSE
- Patentee: ADOBE INC.
- Current patentee: ADOBE INC.
- Current patentee address: US CA SAN JOSE
- Main classification: G06T5/00
- IPC classification: G06T5/00; G06F40/279; G06T5/50
Abstract:
Systems and methods for image processing are described. Embodiments of the present disclosure obtain a text prompt for text guided image generation. A multi-modal encoder of an image processing apparatus encodes the text prompt to obtain a text embedding. A diffusion prior model of the image processing apparatus converts the text embedding to an image embedding. A latent diffusion model of the image processing apparatus generates an image based on the image embedding, wherein the image includes an element described by the text prompt.
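
The three-stage flow in the abstract (text prompt to text embedding, text embedding to image embedding, image embedding to image) can be illustrated with a toy sketch. Everything below is a hypothetical stand-in rather than the patented implementation: the function names, `EMBED_DIM`, the hash-based encoder, and the simple "move halfway toward the conditioning signal" update are assumptions chosen only to show how the stages hand data to each other. A real system would use trained neural networks for each stage.

```python
import hashlib
import random
from typing import List

EMBED_DIM = 8  # hypothetical embedding size, chosen only for the sketch


def encode_text(prompt: str) -> List[float]:
    """Toy stand-in for the multi-modal encoder: prompt -> text embedding.

    Derives a deterministic vector from a hash of the prompt.
    """
    digest = hashlib.sha256(prompt.encode("utf-8")).digest()
    return [b / 255.0 for b in digest[:EMBED_DIM]]


def diffusion_prior(text_emb: List[float], steps: int = 4, seed: int = 0) -> List[float]:
    """Toy stand-in for the diffusion prior: text embedding -> image embedding.

    Starts from Gaussian noise and repeatedly moves halfway toward the
    conditioning vector, mimicking iterative denoising in spirit only.
    """
    rng = random.Random(seed)
    x = [rng.gauss(0.0, 1.0) for _ in text_emb]
    for _ in range(steps):
        x = [xi + 0.5 * (ti - xi) for xi, ti in zip(x, text_emb)]
    return x


def latent_diffusion(image_emb: List[float], height: int = 4, width: int = 4,
                     steps: int = 4, seed: int = 0) -> List[List[float]]:
    """Toy stand-in for the latent diffusion model: image embedding -> image.

    Denoises a random grid toward a value derived from the image embedding.
    """
    rng = random.Random(seed)
    cond = sum(image_emb) / len(image_emb)
    latent = [[rng.gauss(0.0, 1.0) for _ in range(width)] for _ in range(height)]
    for _ in range(steps):
        latent = [[v + 0.5 * (cond - v) for v in row] for row in latent]
    return latent


def generate(prompt: str) -> List[List[float]]:
    """Chain the three stages in the order the abstract describes."""
    text_emb = encode_text(prompt)          # multi-modal encoder
    image_emb = diffusion_prior(text_emb)   # diffusion prior
    return latent_diffusion(image_emb)      # latent diffusion model


if __name__ == "__main__":
    image = generate("a red bicycle")
    print(len(image), "x", len(image[0]), "toy image generated")
```

The point of the sketch is the data flow: the latent diffusion stage is conditioned on the image embedding produced by the prior, not directly on the text embedding.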