IMAGE SYNTHESIS USING DIFFUSION MODELS CREATED FROM SINGLE OR MULTIPLE VIEW IMAGES
Abstract:
A method and system for novel image synthesis using generative networks are provided. An encoder-based model is trained to infer a 3D representation of an input image. A feature image is then generated from the 3D representation using volume rendering. The feature image is concatenated with a noisy image and processed by a denoiser network to predict an output image from a novel viewpoint that is consistent with the input image. The denoiser network can be a modified Noise Conditional Score Network (NCSN). In some embodiments, multiple input images or keyframes are provided, and a separate 3D representation is generated for each input image. The feature image is then generated, during volume rendering, by sampling each of the 3D representations and applying a mean-pooling operation to produce an aggregate feature image.
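The following is a minimal sketch of the pipeline described in the abstract, written in a PyTorch style. The module names, architectures, voxel-grid 3D representation, and simplified feature rendering are illustrative assumptions, not the patent's actual implementation; it shows only the conditioning flow (3D representation, rendered feature image, concatenation with a noisy image, denoising) and the mean-pooling aggregation for multiple keyframes.

```python
# Hypothetical sketch; module names and shapes are assumptions, not the patented method.
import torch
import torch.nn as nn
import torch.nn.functional as F

class Encoder3D(nn.Module):
    """Infers a toy voxel-grid 3D representation from one input image
    (a voxel feature grid stands in for the method's 3D representation)."""
    def __init__(self, feat_dim=16, grid=8):
        super().__init__()
        self.feat_dim, self.grid = feat_dim, grid
        self.net = nn.Sequential(
            nn.Conv2d(3, 32, 3, stride=2, padding=1), nn.ReLU(),
            nn.AdaptiveAvgPool2d(grid),
            nn.Conv2d(32, feat_dim * grid, 1),
        )

    def forward(self, img):                       # img: (B, 3, H, W)
        vol = self.net(img)                       # (B, feat_dim*grid, grid, grid)
        return vol.view(img.shape[0], self.feat_dim, self.grid, self.grid, self.grid)

def render_feature_image(volume, out_hw=64):
    """Toy stand-in for volume rendering: integrate features along the depth
    axis of the voxel grid and resample to the target image resolution."""
    feat = volume.mean(dim=2)                     # collapse depth: (B, C, grid, grid)
    return F.interpolate(feat, size=(out_hw, out_hw), mode="bilinear", align_corners=False)

class Denoiser(nn.Module):
    """Score-network-style denoiser conditioned on the rendered feature image
    via channel-wise concatenation with the noisy image."""
    def __init__(self, feat_dim=16):
        super().__init__()
        self.net = nn.Sequential(
            nn.Conv2d(3 + feat_dim, 64, 3, padding=1), nn.ReLU(),
            nn.Conv2d(64, 3, 3, padding=1),
        )

    def forward(self, noisy_img, feat_img):
        return self.net(torch.cat([noisy_img, feat_img], dim=1))

encoder, denoiser = Encoder3D(), Denoiser()

# Single input image: one 3D representation -> feature image -> denoise.
input_img = torch.randn(1, 3, 64, 64)
noisy_img = torch.randn(1, 3, 64, 64)             # noisy image for the novel viewpoint
feat_img = render_feature_image(encoder(input_img))
pred = denoiser(noisy_img, feat_img)              # predicted novel-view image

# Multiple keyframes: a separate 3D representation per keyframe, rendered and mean-pooled.
keyframes = [torch.randn(1, 3, 64, 64) for _ in range(3)]
feats = torch.stack([render_feature_image(encoder(k)) for k in keyframes], dim=0)
aggregate_feat = feats.mean(dim=0)                # mean-pooling into an aggregate feature image
pred_multi = denoiser(noisy_img, aggregate_feat)
```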