UTILIZING CROSS-ATTENTION GUIDANCE TO PRESERVE CONTENT IN DIFFUSION-BASED IMAGE MODIFICATIONS

Invention Publication

US20240331236A1 UTILIZING CROSS-ATTENTION GUIDANCE TO PRESERVE CONTENT IN DIFFUSION-BASED IMAGE MODIFICATIONS Under Examination - Published

Patent Title: UTILIZING CROSS-ATTENTION GUIDANCE TO PRESERVE CONTENT IN DIFFUSION-BASED IMAGE MODIFICATIONS
Application No.: US18178194

Application Date: 2023-03-03
Publication No.: US20240331236A1

Publication Date: 2024-10-03
Inventor: Yijun Li, Richard Zhang, Krishna Kumar Singh, Jingwan Lu, Gaurav Parmar, Jun-Yan Zhu
Applicant: Adobe Inc.
Applicant Address: US CA San Jose
Assignee: Adobe Inc.
Current Assignee: Adobe Inc.
Current Assignee Address: US CA San Jose
Main IPC: G06T11/60
IPC: G06T11/60; G06T5/00; G06T9/00; G06V10/74; G06V10/82; G06V20/70

Abstract:

The present disclosure relates to systems, non-transitory computer-readable media, and methods for utilizing machine learning models to generate modified digital images. In particular, in some embodiments, the disclosed systems generate image editing directions between textual identifiers of two visual features utilizing a language prediction machine learning model and a text encoder. In some embodiments, the disclosed systems generate an inversion of a digital image utilizing a regularized inversion model to guide forward diffusion of the digital image. In some embodiments, the disclosed systems utilize cross-attention guidance to preserve structural details of a source digital image when generating a modified digital image with a diffusion neural network.
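
As an illustration of the "image editing direction" idea only, the following Python sketch shows one plausible way such a direction could be formed from text embeddings. The abstract does not say how the text encoder's outputs are combined, so the difference of mean embeddings used here is an assumption. The function toy_encode is a hypothetical stand-in for a real text encoder, and the hard-coded sentences stand in for phrases that a language prediction model might produce for the two textual identifiers ("cat" and "dog" are example identifiers, not taken from the record).

```python
from statistics import fmean


def toy_encode(sentence, dim=8):
    """Hypothetical stand-in for a text encoder (assumption, not from the record).

    Maps a sentence to a deterministic, unit-length vector so the example
    runs without any model weights.
    """
    vec = [0.0] * dim
    for i, ch in enumerate(sentence.lower()):
        vec[(ord(ch) + i) % dim] += 1.0
    norm = sum(v * v for v in vec) ** 0.5 or 1.0
    return [v / norm for v in vec]


def mean_embedding(sentences, encode=toy_encode):
    """Average the embeddings of several sentences, component by component."""
    vectors = [encode(s) for s in sentences]
    return [fmean(column) for column in zip(*vectors)]


def edit_direction(source_sentences, target_sentences, encode=toy_encode):
    """Assumed form of an editing direction: mean(target) minus mean(source)."""
    src = mean_embedding(source_sentences, encode)
    tgt = mean_embedding(target_sentences, encode)
    return [t - s for s, t in zip(src, tgt)]


# Hard-coded phrases; in the described system these would presumably be
# produced by the language prediction model for each textual identifier.
source = ["a photo of a cat", "a cat sitting on a sofa"]
target = ["a photo of a dog", "a dog sitting on a sofa"]

direction = edit_direction(source, target)
print(len(direction))
```

Under this assumption the direction is antisymmetric: swapping the source and target phrase sets negates every component, and using the same phrases for both yields a zero vector.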

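IPC Classification:

G	Physics
G06	Computing; calculating or counting
G06T	Image data processing or generation, in general
G06T11/00	2D [two-dimensional] image generation
G06T11/60	. Editing figures and text; combining figures or text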