-
1.
Publication No.: US20240185588A1
Publication date: 2024-06-06
Application No.: US18062314
Application date: 2022-12-06
Applicant: ADOBE INC.
Inventor: Nupur Kumari, Richard Zhang, Junyan Zhu, Elya Shechtman
IPC: G06V10/778, G06V10/75, G06V10/774
CPC classification number: G06V10/778, G06V10/751, G06V10/774
Abstract: Systems and methods for fine-tuning diffusion models are described. Embodiments of the present disclosure obtain an input text indicating an element to be included in an image; generate a synthetic image depicting the element based on the input text using a diffusion model trained by comparing synthetic images depicting the element to training images depicting elements similar to the element and updating selected parameters corresponding to an attention layer of the diffusion model based on the comparison.
-
2.
Publication No.: US20220414431A1
Publication date: 2022-12-29
Application No.: US17899936
Application date: 2022-08-31
Applicant: Adobe Inc.
Inventor: Richard Zhang, Sylvain Philippe Paris, Junyan Zhu, Aaron Phillip Hertzmann, Jacob Minyoung Huh
Abstract: A target image is projected into a latent space of generative model by determining a latent vector by applying a gradient-free technique and a class vector by applying a gradient-based technique. An image is generated from the latent and class vectors, and a loss function is used to determine a loss between the target image and the generated image. This determining of the latent vector and the class vector, generating an image, and using the loss function is repeated until a loss condition is satisfied. In response to the loss condition being satisfied, the latent and class vectors that resulted in the loss condition being satisfied are identified as the final latent and class vectors, respectively. The final latent and class vectors are provided to the generative model and multiple weights of the generative model are adjusted to fine-tune the generative model.
-
3.
Publication No.: US20220148242A1
Publication date: 2022-05-12
Application No.: US17091440
Application date: 2020-11-06
Applicant: Adobe Inc.
Inventor: Bryan Russell, Taesung Park, Richard Zhang, Junyan Zhu, Alexander Andonian
Abstract: This disclosure describes methods, non-transitory computer readable storage media, and systems that utilize a contrastive perceptual loss to modify neural networks for generating synthetic digital content items. For example, the disclosed systems generate a synthetic digital content item based on a guide input to a generative neural network. The disclosed systems utilize an encoder neural network to generate encoded representations of the synthetic digital content item and a corresponding ground-truth digital content item. Additionally, the disclosed systems sample patches from the encoded representations of the encoded digital content items and then determine a contrastive loss based on the perceptual distances between the patches in the encoded representations. Furthermore, the disclosed systems jointly update the parameters of the generative neural network and the encoder neural network utilizing the contrastive loss.
-
4.
Publication No.: US20230102055A1
Publication date: 2023-03-30
Application No.: US18058163
Application date: 2022-11-22
Applicant: Adobe Inc.
Inventor: Taesung Park, Richard Zhang, Oliver Wang, Junyan Zhu, Jingwan Lu, Elya Shechtman, Alexei A. Efros
Abstract: The present disclosure relates to systems, methods, and non-transitory computer readable media for generating a modified digital image from extracted spatial and global codes. For example, the disclosed systems can utilize a global and spatial autoencoder to extract spatial codes and global codes from digital images. The disclosed systems can further utilize the global and spatial autoencoder to generate a modified digital image by combining extracted spatial and global codes in various ways for various applications such as style swapping, style blending, and attribute editing.
-
5.
Publication No.: US11468294B2
Publication date: 2022-10-11
Application No.: US16798271
Application date: 2020-02-21
Applicant: Adobe Inc.
Inventor: Richard Zhang, Sylvain Philippe Paris, Junyan Zhu, Aaron Phillip Hertzmann, Jacob Minyoung Huh
Abstract: A target image is projected into a latent space of generative model by determining a latent vector by applying a gradient-free technique and a class vector by applying a gradient-based technique. An image is generated from the latent and class vectors, and a loss function is used to determine a loss between the target image and the generated image. This determining of the latent vector and the class vector, generating an image, and using the loss function is repeated until a loss condition is satisfied. In response to the loss condition being satisfied, the latent and class vectors that resulted in the loss condition being satisfied are identified as the final latent and class vectors, respectively. The final latent and class vectors are provided to the generative model and multiple weights of the generative model are adjusted to fine-tune the generative model.
-
6.
Publication No.: US12254545B2
Publication date: 2025-03-18
Application No.: US18298138
Application date: 2023-04-10
Applicant: Adobe Inc.
Inventor: Taesung Park, Alexei A Efros, Elya Shechtman, Richard Zhang, Junyan Zhu
Abstract: The present disclosure relates to systems, methods, and non-transitory computer readable media for accurately and flexibly generating modified digital images utilizing a novel swapping autoencoder that incorporates scene layout. In particular, the disclosed systems can receive a scene layout map that indicates or defines locations for displaying specific digital content within a digital image. In addition, the disclosed systems can utilize the scene layout map to guide combining portions of digital image latent code to generate a modified digital image with a particular textural appearance and a particular geometric structure defined by the scene layout map. Additionally, the disclosed systems can utilize a scene layout map that defines a portion of a digital image to modify by, for instance, adding new digital content to the digital image, and can generate a modified digital image depicting the new digital content.
-
7.
Publication No.: US20240282025A1
Publication date: 2024-08-22
Application No.: US18170963
Application date: 2023-02-17
Applicant: ADOBE INC.
Inventor: Taesung Park, Minguk Kang, Richard Zhang, Junyan Zhu, Elya Shechtman, Sylvain Paris
IPC: G06T11/60, G06F40/126, G06F40/151, G06F40/284, G06T5/20
CPC classification number: G06T11/60, G06F40/126, G06F40/151, G06F40/284, G06T5/20, G06T2207/20004, G06T2207/20081, G06T2207/20084
Abstract: Systems and methods for image generation are provided. An aspect of the systems and methods includes obtaining a text prompt, generating a style vector based on the text prompt, generating an adaptive convolution filter based on the style vector, and generating an image corresponding to the text prompt based on the adaptive convolution filter.
-
8.
Publication No.: US20230245363A1
Publication date: 2023-08-03
Application No.: US18298138
Application date: 2023-04-10
Applicant: Adobe Inc.
Inventor: Taesung Park, Alexei A. Efros, Elya Shechtman, Richard Zhang, Junyan Zhu
Abstract: The present disclosure relates to systems, methods, and non-transitory computer readable media for accurately and flexibly generating modified digital images utilizing a novel swapping autoencoder that incorporates scene layout. In particular, the disclosed systems can receive a scene layout map that indicates or defines locations for displaying specific digital content within a digital image. In addition, the disclosed systems can utilize the scene layout map to guide combining portions of digital image latent code to generate a modified digital image with a particular textural appearance and a particular geometric structure defined by the scene layout map. Additionally, the disclosed systems can utilize a scene layout map that defines a portion of a digital image to modify by, for instance, adding new digital content to the digital image, and can generate a modified digital image depicting the new digital content.
-
9.
Publication No.: US11544880B2
Publication date: 2023-01-03
Application No.: US16874399
Application date: 2020-05-14
Applicant: Adobe Inc.
Inventor: Taesung Park, Richard Zhang, Oliver Wang, Junyan Zhu, Jingwan Lu, Elya Shechtman, Alexei A Efros
Abstract: The present disclosure relates to systems, methods, and non-transitory computer readable media for generating a modified digital image from extracted spatial and global codes. For example, the disclosed systems can utilize a global and spatial autoencoder to extract spatial codes and global codes from digital images. The disclosed systems can further utilize the global and spatial autoencoder to generate a modified digital image by combining extracted spatial and global codes in various ways for various applications such as style swapping, style blending, and attribute editing.
-
10.
Publication No.: US11514632B2
Publication date: 2022-11-29
Application No.: US17091440
Application date: 2020-11-06
Applicant: Adobe Inc.
Inventor: Bryan Russell, Taesung Park, Richard Zhang, Junyan Zhu, Alexander Andonian
Abstract: This disclosure describes methods, non-transitory computer readable storage media, and systems that utilize a contrastive perceptual loss to modify neural networks for generating synthetic digital content items. For example, the disclosed systems generate a synthetic digital content item based on a guide input to a generative neural network. The disclosed systems utilize an encoder neural network to generate encoded representations of the synthetic digital content item and a corresponding ground-truth digital content item. Additionally, the disclosed systems sample patches from the encoded representations of the encoded digital content items and then determine a contrastive loss based on the perceptual distances between the patches in the encoded representations. Furthermore, the disclosed systems jointly update the parameters of the generative neural network and the encoder neural network utilizing the contrastive loss.