专利检索 ap:("Adobe Inc.") AND inv:"Jun-Yan Zhu" 第 2 页

11.

发明授权
Generative image congealing 有权

公开(公告)号：US11762951B2

公开(公告)日：2023-09-19

申请号：US16951782

申请日：2020-11-18

申请人： Adobe Inc.

发明人： Elya Shechtman , William Peebles , Richard Zhang , Jun-Yan Zhu , Alyosha Efros

IPC分类号： G06F18/21 , G06N3/08 , G06T3/00 , G06F18/214 , G06N3/045

CPC分类号： G06F18/217 , G06F18/214 , G06N3/045 , G06N3/08 , G06T3/0068

摘要： Embodiments are disclosed for generative image congealing which provides an unsupervised learning technique that learns transformations of real data to improve the image quality of GANs trained using that image data. In particular, in one or more embodiments, the disclosed systems and methods comprise generating, by a spatial transformer network, an aligned real image for a real image from an unaligned real dataset, providing, by the spatial transformer network, the aligned real image to an adversarial discrimination network to determine if the aligned real image resembles aligned synthetic images generated by a generator network, and training, by a training manager, the spatial transformer network to learn updated transformations based on the determination of the adversarial discrimination network.

12.

发明申请
IDENTITY-PRESERVING TECHNIQUES FOR GENERATIVE ADVERSARIAL NETWORK PROJECTION 有权

公开(公告)号：US20220122305A1

公开(公告)日：2022-04-21

申请号：US17384273

申请日：2021-07-23

申请人： Adobe Inc.

发明人： Cameron Smith , Ratheesh Kalarot , Wei-An Lin , Richard Zhang , Niloy Mitra , Elya Shechtman , Shabnam Ghadar , Zhixin Shu , Yannick Hold-Geoffrey , Nathan Carr , Jingwan Lu , Oliver Wang , Jun-Yan Zhu

IPC分类号： G06T11/60 , G06T3/40

摘要： An improved system architecture uses a pipeline including an encoder and a Generative Adversarial Network (GAN) including a generator neural network to generate edited images with improved speed, realism, and identity preservation. The encoder produces an initial latent space representation of an input image by encoding the input image. The generator neural network generates an initial output image by processing the initial latent space representation of the input image. The system generates an optimized latent space representation of the input image using a loss minimization technique that minimizes a loss between the input image and the initial output image. The loss is based on target perceptual features extracted from the input image and initial perceptual features extracted from the initial output image. The system outputs the optimized latent space representation of the input image for downstream use.

13.

发明申请
MULTI-SCALE OUTPUT TECHNIQUES FOR GENERATIVE ADVERSARIAL NETWORKS 有权

公开(公告)号：US20220122222A1

公开(公告)日：2022-04-21

申请号：US17384283

申请日：2021-07-23

申请人： Adobe Inc.

发明人： Cameron Smith , Ratheesh Kalarot , Wei-An Lin , Richard Zhang , Niloy Mitra , Elya Shechtman , Shabnam Ghadar , Zhixin Shu , Yannick Hold-Geoffrey , Nathan Carr , Jingwan Lu , Oliver Wang , Jun-Yan Zhu

IPC分类号： G06T3/40 , G06T11/60

摘要： An improved system architecture uses a Generative Adversarial Network (GAN) including a specialized generator neural network to generate multiple resolution output images. The system produces a latent space representation of an input image. The system generates a first output image at a first resolution by providing the latent space representation of the input image as input to a generator neural network comprising an input layer, an output layer, and a plurality of intermediate layers and taking the first output image from an intermediate layer, of the plurality of intermediate layers of the generator neural network. The system generates a second output image at a second resolution different from the first resolution by providing the latent space representation of the input image as input to the generator neural network and taking the second output image from the output layer of the generator neural network.

14.

发明申请
SUPERVISED LEARNING TECHNIQUES FOR ENCODER TRAINING 有权

公开(公告)号：US20220121932A1

公开(公告)日：2022-04-21

申请号：US17384378

申请日：2021-07-23

申请人： Adobe Inc.

发明人： Ratheesh Kalarot , Wei-An Lin , Cameron Smith , Zhixin Shu , Baldo Faieta , Shabnam Ghadar , Jingwan Lu , Aliakbar Darabi , Jun-Yan Zhu , Niloy Mitra , Richard Zhang , Elya Shechtman

IPC分类号： G06N3/08 , G06N3/04

摘要： Systems and methods train an encoder neural network for fast and accurate projection into the latent space of a Generative Adversarial Network (GAN). The encoder is trained by providing an input training image to the encoder and producing, by the encoder, a latent space representation of the input training image. The latent space representation is provided as input to the GAN to generate a generated training image. A latent code is sampled from a latent space associated with the GAN and the sampled latent code is provided as input to the GAN. The GAN generates a synthetic training image based on the sampled latent code. The sampled latent code is provided as input to the encoder to produce a synthetic training code. The encoder is updated by minimizing a loss between the generated training image and the input training image, and the synthetic training code and the sampled latent code.