ATTRIBUTE DECORRELATION TECHNIQUES FOR IMAGE EDITING

    公开(公告)号:US20220122232A1

    公开(公告)日:2022-04-21

    申请号:US17468476

    申请日:2021-09-07

    申请人: Adobe Inc.

    IPC分类号: G06T5/00 G06T5/20 G06N3/08

    摘要: Systems and methods generate a filtering function for editing an image with reduced attribute correlation. An image editing system groups training data into bins according to a distribution of a target attribute. For each bin, the system samples a subset of the training data based on a pre-determined target distribution of a set of additional attributes in the training data. The system identifies a direction in the sampled training data corresponding to the distribution of the target attribute to generate a filtering vector for modifying the target attribute in an input image, obtains a latent space representation of an input image, applies the filtering vector to the latent space representation of the input image to generate a filtered latent space representation of the input image, and provides the filtered latent space representation as input to a neural network to generate an output image with a modification to the target attribute.

    DIRECT REGRESSION ENCODER ARCHITECTURE AND TRAINING

    公开(公告)号:US20220121931A1

    公开(公告)日:2022-04-21

    申请号:US17384371

    申请日:2021-07-23

    申请人: Adobe Inc.

    IPC分类号: G06N3/08 G06N3/04 G06K9/62

    摘要: Systems and methods train and apply a specialized encoder neural network for fast and accurate projection into the latent space of a Generative Adversarial Network (GAN). The specialized encoder neural network includes an input layer, a feature extraction layer, and a bottleneck layer positioned after the feature extraction layer. The projection process includes providing an input image to the encoder and producing, by the encoder, a latent space representation of the input image. Producing the latent space representation includes extracting a feature vector from the feature extraction layer, providing the feature vector to the bottleneck layer as input, and producing the latent space representation as output. The latent space representation produced by the encoder is provided as input to the GAN, which generates an output image based upon the latent space representation. The encoder is trained using specialized loss functions including a segmentation loss and a mean latent loss.