CONTROLLABLE DIFFUSION MODEL
    1.
    发明申请

    公开(公告)号:US20250078349A1

    公开(公告)日:2025-03-06

    申请号:US18459526

    申请日:2023-09-01

    Applicant: ADOBE INC.

    Abstract: A method, apparatus, and non-transitory computer readable medium for image generation are described. Embodiments of the present disclosure obtain a content input and a style input via a user interface or from a database. The content input includes a target spatial layout and the style input includes a target style. A content encoder of an image processing apparatus encodes the content input to obtain a spatial layout mask representing the target spatial layout. A style encoder of the image processing apparatus encodes the style input to obtain a style embedding representing the target style. An image generation model of the image processing apparatus generates an image based on the spatial layout mask and the style embedding, where the image includes the target spatial layout and the target style.

    GAN IMAGE GENERATION FROM FEATURE REGULARIZATION

    公开(公告)号:US20250037431A1

    公开(公告)日:2025-01-30

    申请号:US18357621

    申请日:2023-07-24

    Applicant: ADOBE INC.

    Abstract: Systems and methods for training a Generative Adversarial Network (GAN) using feature regularization are described herein. Embodiments are configured to generate a candidate image using a generator network of a GAN, classify the candidate image as real or generated using a discriminator network of the GAN, and train the GAN to generate realistic images based on the classifying of the candidate image. The training process includes regularizing a gradient with respect to features extracted using a discriminator network of the GAN.

    Generating synthesized digital images utilizing class-specific machine-learning models

    公开(公告)号:US11861762B2

    公开(公告)日:2024-01-02

    申请号:US17400474

    申请日:2021-08-12

    Applicant: Adobe Inc.

    CPC classification number: G06T11/00 G06T2210/12

    Abstract: This disclosure describes methods, non-transitory computer readable storage media, and systems that generate synthetized digital images using class-specific generators for objects of different classes. The disclosed system modifies a synthesized digital image by utilizing a plurality of class-specific generator neural networks to generate a plurality of synthesized objects according to object classes identified in the synthesized digital image. The disclosed system determines object classes in the synthesized digital image such as via a semantic label map corresponding to the synthesized digital image. The disclosed system selects class-specific generator neural networks corresponding to the classes of objects in the synthesized digital image. The disclosed system also generates a plurality of synthesized objects utilizing the class-specific generator neural networks based on contextual data associated with the identified objects. The disclosed system generates a modified synthesized digital image by replacing the identified objects in the synthesized digital images with the synthesized objects.

    GENERATING ANIMATED DIGITAL VIDEOS UTILIZING A CHARACTER ANIMATION NEURAL NETWORK INFORMED BY POSE AND MOTION EMBEDDINGS

    公开(公告)号:US20230123820A1

    公开(公告)日:2023-04-20

    申请号:US17502714

    申请日:2021-10-15

    Applicant: Adobe Inc.

    Abstract: The present disclosure relates to systems, non-transitory computer-readable media, and method that utilize a character animation neural network informed by motion and pose signatures to generate a digital video through person-specific appearance modeling and motion retargeting. In particular embodiments, the disclosed systems implement a character animation neural network that includes a pose embedding model to encode a pose signature into spatial pose features. The character animation neural network further includes a motion embedding model to encode a motion signature into motion features. In some embodiments, the disclosed systems utilize the motion features to refine per-frame pose features and improve temporal coherency. In certain implementations, the disclosed systems also utilize the motion features to demodulate neural network weights used to generate an image frame of a character in motion based on the refined pose features.

    COMPLETE 3D OBJECT RECONSTRUCTION FROM AN INCOMPLETE IMAGE

    公开(公告)号:US20250078406A1

    公开(公告)日:2025-03-06

    申请号:US18242380

    申请日:2023-09-05

    Applicant: Adobe Inc.

    Abstract: A modeling system accesses a two-dimensional (2D) input image displayed via a user interface, the 2D input image depicting, at a first view, a first object. At least one region of the first object is not represented by pixel values of the 2D input image. The modeling system generates, by applying a 3D representation generation model to the 2D input image, a three-dimensional (3D) representation of the first object that depicts an entirety of the first object including the first region. The modeling system displays, via the user interface, the 3D representation, wherein the 3D representation is viewable via the user interface from a plurality of views including the first view.

    DIGITAL IMAGE REPOSING BASED ON MULTIPLE INPUT VIEWS

    公开(公告)号:US20250005812A1

    公开(公告)日:2025-01-02

    申请号:US18215484

    申请日:2023-06-28

    Applicant: Adobe Inc.

    Abstract: In implementations of systems for human reposing based on multiple input views, a computing device implements a reposing system to receive input data describing: input digital images; pluralities of keypoints corresponding to the input digital images, the pluralities of keypoints representing poses of a person depicted in the input digital images; and a plurality of keypoints representing a target pose. The reposing system generates selection masks corresponding to the input digital images by processing the input data using a machine learning model. The selection masks represent likelihoods of spatial correspondence between pixels of an output digital image and portions of the input digital images. The reposing system generates the output digital image depicting the person in the target pose for display in a user interface based on the selection masks and the input data.

    DIGITAL IMAGE REPOSING TECHNIQUES
    10.
    发明申请

    公开(公告)号:US20240428564A1

    公开(公告)日:2024-12-26

    申请号:US18213118

    申请日:2023-06-22

    Applicant: Adobe Inc.

    Abstract: In implementations of systems for generating images for human reposing, a computing device implements a reposing system to receive input data describing an input digital image depicting a person in a first pose, a first plurality of keypoints representing the first pose, and a second plurality of keypoints representing a second pose. The reposing system generates a mapping by processing the input data using a first machine learning model. The mapping indicates a plurality of first portions of the person in the second pose that are visible in the input digital image and a plurality of second portions of the person in the second pose that are invisible in the input digital image. The reposing system generates an output digital image depicting the person in the second pose by processing the mapping, the first plurality of keypoints, and the second plurality of keypoints using a second machine learning model.

Patent Agency Ranking