EXTRACTING ATTRIBUTES FROM ARBITRARY DIGITAL IMAGES UTILIZING A MULTI-ATTRIBUTE CONTRASTIVE CLASSIFICATION NEURAL NETWORK

    公开(公告)号:US20250022252A1

    公开(公告)日:2025-01-16

    申请号:US18899571

    申请日:2024-09-27

    Applicant: Adobe Inc.

    Abstract: This disclosure describes one or more implementations of systems, non-transitory computer-readable media, and methods that extract multiple attributes from an object portrayed in a digital image utilizing a multi-attribute contrastive classification neural network. For example, the disclosed systems utilize a multi-attribute contrastive classification neural network that includes an embedding neural network, a localizer neural network, a multi-attention neural network, and a classifier neural network. In some cases, the disclosed systems train the multi-attribute contrastive classification neural network utilizing a multi-attribute, supervised-contrastive loss. In some embodiments, the disclosed systems generate negative attribute training labels for labeled digital images utilizing positive attribute labels that correspond to the labeled digital images.

    SYSTEMS AND METHODS FOR IMAGE COMPOSITING

    公开(公告)号:US20250022099A1

    公开(公告)日:2025-01-16

    申请号:US18351838

    申请日:2023-07-13

    Applicant: ADOBE INC.

    Abstract: Systems and methods for image compositing are provided. An aspect of the systems and methods includes obtaining a first image and a second image, wherein the first image includes a target location and the second image includes a target element; encoding the second image using an image encoder to obtain an image embedding; generating a descriptive embedding based on the image embedding using an adapter network; and generating a composite image based on the descriptive embedding and the first image using an image generation model, wherein the composite image depicts the target element from the second image at the target location of the first image.

    Digital image inpainting utilizing a cascaded modulation inpainting neural network

    公开(公告)号:US12165295B2

    公开(公告)日:2024-12-10

    申请号:US17661985

    申请日:2022-05-04

    Applicant: Adobe Inc.

    Abstract: The present disclosure relates to systems, methods, and non-transitory computer readable media that generate inpainted digital images utilizing a cascaded modulation inpainting neural network. For example, the disclosed systems utilize a cascaded modulation inpainting neural network that includes cascaded modulation decoder layers. For example, in one or more decoder layers, the disclosed systems start with global code modulation that captures the global-range image structures followed by an additional modulation that refines the global predictions. Accordingly, in one or more implementations, the image inpainting system provides a mechanism to correct distorted local details. Furthermore, in one or more implementations, the image inpainting system leverages fast Fourier convolutions block within different resolution layers of the encoder architecture to expand the receptive field of the encoder and to allow the network encoder to better capture global structure.

    Applying object-aware style transfer to digital images

    公开(公告)号:US12154196B2

    公开(公告)日:2024-11-26

    申请号:US17810392

    申请日:2022-07-01

    Applicant: Adobe Inc.

    Abstract: The present disclosure relates to systems, non-transitory computer-readable media, and methods for transferring global style features between digital images utilizing one or more machine learning models or neural networks. In particular, in one or more embodiments, the disclosed systems receive a request to transfer a global style from a source digital image to a target digital image, identify at least one target object within the target digital image, and transfer the global style from the source digital image to the target digital image while maintaining an object style of the at least one target object.

    DILATING OBJECT MASKS TO REDUCE ARTIFACTS DURING INPAINTING

    公开(公告)号:US20240169501A1

    公开(公告)日:2024-05-23

    申请号:US18058601

    申请日:2022-11-23

    Applicant: Adobe Inc.

    Abstract: The present disclosure relates to systems, methods, and non-transitory computer-readable media that modify digital images via scene-based editing using image understanding facilitated by artificial intelligence. For instance, in one or more embodiments, the disclosed systems generate, utilizing a segmentation neural network and without user input, object masks for objects in a digital image. The disclosed systems determine foreground and background abutting an object mask. The disclosed systems generate an expanded object mask by expanding the object mask into the foreground abutting the object mask by a first amount and expanding the object mask into the background abutting the object mask by a second amount that differs from the first amount. The disclosed systems inpaint a hole corresponding to the expanded object mask utilizing an inpainting neural network.

    RETRIEVING DIGITAL IMAGES IN RESPONSE TO SEARCH QUERIES FOR SEARCH-DRIVEN IMAGE EDITING

    公开(公告)号:US20240004924A1

    公开(公告)日:2024-01-04

    申请号:US17809781

    申请日:2022-06-29

    Applicant: Adobe Inc.

    CPC classification number: G06F16/538 G06F16/532 G06F16/5838 G06T5/50 G06T7/11

    Abstract: The present disclosure relates to systems, methods, and non-transitory computer-readable media that implements related image search and image modification processes using various search engines and a consolidated graphical user interface. For instance, in one or more embodiments, the disclosed systems receive an input digital image and search input and further modify the input digital image using the image search results retrieved in response to the search input. In some cases, the search input includes a multi-modal search input having multiple queries (e.g., an image query and a text query), and the disclosed systems retrieve the image search results utilizing a weighted combination of the queries. In some implementations, the disclosed systems generate an input embedding for the search input (e.g., the multi-modal search input) and retrieve the image search results using the input embedding.

    GENERATING UNIFIED EMBEDDINGS FROM MULTI-MODAL CANVAS INPUTS FOR IMAGE RETRIEVAL

    公开(公告)号:US20230419571A1

    公开(公告)日:2023-12-28

    申请号:US17809494

    申请日:2022-06-28

    Applicant: Adobe Inc.

    Abstract: The present disclosure relates to systems, methods, and non-transitory computer-readable media that implements related image search and image modification processes using various search engines and a consolidated graphical user interface. For instance, in one or more embodiments, the disclosed systems receive an input digital image and search input and further modify the input digital image using the image search results retrieved in response to the search input. In some cases, the search input includes a multi-modal search input having multiple queries (e.g., an image query and a text query), and the disclosed systems retrieve the image search results utilizing a weighted combination of the queries. In some implementations, the disclosed systems generate an input embedding for the search input (e.g., the multi-modal search input) and retrieve the image search results using the input embedding.

    RECOMMENDING OBJECTS FOR IMAGE COMPOSITION USING GEOMETRY-AND-LIGHTING AWARE SEARCH AND EFFICIENT USER INTERFACE WORKFLOWS

    公开(公告)号:US20230325992A1

    公开(公告)日:2023-10-12

    申请号:US17658774

    申请日:2022-04-11

    Applicant: Adobe Inc.

    CPC classification number: G06T5/50 G06T3/60 G06T7/194

    Abstract: The present disclosure relates to systems, methods, and non-transitory computer readable media that utilizes artificial intelligence to learn to recommend foreground object images for use in generating composite images based on geometry and/or lighting features. For instance, in one or more embodiments, the disclosed systems transform a foreground object image corresponding to a background image using at least one of a geometry transformation or a lighting transformation. The disclosed systems further generating predicted embeddings for the background image, the foreground object image, and the transformed foreground object image within a geometry-lighting-sensitive embedding space utilizing a geometry-lighting-aware neural network. Using a loss determined from the predicted embeddings, the disclosed systems update parameters of the geometry-lighting-aware neural network. The disclosed systems further provide a variety of efficient user interfaces for generating composite digital images.

Patent Agency Ranking