CAPTURING DIGITAL IMAGES UTILIZING A MACHINE LEARNING MODEL TRAINED TO DETERMINE SUBTLE POSE DIFFERENTIATIONS

    Publication No.: US20250069437A1

    Publication date: 2025-02-27

    Application No.: US18948067

    Filing date: 2024-11-14

    Applicant: Adobe Inc.

    Abstract: The present disclosure describes systems, non-transitory computer-readable media, and methods for utilizing a machine learning model trained to determine subtle pose differentiations to analyze a repository of captured digital images of a particular user to automatically capture digital images portraying the user. For example, the disclosed systems can utilize a convolutional neural network to determine a pose/facial expression similarity metric between a sample digital image from a camera viewfinder stream of a client device and one or more previously captured digital images portraying the user. The disclosed systems can determine that the similarity metric satisfies a similarity threshold, and automatically capture a digital image utilizing a camera device of the client device. Thus, the disclosed systems can automatically and efficiently capture digital images, such as selfies, that accurately match previous digital images portraying a variety of unique facial expressions specific to individual users.
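
    Illustrative sketch (not part of the patent record): the thresholding step described in the abstract could look roughly like the Python below. The embedding vectors, the choice of cosine similarity, the `should_capture` name, and the 0.9 default threshold are all assumptions made for illustration; the abstract does not specify the metric or the threshold value.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def should_capture(frame_embedding, reference_embeddings, threshold=0.9):
    """Return True when the viewfinder frame is close enough to any stored image.

    frame_embedding: hypothetical pose/expression embedding of the current frame.
    reference_embeddings: hypothetical embeddings of previously captured images.
    threshold: assumed similarity threshold; the real value is not given in the source.
    """
    best = max(
        (cosine_similarity(frame_embedding, ref) for ref in reference_embeddings),
        default=0.0,  # no stored images means nothing to match against
    )
    return best >= threshold
```

    In this sketch a caller would compute an embedding for each sampled viewfinder frame and trigger the camera only when `should_capture` returns True.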

    IMAGE RELIGHTING
    Invention application

    Publication No.: US20250069299A1

    Publication date: 2025-02-27

    Application No.: US18452827

    Filing date: 2023-08-21

    Applicant: ADOBE INC.

    Abstract: One or more aspects of a method, apparatus, and non-transitory computer readable medium include obtaining an input latent vector for an image generation network and a target lighting representation. A modified latent vector is generated based on the input latent vector and the target lighting representation, and the image generation network generates an image based on the modified latent vector.

    Generating embeddings for text and image queries within a common embedding space for visual-text image searches

    Publication No.: US12235891B2

    Publication date: 2025-02-25

    Application No.: US17809503

    Filing date: 2022-06-28

    Applicant: Adobe Inc.

    Abstract: Systems, methods, and non-transitory computer-readable media implement related image search and image modification processes using various search engines and a consolidated graphical user interface. For instance, one or more embodiments involve receiving an input digital image and search input and further modifying the input digital image using the image search results retrieved in response to the search input. In some cases, the search input includes a multi-modal search input having multiple queries (e.g., an image query and a text query), and one or more embodiments involve retrieving the image search results utilizing a weighted combination of the queries. Some implementations involve generating an input embedding for the search input (e.g., the multi-modal search input) and retrieving the image search results using the input embedding.
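
    Illustrative sketch (not part of the patent record): one plausible reading of "a weighted combination of the queries" is a weighted sum of the image-query and text-query embeddings in the common embedding space, followed by a nearest-neighbor lookup. The function names, the `image_weight` parameter, and the dot-product ranking below are assumptions for illustration only.

```python
import math


def normalize(v):
    """Scale a vector to unit length (a zero vector is returned unchanged)."""
    n = math.sqrt(sum(x * x for x in v))
    return [x / n for x in v] if n else list(v)


def combine_queries(image_emb, text_emb, image_weight=0.5):
    """Blend two query embeddings; image_weight in [0, 1] is a hypothetical knob."""
    if not 0.0 <= image_weight <= 1.0:
        raise ValueError("image_weight must be between 0 and 1")
    w = image_weight
    mixed = [
        w * i + (1.0 - w) * t
        for i, t in zip(normalize(image_emb), normalize(text_emb))
    ]
    return normalize(mixed)


def search(query_emb, index, top_k=3):
    """Rank indexed images by dot product with the combined query embedding.

    index: dict mapping an image name to its embedding (assumed structure).
    """
    def score(item):
        _, emb = item
        return sum(q * x for q, x in zip(query_emb, normalize(emb)))

    ranked = sorted(index.items(), key=score, reverse=True)
    return [name for name, _ in ranked[:top_k]]
```

    Raising `image_weight` pulls the results toward the image query; lowering it favors the text query.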

    Visualizing vector graphics in three-dimensional scenes

    Publication No.: US12229892B2

    Publication date: 2025-02-18

    Application No.: US18157940

    Filing date: 2023-01-23

    Applicant: Adobe Inc.

    Abstract: In implementations of systems for visualizing vector graphics in three-dimensional scenes, a computing device implements a projection system to receive input data describing a digital image depicting a three-dimensional scene and a vector graphic to be projected into the three-dimensional scene. The projection system generates a depth image by estimating disparity values for pixels of the digital image. A three-dimensional mesh is computed that approximates the three-dimensional scene based on the depth image. The projection system projects the vector graphic onto the digital image by transforming the vector graphic based on the three-dimensional mesh.
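
    Illustrative sketch (not part of the patent record): the disparity-to-depth and mesh steps in the abstract can be pictured with the Python below. It assumes depth is inversely proportional to disparity up to a `scale` factor and builds a simple regular-grid triangle mesh with one vertex per pixel; the patent's actual depth model and meshing method are not described in the abstract, and all names here are hypothetical.

```python
def disparity_to_depth(disparity, scale=1.0, eps=1e-6):
    """Convert a 2D list of disparity values to depth, assuming depth = scale / disparity."""
    return [[scale / max(d, eps) for d in row] for row in disparity]


def depth_to_mesh(depth):
    """Build a grid mesh from a depth image.

    Returns (vertices, triangles): vertices are (x, y, z) tuples in row-major
    order, and each 2x2 pixel block contributes two triangles of vertex indices.
    """
    h, w = len(depth), len(depth[0])
    vertices = [(x, y, depth[y][x]) for y in range(h) for x in range(w)]
    triangles = []
    for y in range(h - 1):
        for x in range(w - 1):
            i = y * w + x  # index of the block's top-left vertex
            triangles.append((i, i + 1, i + w))
            triangles.append((i + 1, i + w + 1, i + w))
    return vertices, triangles
```

    A vector graphic could then be transformed by mapping its control points onto this mesh, which is the step the abstract attributes to the projection system.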

    DIGITAL IMAGE INPAINTING UTILIZING GLOBAL AND LOCAL MODULATION LAYERS OF AN INPAINTING NEURAL NETWORK

    Publication No.: US20250054116A1

    Publication date: 2025-02-13

    Application No.: US18929330

    Filing date: 2024-10-28

    Applicant: Adobe Inc.

    Abstract: The present disclosure relates to systems, methods, and non-transitory computer readable media that generate inpainted digital images utilizing a cascaded modulation inpainting neural network. For example, the disclosed systems utilize a cascaded modulation inpainting neural network that includes cascaded modulation decoder layers. Specifically, in one or more decoder layers, the disclosed systems start with global code modulation that captures the global-range image structures followed by an additional modulation that refines the global predictions. Accordingly, in one or more implementations, the image inpainting system provides a mechanism to correct distorted local details. Furthermore, in one or more implementations, the image inpainting system leverages fast Fourier convolution blocks within different resolution layers of the encoder architecture to expand the receptive field of the encoder and to allow the network encoder to better capture global structure.

    GRID STRUCTURE CONTROL IN DIGITAL CONTENT

    Publication No.: US20250053285A1

    Publication date: 2025-02-13

    Application No.: US18538802

    Filing date: 2023-12-13

    Applicant: Adobe Inc.

    Abstract: Grid structure control techniques and systems are described that support use of a flexible grid structure and layout control. In an implementation, an input is received via a user interface, the user interface displaying a grid having a plurality of grid cells. A determination is made as to whether the input corresponds to a first said grid cell of the grid. Responsive to determining, by a processing device, that the input corresponds to the first said grid cell, a first edge of the first grid cell and a second edge of a second grid cell are moved to follow the input. The second edge is disposed opposite and proximal to the first edge.
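
    Illustrative sketch (not part of the patent record): the paired edge movement described in the abstract can be modeled in one dimension as two neighboring cells whose facing edges follow the pointer together. The `Cell` class, the `drag_shared_edge` function, and the `min_width` clamp are assumptions introduced for illustration; the abstract does not describe any clamping.

```python
from dataclasses import dataclass


@dataclass
class Cell:
    left: float   # x position of the cell's left edge
    right: float  # x position of the cell's right edge


def drag_shared_edge(first, second, pointer_x, min_width=1.0):
    """Move first.right and second.left so both follow the pointer.

    The gap between the two edges is preserved, and the pointer position is
    clamped so that neither cell becomes narrower than min_width (an assumed rule).
    """
    gap = second.left - first.right
    lowest = first.left + min_width
    highest = second.right - min_width - gap
    x = min(max(pointer_x, lowest), highest)
    first.right = x
    second.left = x + gap
```

    For example, dragging the boundary between `Cell(0, 100)` and `Cell(100, 200)` to x = 130 leaves the first cell spanning 0 to 130 and the second spanning 130 to 200.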

    GENERATING COLLAGE DIGITAL IMAGES BY COMBINING SCENE LAYOUTS AND PIXEL COLORS UTILIZING GENERATIVE NEURAL NETWORKS

    Publication No.: US20250045994A1

    Publication date: 2025-02-06

    Application No.: US18924508

    Filing date: 2024-10-23

    Applicant: Adobe Inc.

    Abstract: The present disclosure relates to systems, methods, and non-transitory computer readable media for generating digital images depicting photorealistic scenes utilizing a digital image collaging neural network. For example, the disclosed systems utilize a digital image collaging neural network having a particular architecture for disentangling generation of scene layouts and pixel colors for different regions of a digital image. In some cases, the disclosed systems break down the process of generating a collage digital image into generating images representing different regions, such as a background and a foreground, to be collaged into a final result. For example, utilizing the digital image collaging neural network, the disclosed systems determine scene layouts and pixel colors for both foreground digital images and background digital images to ultimately collage the foreground and background together into a collage digital image depicting a real-world scene.

    GENERATING MULTISTATE VECTOR OBJECTS FOR VARIANT REVIEW AND APPLICATION

    Publication No.: US20250045696A1

    Publication date: 2025-02-06

    Application No.: US18365704

    Filing date: 2023-08-04

    Applicant: Adobe Inc.

    Abstract: The present disclosure is directed toward systems, methods, and non-transitory computer readable media that provide a graphical review interface for curating, reviewing, and approving digital design element variants utilizing multistate vector objects within a digital image. The disclosed systems generate, in response to a user interaction performed at a designer device, a digital image comprising a multistate vector object modifiable to depict variants of a graphical element within the digital image. Further, the disclosed systems provide the digital image for display within a variant review interface on a client device for reviewing the variants of the graphical element. Moreover, the disclosed systems receive, from the client device, an indication of a selected variant from among the variants indicated by the multistate vector object. The disclosed systems also generate a modified digital image reflecting the selected variant and send the modified digital image reflecting the selected variant to the designer device.

    3D modeling user interfaces by introducing improved coordinates for triquad cages

    Publication No.: US12217364B2

    Publication date: 2025-02-04

    Application No.: US17947035

    Filing date: 2022-09-16

    Applicant: Adobe Inc.

    Abstract: A modeling system displays a three-dimensional (3D) space including a 3D object including a plurality of points and a cage model of the 3D object including a first configuration of vertices and quad faces. Each of the plurality of points is located at a respective initial location. The modeling system generates cage coordinates for the cage model including a vertex coordinate for each vertex of the cage model and four quad coordinates for each quad face of the cage model corresponding to each corner vertex of the quad. The modeling system deforms, responsive to receiving a request, the cage model to change the first configuration of vertices to a second configuration. The modeling system generates, based on the cage coordinates, the first configuration of vertices, and the second configuration of vertices, an updated 3D object by determining a subsequent location for each of the plurality of points.
