Patent search ap:("Adobe Inc.") AND inv:"Venkata Naveen Kumar Yadav Marri" Page 1

1.

发明授权
Text adjusted visual search 有权

公开(公告)号：US11907280B2

公开(公告)日：2024-02-20

申请号：US17090150

申请日：2020-11-05

Applicant: ADOBE INC.

Inventor： Mikhail Kotov , Roland Geisler , Saeid Motiian , Dylan Nathaniel Warnock , Michele Saad , Venkata Naveen Kumar Yadav Marri , Ajinkya Kale , Ryan Rozich , Baldo Faieta

IPC: G06F17/00 , G06F7/00 , G06F16/532 , G06F16/2457 , G06F16/538 , G06F16/583

CPC classification number: G06F16/532 , G06F16/24578 , G06F16/538 , G06F16/5846

Abstract: Embodiments of the technology described herein, provide improved visual search results by combining a visual similarity and a textual similarity between images. In an embodiment, the visual similarity is quantified as a visual similarity score and the textual similarity is quantified as a textual similarity score. The textual similarity is determined based on text, such as a title, associated with the image. The overall similarity of two images is quantified as a weighted combination of the textual similarity score and the visual similarity score. In an embodiment, the weighting between the textual similarity score and the visual similarity score is user configurable through a control on the search interface. In one embodiment, the aggregate similarity score is the sum of a weighted visual similarity score and a weighted textual similarity score.

2.

发明申请
TEXT ADJUSTED VISUAL SEARCH 有权

公开(公告)号：US20220138247A1

公开(公告)日：2022-05-05

申请号：US17090150

申请日：2020-11-05

Applicant: ADOBE INC.

Inventor： Mikhail Kotov , Roland Geisler , Saeid Motiian , Dylan Nathaniel Warnock , Michele Saad , Venkata Naveen Kumar Yadav Marri , Ajinkya Kale , Ryan Rozich , Baldo Faieta

IPC: G06F16/532 , G06F16/583 , G06F16/538 , G06F16/2457

Abstract: Embodiments of the technology described herein, provide improved visual search results by combining a visual similarity and a textual similarity between images. In an embodiment, the visual similarity is quantified as a visual similarity score and the textual similarity is quantified as a textual similarity score. The textual similarity is determined based on text, such as a title, associated with the image. The overall similarity of two images is quantified as a weighted combination of the textual similarity score and the visual similarity score. In an embodiment, the weighting between the textual similarity score and the visual similarity score is user configurable through a control on the search interface. In one embodiment, the aggregate similarity score is the sum of a weighted visual similarity score and a weighted textual similarity score.

3.

发明授权
Concept disambiguation using multimodal embeddings 有权

公开(公告)号：US12249116B2

公开(公告)日：2025-03-11

申请号：US17656147

申请日：2022-03-23

Applicant: ADOBE INC.

Inventor： Venkata Naveen Kumar Yadav Marri , Ajinkya Gorakhnath Kale

IPC: G06V10/771 , G06N3/088 , G06V10/74 , G06V10/77 , G06V10/774 , G06V10/82

Abstract: Systems and methods for image processing are described. Embodiments of the present disclosure identify a plurality of candidate concepts in a knowledge graph (KG) that correspond to an image tag of an image; generate an image embedding of the image using a multi-modal encoder; generate a concept embedding for each of the plurality of candidate concepts using the multi-modal encoder; select a matching concept from the plurality of candidate concepts based on the image embedding and the concept embedding; and generate association data between the image and the matching concept.

4.

发明申请
COLOR CONDITIONED DIFFUSION PRIOR 有权

公开(公告)号：US20240404144A1

公开(公告)日：2024-12-05

申请号：US18329111

申请日：2023-06-05

Applicant: ADOBE INC.

Inventor： Pranav Vineet Aggarwal , Venkata Naveen Kumar Yadav Marri , Midhun Harikumar , Sachin Madhav Kelkar , Hareesh Ravi , Ajinkya Gorakhnath Kale

IPC: G06T11/60 , G06F40/40 , G06N3/045 , G06N3/08 , G06T11/00

Abstract: Systems and methods for image processing are described. Embodiments of the present disclosure, via a multi-modal encoder of an image processing apparatus, encodes a text prompt to obtain a text embedding. A color encoder of the image processing apparatus encodes a color prompt to obtain a color embedding. A diffusion prior model of the image processing apparatus generates an image embedding based on the text embedding and the color embedding. A latent diffusion model of the image processing apparatus generates an image based on the image embedding, where the image includes an element from the text prompt and a color from the color prompt.

5.

发明申请
STYLE-BASED IMAGE GENERATION 有权

公开(公告)号：US20250117973A1

公开(公告)日：2025-04-10

申请号：US18903151

申请日：2024-10-01

Applicant: ADOBE INC.

Inventor： Fengbin Chen , Midhun Harikumar , Ajinkya Gorakhnath Kale , Hareesh Ravi , Venkata Naveen Kumar Yadav Marri

IPC: G06T11/00

Abstract: A method, apparatus, non-transitory computer readable medium, and system for media processing includes obtaining a text prompt and a style input, where the text prompt describes image content and the style input describes an image style, generating a text embedding based on the text prompt, where the text embedding represents the image content, generating a style embedding based on the style input, where the style embedding represents the image style, and generating a synthetic image based on the text embedding and the style embedding, where the text embedding is provided to the image generation model at a first step and the style embedding is provided to the image generation model at a second step after the first step.

6.

发明申请
WEAK SUPERVISED TRAINING DATA FOR IMAGE TAGGING MODELS 有权

公开(公告)号：US20240378863A1

公开(公告)日：2024-11-14

申请号：US18313642

申请日：2023-05-08

Applicant: ADOBE INC.

Inventor： Venkata Naveen Kumar Yadav Marri , Ajinkya Gorakhnath Kale

IPC: G06V10/774 , G06T9/00 , G06V20/70

Abstract: Systems and methods for image tagging are provided. One aspect of the systems and methods includes encoding an image and a tag of the image using a multimodal encoder to obtain an image embedding and a text embedding, respectively. Another aspect of the systems and methods includes generating training data for a machine learning model by filtering a plurality of image-tag pairs based on a similarity between the image embedding and the text embedding. Another aspect of the systems and methods includes training the machine learning model using the training data.

7.

发明公开
UTILIZING A DIFFUSION NEURAL NETWORK FOR MASK AWARE IMAGE AND TYPOGRAPHY EDITING 审中-公开

公开(公告)号：US20240355018A1

公开(公告)日：2024-10-24

申请号：US18303898

申请日：2023-04-20

Applicant: Adobe Inc.

Inventor： Pranav Aggarwal , Hareesh Ravi , Midhun Harikumar , Ajinkya Gorakhnath Kale , Fengbin Chen , Venkata Naveen Kumar Yadav Marri

IPC: G06T11/60 , G06T5/00 , G06T5/50 , G06T7/11 , G06T7/50 , G06T13/00

CPC classification number: G06T11/60 , G06T5/50 , G06T5/70 , G06T7/11 , G06T7/50 , G06T13/00 , G06T2200/24 , G06T2207/20021 , G06T2207/20081 , G06T2207/20084 , G06T2207/20092 , G06T2207/20212

Abstract: The present disclosure relates to systems, methods, and non-transitory computer readable media for utilizing a diffusion neural network for mask aware image and typography editing. For example, in one or more embodiments the disclosed systems utilize a text-image encoder to generate a base image embedding from a base digital image. Moreover, the disclosed systems generate a mask-segmented image by combining a shape mask with the base digital image. In one or more implementations, the disclosed systems utilize noising steps of a diffusion noising model to generate a mask-segmented image noise map from the mask-segmented image. Furthermore, the disclosed systems utilize a diffusion neural network to create a stylized image corresponding to the shape mask from the base image embedding and the mask-segmented image noise map.

8.

发明公开
CONCEPT DISAMBIGUATION USING MULTIMODAL EMBEDDINGS 审中-公开

公开(公告)号：US20230326178A1

公开(公告)日：2023-10-12

申请号：US17656147

申请日：2022-03-23

Applicant: ADOBE INC.

Inventor： Venkata Naveen Kumar Yadav Marri , Ajinkya Gorakhnath Kale

IPC: G06N3/08 , G06V10/74 , G06V10/771 , G06V10/77 , G06V10/82 , G06V10/774

CPC classification number: G06V10/761 , G06N3/088 , G06V10/771 , G06V10/7715 , G06V10/774 , G06V10/82

Abstract: Systems and methods for image processing are described. Embodiments of the present disclosure identify a plurality of candidate concepts in a knowledge graph (KG) that correspond to an image tag of an image; generate an image embedding of the image using a multi-modal encoder; generate a concept embedding for each of the plurality of candidate concepts using the multi-modal encoder; select a matching concept from the plurality of candidate concepts based on the image embedding and the concept embedding; and generate association data between the image and the matching concept.

9.

发明公开
PRIOR GUIDED LATENT DIFFUSION 审中-公开

公开(公告)号：US20240346629A1

公开(公告)日：2024-10-17

申请号：US18301671

申请日：2023-04-17

Applicant: ADOBE INC.

Inventor： Midhun Harikumar , Venkata Naveen Kumar Yadav Marri , Ajinkya Gorakhnath Kale , Pranav Vineet Aggarwal , Vinh Ngoc Khuc

IPC: G06T5/00 , G06F40/279 , G06T5/50

CPC classification number: G06T5/73 , G06F40/279 , G06T5/50

Abstract: Systems and methods for image processing are described. Embodiments of the present disclosure obtain a text prompt for text guided image generation. A multi-modal encoder of an image processing apparatus encodes the text prompt to obtain a text embedding. A diffusion prior model of the image processing apparatus converts the text embedding to an image embedding. A latent diffusion model of the image processing apparatus generates an image based on the image embedding, wherein the image includes an element described by the text prompt.

10.

发明公开
MULTILINGUAL TEXT-TO-IMAGE GENERATION 审中-公开

公开(公告)号：US20240338859A1

公开(公告)日：2024-10-10

申请号：US18296002

申请日：2023-04-05

Applicant: ADOBE INC.

Inventor： Venkata Naveen Kumar Yadav Marri , Ajinkya Gorakhnath Kale

IPC: G06T11/00 , G06F40/58 , G06V10/74 , G06V10/774 , G06V10/82

CPC classification number: G06T11/00 , G06F40/58 , G06V10/74 , G06V10/774 , G06V10/82

Abstract: Systems and methods for image processing are provided. One aspect of the systems and methods includes obtaining a text prompt in a first language. Another aspect of the systems and methods includes encoding the text prompt using a multilingual encoder to obtain a multilingual text embedding. Yet another aspect of the systems and methods includes processing the multilingual text embedding using a diffusion prior model to obtain an image embedding, wherein the diffusion prior model is trained to process multilingual text embeddings from the first language and a second language based on training data from the first language and the second language. Yet another aspect of the systems and methods includes generating an image using a diffusion model based on the image embedding, wherein the image includes an element corresponding to the text prompt.

Search Results

Country/Region

Patent validity

Application date

Publication (announcement) day

applicant

The country/region where the applicant is located

Inventor

IPC

IPC Department

IPC class

IPC subclass

IPC group

IPC team

Appearance classification