Patent search ap:("Adobe Inc.") AND inv:"Pranav Vineet Aggarwal" Page 1

1.

发明授权
Multi-modal differential search with real-time focus adaptation 有权

公开(公告)号：US11604822B2

公开(公告)日：2023-03-14

申请号：US16426369

申请日：2019-05-30

Applicant: Adobe Inc.

Inventor： Pranav Vineet Aggarwal , Zhe Lin , Baldo Antonio Faieta , Saeid Motiian

IPC: G06F7/00 , G06F16/583 , G06N3/084 , G06N20/00 , G06F16/538 , G06F16/532 , G06F16/33 , G06F3/04855

Abstract: Multi-modal differential search with real-time focus adaptation techniques are described that overcome the challenges of conventional techniques in a variety of ways. In one example, a model is trained to support a visually guided machine-learning embedding space that supports visual intuition as to “what” is represented by text. The visually guided language embedding space supported by the model, once trained, may then be used to support visual intuition as part of a variety of functionality. In one such example, the visually guided language embedding space as implemented by the model may be leveraged as part of a multi-modal differential search to support search of digital images and other digital content with real-time focus adaptation which overcomes the challenges of conventional techniques.

2.

发明授权
Unified framework for multi-modal similarity search 有权

公开(公告)号：US12299055B2

公开(公告)日：2025-05-13

申请号：US17887694

申请日：2022-08-15

Applicant: Adobe Inc.

Inventor： Pranav Vineet Aggarwal , Ali Aminian , Ajinkya Gorakhnath Kale , Aashish Kumar Misraa

IPC: G06F7/00 , G06F16/903 , G06F16/908 , G06F16/953 , G06F18/2413 , G06N20/00 , G06V10/426 , G06V10/764

Abstract: Technology is disclosed herein for enhanced similarity search. In an implementation, a search environment includes one or more computing hardware, software, and/or firmware components in support of enhanced similarity search. The one or more components identify a modality for a similarity search with respect to a query object. The components generate an embedding for the query object based on the modality and based on connections between the query object and neighboring nodes in a graph. The embedding for the query object provides the basis for the search for similar objects.

3.

发明公开
PRIOR GUIDED LATENT DIFFUSION 审中-公开

公开(公告)号：US20240346629A1

公开(公告)日：2024-10-17

申请号：US18301671

申请日：2023-04-17

Applicant: ADOBE INC.

Inventor： Midhun Harikumar , Venkata Naveen Kumar Yadav Marri , Ajinkya Gorakhnath Kale , Pranav Vineet Aggarwal , Vinh Ngoc Khuc

IPC: G06T5/00 , G06F40/279 , G06T5/50

CPC classification number: G06T5/73 , G06F40/279 , G06T5/50

Abstract: Systems and methods for image processing are described. Embodiments of the present disclosure obtain a text prompt for text guided image generation. A multi-modal encoder of an image processing apparatus encodes the text prompt to obtain a text embedding. A diffusion prior model of the image processing apparatus converts the text embedding to an image embedding. A latent diffusion model of the image processing apparatus generates an image based on the image embedding, wherein the image includes an element described by the text prompt.

4.

发明授权
Visually guided machine-learning language model 有权

公开(公告)号：US11605019B2

公开(公告)日：2023-03-14

申请号：US16426298

申请日：2019-05-30

Applicant: Adobe Inc.

Inventor： Pranav Vineet Aggarwal , Zhe Lin , Baldo Antonio Faieta , Saeid Motiian

IPC: G06K9/00 , G06N20/00 , G06K9/62 , G06N3/08 , G06F16/538 , G10L15/22 , G06F40/30

Abstract: Visually guided machine-learning language model and embedding techniques are described that overcome the challenges of conventional techniques in a variety of ways. In one example, a model is trained to support a visually guided machine-learning embedding space that supports visual intuition as to “what” is represented by text. The visually guided language embedding space supported by the model, once trained, may then be used to support visual intuition as part of a variety of functionality. In one such example, the visually guided language embedding space as implemented by the model may be leveraged as part of a multi-modal differential search to support search of digital images and other digital content with real-time focus adaptation which overcomes the challenges of conventional techniques.

5.

发明申请
UNIFIED FRAMEWORK FOR MULTI-MODAL SIMILARITY SEARCH 有权

公开(公告)号：US20210326393A1

公开(公告)日：2021-10-21

申请号：US16854697

申请日：2020-04-21

Applicant: Adobe Inc.

Inventor： Pranav Vineet Aggarwal , Ali Aminian , Ajinkya Gorakhnath Kale , Aashish Kumar Misraa

IPC: G06F16/903 , G06N20/00 , G06K9/62 , G06F16/908

Abstract: Technology is disclosed herein for enhanced similarity search. In an implementation, a search environment includes one or more computing hardware, software, and/or firmware components in support of enhanced similarity search. The one or more components identify a modality for a similarity search with respect to a query object. The components generate an embedding for the query object based on the modality and based on connections between the query object and neighboring nodes in a graph. The embedding for the query object provides the basis for the search for similar objects

6.

发明申请
Visually Guided Machine-learning Language Model 审中-公开

公开(公告)号：US20200380403A1

公开(公告)日：2020-12-03

申请号：US16426298

申请日：2019-05-30

Applicant: Adobe Inc.

Inventor： Pranav Vineet Aggarwal , Zhe Lin , Baldo Antonio Faieta , Saeid Motiian

IPC: G06N20/00 , G06K9/62 , G06F16/538 , G06N3/08

Abstract: Visually guided machine-learning language model and embedding techniques are described that overcome the challenges of conventional techniques in a variety of ways. In one example, a model is trained to support a visually guided machine-learning embedding space that supports visual intuition as to “what” is represented by text. The visually guided language embedding space supported by the model, once trained, may then be used to support visual intuition as part of a variety of functionality. In one such example, the visually guided language embedding space as implemented by the model may be leveraged as part of a multi-modal differential search to support search of digital images and other digital content with real-time focus adaptation which overcomes the challenges of conventional techniques.

7.

发明授权
Unsupervised style and color cues for transformer-based image generation 有权

公开(公告)号：US12277630B2

公开(公告)日：2025-04-15

申请号：US17662560

申请日：2022-05-09

Applicant: ADOBE INC.

Inventor： Pranav Vineet Aggarwal , Midhun Harikumar , Ajinkya Gorakhnath Kale

IPC: G06T11/40 , G06N3/045 , G06N3/088 , G06T7/13

Abstract: Systems and methods for image processing are configured. Embodiments of the present disclosure identify target style attributes and target structure attributes for a composite image; generate a matrix of composite feature tokens based on the target style attributes and the target structure attributes, wherein subsequent feature tokens of the matrix of composite feature tokens are sequentially generated based on previous feature tokens of the matrix of composite feature tokens according to a linear ordering of the matrix of composite feature tokens; and generate the composite image based on the matrix of composite feature tokens, wherein the composite image includes the target style attributes and the target structure attributes.

8.

发明申请
UPSIDE-DOWN REINFORCEMENT LEARNING FOR IMAGE GENERATION MODELS 有权

公开(公告)号：US20250117967A1

公开(公告)日：2025-04-10

申请号：US18443590

申请日：2024-02-16

Applicant: ADOBE INC.

Inventor： Ritiz Tambi , Purvak Lapsiya , Ajinkya Gorakhnath Kale , Pranav Vineet Aggarwal , Nikolaos Vlassis

IPC: G06T11/00 , G06F40/166 , G06V10/764

Abstract: A method, apparatus, non-transitory computer readable media, and system for image generation include obtaining an input text prompt and an indication of a level of a target characteristic, where the target characteristic comprises a characteristic used to train an image generation model. Some embodiments generate an augmented text prompt comprising the input text and an objective text corresponding to the level of the target characteristic. Some embodiments generate, using the image generation model, an image based on the augmented text prompt, where the image depicts content of the input text prompt and has the level of the target characteristic.

9.

发明公开
STYLIZED MOTION EFFECTS 审中-公开

公开(公告)号：US20240037881A1

公开(公告)日：2024-02-01

申请号：US17814940

申请日：2022-07-26

Applicant: ADOBE INC.

Inventor： Pranav Vineet Aggarwal , Alvin Ghouas , Ajinkya Gorakhnath Kale

IPC: G06T19/20 , G06T7/11 , G06T5/00 , G06T5/50 , G06T7/194

CPC classification number: G06T19/20 , G06T7/11 , G06T5/005 , G06T5/50 , G06T7/194 , G06T2207/20224 , G06T2207/20021 , G06T2210/62 , G06T2200/24 , G06T2207/20092 , G06T2207/20084 , G06T2219/2024 , G06T2219/2004

Abstract: Systems and methods for image processing are described. Embodiments of the present disclosure receive a first image depicting a scene and a second image that includes a style; segment the first image to obtain a first segment and a second segment, wherein the first segment has a shape of an object in the scene; apply a style transfer network to the first segment and the second image to obtain a first image part, wherein the first image part has the shape of the object and the style from the second image; combine the first image part with a second image part corresponding to the second segment to obtain a combined image; and apply a lenticular effect to the combined image to obtain an output image.

10.

发明公开
UNSUPERVISED STYLE AND COLOR CUES FOR TRANSFORMER-BASED IMAGE GENERATION 审中-公开

公开(公告)号：US20230360294A1

公开(公告)日：2023-11-09

申请号：US17662560

申请日：2022-05-09

Applicant: ADOBE INC.

Inventor： Pranav Vineet Aggarwal , Midhun Harikumar , Ajinkya Gorakhnath Kale

IPC: G06T11/40 , G06N3/04 , G06N3/08 , G06T7/13

CPC classification number: G06T11/40 , G06N3/0454 , G06N3/088 , G06T7/13 , G06T2207/20081 , G06T2207/20084

Abstract: Systems and methods for image processing are configured. Embodiments of the present disclosure identify target style attributes and target structure attributes for a composite image; generate a matrix of composite feature tokens based on the target style attributes and the target structure attributes, wherein subsequent feature tokens of the matrix of composite feature tokens are sequentially generated based on previous feature tokens of the matrix of composite feature tokens according to a linear ordering of the matrix of composite feature tokens; and generate the composite image based on the matrix of composite feature tokens, wherein the composite image includes the target style attributes and the target structure attributes.

Search Results

Country/Region

Patent validity

Application date

Publication (announcement) day

applicant

The country/region where the applicant is located

Inventor

IPC

IPC Department

IPC class

IPC subclass

IPC group

IPC team

Appearance classification