Patent search: ap:("ADOBE INC.") AND inv:"Ajinkya Gorakhnath Kale" (page 1)

1.

Invention publication
UTILIZING A DIFFUSION NEURAL NETWORK FOR MASK AWARE IMAGE AND TYPOGRAPHY EDITING [Status: under examination, published]

Publication (announcement) No.: US20240355018A1

Publication (announcement) date: 2024-10-24

Application No.: US18303898

Filing date: 2023-04-20

Applicant: Adobe Inc.

Inventors: Pranav Aggarwal, Hareesh Ravi, Midhun Harikumar, Ajinkya Gorakhnath Kale, Fengbin Chen, Venkata Naveen Kumar Yadav Marri

IPC classification: G06T11/60, G06T5/00, G06T5/50, G06T7/11, G06T7/50, G06T13/00

CPC classification: G06T11/60, G06T5/50, G06T5/70, G06T7/11, G06T7/50, G06T13/00, G06T2200/24, G06T2207/20021, G06T2207/20081, G06T2207/20084, G06T2207/20092, G06T2207/20212

Abstract: The present disclosure relates to systems, methods, and non-transitory computer readable media for utilizing a diffusion neural network for mask aware image and typography editing. For example, in one or more embodiments the disclosed systems utilize a text-image encoder to generate a base image embedding from a base digital image. Moreover, the disclosed systems generate a mask-segmented image by combining a shape mask with the base digital image. In one or more implementations, the disclosed systems utilize noising steps of a diffusion noising model to generate a mask-segmented image noise map from the mask-segmented image. Furthermore, the disclosed systems utilize a diffusion neural network to create a stylized image corresponding to the shape mask from the base image embedding and the mask-segmented image noise map.
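
The two preprocessing steps named in this abstract (combining a shape mask with the base image, then noising the result) can be illustrated with a minimal sketch. It rests on assumptions the abstract does not state: the mask is applied by element-wise multiplication, and noising follows the common closed form x_t = sqrt(alpha_bar) * x_0 + sqrt(1 - alpha_bar) * noise. The names `apply_shape_mask`, `add_noise`, and `alpha_bar` are hypothetical, and images are plain nested lists of grayscale values.

```python
import math
import random
from typing import List

Image = List[List[float]]


def apply_shape_mask(image: Image, mask: List[List[int]]) -> Image:
    """Keep pixels where the mask is 1 and zero out the rest.

    Element-wise multiplication is an assumption; the abstract only says
    the shape mask is "combined" with the base digital image.
    """
    return [[p * m for p, m in zip(img_row, mask_row)]
            for img_row, mask_row in zip(image, mask)]


def add_noise(image: Image, alpha_bar: float, rng: random.Random) -> Image:
    """Assumed forward-noising step: sqrt(alpha_bar)*x + sqrt(1-alpha_bar)*eps."""
    if not 0.0 <= alpha_bar <= 1.0:
        raise ValueError("alpha_bar must lie in [0, 1]")
    signal = math.sqrt(alpha_bar)
    noise = math.sqrt(1.0 - alpha_bar)
    return [[signal * p + noise * rng.gauss(0.0, 1.0) for p in row]
            for row in image]


base_image = [[0.5, 0.8], [0.2, 1.0]]
shape_mask = [[1, 0], [0, 1]]

masked = apply_shape_mask(base_image, shape_mask)
noise_map = add_noise(masked, alpha_bar=0.5, rng=random.Random(0))
```

With `alpha_bar=1.0` the function returns the masked image unchanged, and values closer to 0 yield a noisier map. The text-image encoder and the diffusion neural network themselves are outside the scope of this sketch.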

2.

Invention publication
PRESET STYLE TRANSFER [Status: under examination, published]

Publication (announcement) No.: US20240354895A1

Publication (announcement) date: 2024-10-24

Application No.: US18303271

Filing date: 2023-04-19

Applicant: ADOBE INC.

Inventors: Hareesh Ravi, Midhun Harikumar, Taesung Park, Ajinkya Gorakhnath Kale

IPC classification: G06T5/50, G06T5/00, G06T11/60, G06V10/764

CPC classification: G06T5/50, G06T5/00, G06T11/60, G06V10/764, G06T2200/24, G06T2207/20076, G06T2207/20081, G06T2207/20084, G06T2207/20092, G06T2207/20212

Abstract: Systems and methods for image processing are described. Embodiments of the present disclosure include an image generation network configured to encode a plurality of abstract images using a style encoder to obtain a plurality of abstract style encodings, wherein the style encoder is trained to represent image style separately from image content. A clustering component clusters the plurality of abstract style encodings to obtain an abstract style cluster comprising a subset of the plurality of abstract style encodings. A preset component generates an abstract style transfer preset representing the abstract style cluster.

3.

Granted invention patent
Semantic structure identification for document autostyling [Status: in force]

Publication (announcement) No.: US12056453B2

Publication (announcement) date: 2024-08-06

Application No.: US17658855

Filing date: 2022-04-12

Applicant: ADOBE INC.

Inventors: Ritiz Tambi, Rishav Agarwal, Rishabh Purwar, Ajinkya Gorakhnath Kale, Sanyam Jain

IPC classification: G06F40/20, G06F40/253, G06F40/284, G06F40/35

CPC classification: G06F40/284, G06F40/253, G06F40/35

Abstract: Systems and methods for natural language processing are described. Embodiments of the present disclosure receive plain text comprising a sequence of text entities; generate a sequence of entity embeddings based on the plain text, wherein each entity embedding in the sequence of entity embeddings is generated based on a text entity in the sequence of text entities; generate style information for the text entity based on the sequence of entity embeddings; and generate a document based on the style information.

4.

Invention publication
ZERO-SHOT ENTITY-AWARE NEAREST NEIGHBORS RETRIEVAL [Status: under examination, published]

Publication (announcement) No.: US20240104131A1

Publication (announcement) date: 2024-03-28

Application No.: US17934690

Filing date: 2022-09-23

Applicant: ADOBE INC.

Inventors: Ritiz Tambi, Ajinkya Gorakhnath Kale

IPC classification: G06F16/532, G06F40/284, G06F40/289

CPC classification: G06F16/532, G06F40/284, G06F40/289

Abstract: Systems and methods for query processing are described. Embodiments of the present disclosure identify a target phrase in an original query, wherein the target phrase comprises a phrase to be replaced in the original query; replace the target phrase with a mask token to obtain a modified query; generate an alternative query based on the modified query using a masked language model (MLM), wherein the alternative query includes an alternative phrase in place of the target phrase that is consistent with a context of the target phrase; and retrieve a search result based on the alternative query.
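
The query-rewriting steps in this abstract (mask the target phrase, then substitute an alternative) can be sketched in a few lines. This is only an illustration under stated assumptions: the mask token `[MASK]` and the function names `mask_target_phrase` and `fill_mask` are hypothetical, and the masked language model is not included, so the alternative phrase is supplied by the caller in place of a model prediction.

```python
import re

MASK_TOKEN = "[MASK]"  # assumed token; each masked language model defines its own


def mask_target_phrase(query: str, target_phrase: str,
                       mask_token: str = MASK_TOKEN) -> str:
    """Replace the first whole-phrase occurrence of target_phrase with the mask token."""
    pattern = re.compile(r"(?<!\w)" + re.escape(target_phrase) + r"(?!\w)",
                         re.IGNORECASE)
    # A function replacement avoids any escape handling in the mask token.
    modified, count = pattern.subn(lambda _match: mask_token, query, count=1)
    if count == 0:
        raise ValueError(f"target phrase {target_phrase!r} not found in query")
    return modified


def fill_mask(modified_query: str, alternative_phrase: str,
              mask_token: str = MASK_TOKEN) -> str:
    """Stand-in for the MLM step: insert a caller-supplied alternative phrase."""
    return modified_query.replace(mask_token, alternative_phrase, 1)


original_query = "photo of a golden retriever puppy"
modified_query = mask_target_phrase(original_query, "golden retriever")
alternative_query = fill_mask(modified_query, "labrador")
```

In a fuller pipeline, the alternative phrase would come from a masked language model scoring candidates for the masked position, and the alternative query would then be passed to the retrieval step.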

5.

Invention publication
CONCEPT DISAMBIGUATION USING MULTIMODAL EMBEDDINGS [Status: under examination, published]

Publication (announcement) No.: US20230326178A1

Publication (announcement) date: 2023-10-12

Application No.: US17656147

Filing date: 2022-03-23

Applicant: ADOBE INC.

Inventors: Venkata Naveen Kumar Yadav Marri, Ajinkya Gorakhnath Kale

IPC classification: G06N3/08, G06V10/74, G06V10/771, G06V10/77, G06V10/82, G06V10/774

CPC classification: G06V10/761, G06N3/088, G06V10/771, G06V10/7715, G06V10/774, G06V10/82

Abstract: Systems and methods for image processing are described. Embodiments of the present disclosure identify a plurality of candidate concepts in a knowledge graph (KG) that correspond to an image tag of an image; generate an image embedding of the image using a multi-modal encoder; generate a concept embedding for each of the plurality of candidate concepts using the multi-modal encoder; select a matching concept from the plurality of candidate concepts based on the image embedding and the concept embedding; and generate association data between the image and the matching concept.
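
The selection step in this abstract can be illustrated with a small sketch. The abstract says only that the matching concept is chosen "based on the image embedding and the concept embedding"; using cosine similarity as the score is an assumption made here, as are the function names and the toy three-dimensional vectors, which stand in for the output of a real multi-modal encoder.

```python
import math
from typing import Dict, Sequence


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine of the angle between two vectors; 0.0 if either vector is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def select_matching_concept(image_embedding: Sequence[float],
                            concept_embeddings: Dict[str, Sequence[float]]) -> str:
    """Return the candidate concept whose embedding is most similar to the image's."""
    if not concept_embeddings:
        raise ValueError("no candidate concepts supplied")
    return max(concept_embeddings,
               key=lambda name: cosine_similarity(image_embedding,
                                                  concept_embeddings[name]))


# Toy candidates for an ambiguous image tag "jaguar" (made-up embeddings).
candidates = {
    "jaguar (animal)": [0.9, 0.1, 0.0],
    "jaguar (car)": [0.1, 0.9, 0.2],
}
image_embedding = [0.8, 0.2, 0.1]
matching_concept = select_matching_concept(image_embedding, candidates)
```

Here the image embedding lies closer to the animal concept, so that candidate is selected. The association data mentioned in the abstract could then be as simple as a record pairing an image identifier with the chosen concept.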

6.

Invention application
EMBEDDING-BASED COLOR-OBJECT RETRIEVAL [Status: in force]

Publication (announcement) No.: US20230137774A1

Publication (announcement) date: 2023-05-04

Application No.: US17453595

Filing date: 2021-11-04

Applicant: ADOBE INC.

Inventors: Baldo Faieta, Ajinkya Gorakhnath Kale, Pranav Vineet Aggarwal, Naveen Marri, Saeid Motiian, Tracy Holloway King, Alex Filipkowski, Shabnam Ghadar

IPC classification: G06F16/583, G06F16/58, G06F16/538, G06F40/295, G06F16/535, G06N3/08

Abstract: Systems and methods for image retrieval are described. Embodiments of the present disclosure receive a search query from a user; extract an entity and a color phrase describing the entity from the search query; generate an entity color embedding in a color embedding space from the color phrase using a multi-modal color encoder; identify an image in a database based on metadata for the image including an object label corresponding to the extracted entity and an object color embedding in the color embedding space corresponding to the object label; and provide image information for the image to the user based on the metadata.

7.

Invention publication
STYLIZED MOTION EFFECTS [Status: under examination, published]

Publication (announcement) No.: US20240037881A1

Publication (announcement) date: 2024-02-01

Application No.: US17814940

Filing date: 2022-07-26

Applicant: ADOBE INC.

Inventors: Pranav Vineet Aggarwal, Alvin Ghouas, Ajinkya Gorakhnath Kale

IPC classification: G06T19/20, G06T7/11, G06T5/00, G06T5/50, G06T7/194

CPC classification: G06T19/20, G06T7/11, G06T5/005, G06T5/50, G06T7/194, G06T2207/20224, G06T2207/20021, G06T2210/62, G06T2200/24, G06T2207/20092, G06T2207/20084, G06T2219/2024, G06T2219/2004

Abstract: Systems and methods for image processing are described. Embodiments of the present disclosure receive a first image depicting a scene and a second image that includes a style; segment the first image to obtain a first segment and a second segment, wherein the first segment has a shape of an object in the scene; apply a style transfer network to the first segment and the second image to obtain a first image part, wherein the first image part has the shape of the object and the style from the second image; combine the first image part with a second image part corresponding to the second segment to obtain a combined image; and apply a lenticular effect to the combined image to obtain an output image.

8.

Invention publication
OBJECT-AGNOSTIC IMAGE REPRESENTATION [Status: under examination, published]

Publication (announcement) No.: US20240020954A1

Publication (announcement) date: 2024-01-18

Application No.: US17812596

Filing date: 2022-07-14

Applicant: ADOBE INC.

Inventors: Sachin Kelkar, Ajinkya Gorakhnath Kale, Midhun Harikumar

IPC classification: G06V10/774, G06T5/00, G06T7/194, G06V10/771, G06V10/776, G06V10/26, G06V10/75, G06F16/532

CPC classification: G06V10/774, G06T5/005, G06T7/194, G06V10/771, G06V10/776, G06V10/267, G06V10/759, G06F16/532, G06T2207/20081, G06V2201/10

Abstract: Systems and methods for image processing, and specifically for generating object-agnostic image representations, are described. Embodiments of the present disclosure receive a training image including a foreground object and a background, remove the foreground object from the training image to obtain a modified training image, inpaint a portion of the modified training image corresponding to the foreground object to obtain an inpainted training image, encode the training image and the inpainted training image using a machine learning model to obtain an encoded training image and an encoded inpainted training image, and update parameters of the machine learning model based on the encoded training image and the encoded inpainted training image.

9.

Invention publication
UNSUPERVISED STYLE AND COLOR CUES FOR TRANSFORMER-BASED IMAGE GENERATION [Status: under examination, published]

Publication (announcement) No.: US20230360294A1

Publication (announcement) date: 2023-11-09

Application No.: US17662560

Filing date: 2022-05-09

Applicant: ADOBE INC.

Inventors: Pranav Vineet Aggarwal, Midhun Harikumar, Ajinkya Gorakhnath Kale

IPC classification: G06T11/40, G06N3/04, G06N3/08, G06T7/13

CPC classification: G06T11/40, G06N3/0454, G06N3/088, G06T7/13, G06T2207/20081, G06T2207/20084

Abstract: Systems and methods for image processing are configured. Embodiments of the present disclosure identify target style attributes and target structure attributes for a composite image; generate a matrix of composite feature tokens based on the target style attributes and the target structure attributes, wherein subsequent feature tokens of the matrix of composite feature tokens are sequentially generated based on previous feature tokens of the matrix of composite feature tokens according to a linear ordering of the matrix of composite feature tokens; and generate the composite image based on the matrix of composite feature tokens, wherein the composite image includes the target style attributes and the target structure attributes.

10.

Invention publication
IMAGE CAPTIONING [Status: under examination, published]

Publication (announcement) No.: US20230153522A1

Publication (announcement) date: 2023-05-18

Application No.: US17455533

Filing date: 2021-11-18

Applicant: ADOBE INC.

Inventors: Jaemin Cho, Seunghyun Yoon, Ajinkya Gorakhnath Kale, Trung Huu Bui, Franck Dernoncourt

IPC classification: G06F40/253, G06K9/62, G06F16/583

CPC classification: G06F40/253, G06K9/6256, G06K9/6262, G06F16/583

Abstract: Systems and methods for image captioning are described. One or more aspects of the systems and methods include generating a training caption for a training image using an image captioning network; encoding the training caption using a multi-modal encoder to obtain an encoded training caption; encoding the training image using the multi-modal encoder to obtain an encoded training image; computing a reward function based on the encoded training caption and the encoded training image; and updating parameters of the image captioning network based on the reward function.
