-
公开(公告)号:US20220391633A1
公开(公告)日:2022-12-08
申请号:US17337194
申请日:2021-06-02
Applicant: Adobe Inc.
Inventor: Midhun Harikumar , Zhe Lin , Shabnam Ghadar , Baldo Faieta
Abstract: Methods, systems, and non-transitory computer readable media are disclosed for accurately and efficiently generating groups of images portraying semantically similar objects for utilization in building machine learning models. In particular, the disclosed system utilizes metadata and spatial statistics to extract semantically similar objects from a repository of digital images. In some embodiments, the disclosed system generates color embeddings and content embeddings for the identified objects. The disclosed system can further group similar objects together within a query space by utilizing a clustering algorithm to create object clusters and then refining and combining the object clusters within the query space. In some embodiments, the disclosed system utilizes one or more of the object clusters to build a machine learning model.
-
公开(公告)号:US11138257B2
公开(公告)日:2021-10-05
申请号:US16745143
申请日:2020-01-16
Applicant: Adobe Inc.
Inventor: Midhun Harikumar , Zhe Lin , Pramod Srinivasan , Jianming Zhang , Daniel David Miranda , Baldo Antonio Faieta
IPC: G06F16/532 , G06F3/0484 , G06T7/11 , G06F16/538 , G06F16/587 , G06T7/70
Abstract: Object search techniques for digital images are described. In the techniques described herein, semantic features are extracted on a per-object basis form a digital image. This supports location of objects within digital images and is not limited to semantic features of an entirety of the digital image as involved in conventional image similarity search techniques. This may be combined with indications a location of the object globally with respect to the digital image through use of a global segmentation mask, use of a local segmentation mask to capture post and characteristics of the object itself, and so on.
-
公开(公告)号:US20210224312A1
公开(公告)日:2021-07-22
申请号:US16745143
申请日:2020-01-16
Applicant: Adobe Inc.
Inventor: Midhun Harikumar , Zhe Lin , Pramod Srinivasan , Jianming Zhang , Daniel David Miranda , Baldo Antonio Faieta
IPC: G06F16/532 , G06F3/0484 , G06T7/11 , G06T7/70 , G06F16/538 , G06F16/587
Abstract: Object search techniques for digital images are described. In the techniques described herein, semantic features are extracted on a per-object basis form a digital image. This supports location of objects within digital images and is not limited to semantic features of an entirety of the digital image as involved in conventional image similarity search techniques. This may be combined with indications a location of the object globally with respect to the digital image through use of a global segmentation mask, use of a local segmentation mask to capture post and characteristics of the object itself, and so on.
-
公开(公告)号:US20240362842A1
公开(公告)日:2024-10-31
申请号:US18308017
申请日:2023-04-27
Applicant: Adobe Inc.
Inventor: Hareesh Ravi , Sachin Kelkar , Midhun Harikumar , Ajinkya Gorakhnath Kale
CPC classification number: G06T11/60 , G06T5/70 , G06T2200/24 , G06T2207/20084
Abstract: The present disclosure relates to systems, methods, and non-transitory computer readable media for utilizing a diffusion prior neural network for text guided digital image editing. For example, in one or more embodiments the disclosed systems utilize a text-image encoder to generate a base image embedding from the base digital image and an edit text embedding from edit text. Moreover, the disclosed systems utilize a diffusion prior neural network to generate a text-image embedding. In particular, the disclosed systems inject the base image embedding at a conceptual editing step of the diffusion prior neural network and condition a set of steps of the diffusion prior neural network after the conceptual editing step utilizing the edit text embedding. Furthermore, the disclosed systems utilize a diffusion neural network to create a modified digital image from the text-edited image embedding and the base image embedding.
-
公开(公告)号:US20240346629A1
公开(公告)日:2024-10-17
申请号:US18301671
申请日:2023-04-17
Applicant: ADOBE INC.
Inventor: Midhun Harikumar , Venkata Naveen Kumar Yadav Marri , Ajinkya Gorakhnath Kale , Pranav Vineet Aggarwal , Vinh Ngoc Khuc
IPC: G06T5/00 , G06F40/279 , G06T5/50
CPC classification number: G06T5/73 , G06F40/279 , G06T5/50
Abstract: Systems and methods for image processing are described. Embodiments of the present disclosure obtain a text prompt for text guided image generation. A multi-modal encoder of an image processing apparatus encodes the text prompt to obtain a text embedding. A diffusion prior model of the image processing apparatus converts the text embedding to an image embedding. A latent diffusion model of the image processing apparatus generates an image based on the image embedding, wherein the image includes an element described by the text prompt.
-
公开(公告)号:US20240320872A1
公开(公告)日:2024-09-26
申请号:US18426763
申请日:2024-01-30
Applicant: ADOBE INC.
Inventor: Tobias Hinz , Venkata Naveen Kumar Yadav Marri , Midhun Harikumar , Ajinkya Gorakhnath Kale , Zhe Lin , Oliver Wang , Jingwan Lu
IPC: G06T11/00 , G06F40/284 , G06F40/40
CPC classification number: G06T11/00 , G06F40/284 , G06F40/40 , G06T2207/20081 , G06T2207/20084
Abstract: A method, apparatus, non-transitory computer readable medium, and system for image generation include obtaining a text embedding of a text prompt and an image embedding of an image prompt. Some embodiments map the text embedding into a joint embedding space to obtain a joint text embedding and map the image embedding into the joint embedding space to obtain a joint image embedding. Some embodiments generate a synthetic image based on the joint text embedding and the joint image embedding.
-
公开(公告)号:US11934448B2
公开(公告)日:2024-03-19
申请号:US18302201
申请日:2023-04-18
Applicant: Adobe Inc.
Inventor: Pramod Srinivasan , Zhe Lin , Samarth Gulati , Saeid Motiian , Midhun Harikumar , Baldo Antonio Faieta , Alex C. Filipkowski
IPC: G06F16/532 , G06F16/51 , G06F16/538 , G06F16/54 , G06F16/583 , G06F40/30
CPC classification number: G06F16/532 , G06F16/51 , G06F16/538 , G06F16/54 , G06F16/583 , G06F40/30
Abstract: Keyword localization digital image search techniques are described. These techniques support an ability to indicate “where” a corresponding keyword is to be expressed with respect to a layout in a respective digital image resulting from a search query. The search query may also include an indication of a size of the keyword as expressed in the digital image, a number of instances of the keyword, and so forth. Additionally, the techniques and systems as described herein support real time search through use of keyword signatures.
-
公开(公告)号:US20230419551A1
公开(公告)日:2023-12-28
申请号:US17808261
申请日:2022-06-22
Applicant: Adobe Inc.
Inventor: Midhun Harikumar , Pranav Aggarwal , Ajinkya Gorakhnath Kale
Abstract: Techniques for generating a novel image using tokenized image representations are disclosed. In some embodiments, a method of generating the novel image includes generating, via a first machine learning model, a first sequence of coded representations of a first image having one or more features; generating, via a second machine learning model, a second sequence of coded representations of a sketch image having one or more edge features associated with the one or more features; predicting, via a third machine learning model, one or more subsequent coded representations based on the first sequence of coded representations and the second sequence of coded representations; and based on the subsequent coded representations, generating, via the third machine learning model, a first portion of a reconstructed image having one or more image attributes of the first image, and a second portion of the reconstructed image associated with the one or more edge features.
-
19.
公开(公告)号:US11574392B2
公开(公告)日:2023-02-07
申请号:US16803332
申请日:2020-02-27
Applicant: Adobe Inc.
Inventor: Zhe Lin , Vipul Dalal , Vera Lychagina , Shabnam Ghadar , Saeid Motiian , Rohith mohan Dodle , Prethebha Chandrasegaran , Mina Doroudi , Midhun Harikumar , Kannan Iyer , Jayant Kumar , Gaurav Kukal , Daniel Miranda , Charles R McKinney , Archit Kalra
Abstract: The present disclosure relates to an image merging system that automatically and seamlessly detects and merges missing people for a set of digital images into a composite group photo. For instance, the image merging system utilizes a number of models and operations to automatically analyze multiple digital images to identify a missing person from a base image, segment the missing person from the second image, and generate a composite group photo by merging the segmented image of the missing person into the base image. In this manner, the image merging system automatically creates merged group photos that appear natural and realistic.
-
公开(公告)号:US12260480B2
公开(公告)日:2025-03-25
申请号:US18178791
申请日:2023-03-06
Applicant: Adobe Inc.
Inventor: Sukriti Verma , Venkata naveen kumar Yadav Marri , Ritiz Tambi , Pranav Vineet Aggarwal , Peter O'Donovan , Midhun Harikumar , Ajinkya Kale
IPC: G06T11/60 , G06F3/0482
Abstract: Embodiments are disclosed for machine learning-based generation of recommended layouts. The method includes receiving a set of design elements for performing generative layout recommendation. A number of each type of design element from the set of design elements is determined. A set of recommended layouts are generated using a trained generative layout model and the number and type of design elements. The set of recommended layouts are output.
-
-
-
-
-
-
-
-
-