-
公开(公告)号:US20220391450A1
公开(公告)日:2022-12-08
申请号:US17887694
申请日:2022-08-15
Applicant: Adobe Inc.
Inventor: Pranav Vineet Aggarwal , Ali Aminian , Ajinkya Gorakhnath Kale , Aashish Kumar Misraa
IPC: G06F16/903 , G06N20/00 , G06F16/908 , G06K9/62
Abstract: Technology is disclosed herein for enhanced similarity search. In an implementation, a search environment includes one or more computing hardware, software, and/or firmware components in support of enhanced similarity search. The one or more components identify a modality for a similarity search with respect to a query object. The components generate an embedding for the query object based on the modality and based on connections between the query object and neighboring nodes in a graph. The embedding for the query object provides the basis for the search for similar objects
-
公开(公告)号:US20200380027A1
公开(公告)日:2020-12-03
申请号:US16426369
申请日:2019-05-30
Applicant: Adobe Inc.
Inventor: Pranav Vineet Aggarwal , Zhe Lin , Baldo Antonio Faieta , Saeid Motiian
IPC: G06F16/583 , G06N3/08 , G06N20/00 , G06F16/33 , G06F16/538 , G06F16/532
Abstract: Multi-modal differential search with real-time focus adaptation techniques are described that overcome the challenges of conventional techniques in a variety of ways. In one example, a model is trained to support a visually guided machine-learning embedding space that supports visual intuition as to “what” is represented by text. The visually guided language embedding space supported by the model, once trained, may then be used to support visual intuition as part of a variety of functionality. In one such example, the visually guided language embedding space as implemented by the model may be leveraged as part of a multi-modal differential search to support search of digital images and other digital content with real-time focus adaptation which overcomes the challenges of conventional techniques.
-
公开(公告)号:US12182955B2
公开(公告)日:2024-12-31
申请号:US17814940
申请日:2022-07-26
Applicant: ADOBE INC.
Inventor: Pranav Vineet Aggarwal , Alvin Ghouas , Ajinkya Gorakhnath Kale
Abstract: Systems and methods for image processing are described. Embodiments of the present disclosure receive a first image depicting a scene and a second image that includes a style; segment the first image to obtain a first segment and a second segment, wherein the first segment has a shape of an object in the scene; apply a style transfer network to the first segment and the second image to obtain a first image part, wherein the first image part has the shape of the object and the style from the second image; combine the first image part with a second image part corresponding to the second segment to obtain a combined image; and apply a lenticular effect to the combined image to obtain an output image.
-
公开(公告)号:US20240420389A1
公开(公告)日:2024-12-19
申请号:US18526855
申请日:2023-12-01
Applicant: ADOBE INC.
Inventor: Vineet Batra , Sumit Chaturvedi , Abhishek Rai , Pranav Vineet Aggarwal , Ajinkya Gorakhnath Kale , Aman Jeph , Ankit Phogat , Sumit Dhingra , Fengbin Chen , Kshitiz Garg , Milos Hasan , Midhun Harikumar , Gaurav Suresh Pathak , Souymodip Chakraborty
IPC: G06T11/20 , G06V10/764 , G06V10/774
Abstract: Systems and methods for generating tile-able patterns from text include obtaining a text prompt and generating, by a generation prior model, a latent vector based on the text prompt, where the generation prior model is trained to output vectors within a distribution of tile-able patterns. An image generation model then generates an output image based on the latent vector. The output image comprises a tile-able pattern including an element from the text prompt.
-
公开(公告)号:US20230137774A1
公开(公告)日:2023-05-04
申请号:US17453595
申请日:2021-11-04
Applicant: ADOBE INC.
Inventor: Baldo Faieta , Ajinkya Gorakhnath Kale , Pranav Vineet Aggarwal , Naveen Marri , Saeid Motiian , Tracy Holloway King , Alex Filipkowski , Shabnam Ghadar
IPC: G06F16/583 , G06F16/58 , G06F16/538 , G06F40/295 , G06F16/535 , G06N3/08
Abstract: Systems and methods for image retrieval are described. Embodiments of the present disclosure receive a search query from a user; extract an entity and a color phrase describing the entity from the search query; generate an entity color embedding in a color embedding space from the color phrase using a multi-modal color encoder; identify an image in a database based on metadata for the image including an object label corresponding to the extracted entity and an object color embedding in the color embedding space corresponding to the object label; and provide image information for the image to the user based on the metadata.
-
公开(公告)号:US20210365727A1
公开(公告)日:2021-11-25
申请号:US17398317
申请日:2021-08-10
Applicant: Adobe Inc.
Inventor: Pranav Vineet Aggarwal , Zhe Lin , Baldo Antonio Faieta , Saeid Motiian
IPC: G06K9/62 , G06F16/535 , G06N20/00 , G06K9/72
Abstract: Text-to-visual machine learning embedding techniques are described that overcome the challenges of conventional techniques in a variety of ways. These techniques include use of query-based training data which may expand availability and types of training data usable to train a model. Generation of negative digital image samples is also described that may increase accuracy in training the model using machine learning. A loss function is also described that also supports increased accuracy and computational efficiency by losses separately, e.g., between positive or negative sample embeddings a text embedding.
-
公开(公告)号:US11144784B2
公开(公告)日:2021-10-12
申请号:US16426264
申请日:2019-05-30
Applicant: Adobe Inc.
Inventor: Pranav Vineet Aggarwal , Zhe Lin , Baldo Antonio Faieta , Saeid Motiian
IPC: G06K9/62 , G06K9/72 , G06F16/535 , G06N20/00 , G06F3/0482
Abstract: Text-to-visual machine learning embedding techniques are described that overcome the challenges of conventional techniques in a variety of ways. These techniques include use of query-based training data which may expand availability and types of training data usable to train a model. Generation of negative digital image samples is also described that may increase accuracy in training the model using machine learning. A loss function is also described that also supports increased accuracy and computational efficiency by losses separately, e.g., between positive or negative sample embeddings a text embedding.
-
公开(公告)号:US20200380298A1
公开(公告)日:2020-12-03
申请号:US16426264
申请日:2019-05-30
Applicant: Adobe Inc.
Inventor: Pranav Vineet Aggarwal , Zhe Lin , Baldo Antonio Faieta , Saeid Motiian
IPC: G06K9/62 , G06K9/72 , G06F16/535 , G06N20/00
Abstract: Text-to-visual machine learning embedding techniques are described that overcome the challenges of conventional techniques in a variety of ways. These techniques include use of query-based training data which may expand availability and types of training data usable to train a model. Generation of negative digital image samples is also described that may increase accuracy in training the model using machine learning. A loss function is also described that also supports increased accuracy and computational efficiency by losses separately, e.g., between positive or negative sample embeddings a text embedding.
-
公开(公告)号:US12260480B2
公开(公告)日:2025-03-25
申请号:US18178791
申请日:2023-03-06
Applicant: Adobe Inc.
Inventor: Sukriti Verma , Venkata naveen kumar Yadav Marri , Ritiz Tambi , Pranav Vineet Aggarwal , Peter O'Donovan , Midhun Harikumar , Ajinkya Kale
IPC: G06T11/60 , G06F3/0482
Abstract: Embodiments are disclosed for machine learning-based generation of recommended layouts. The method includes receiving a set of design elements for performing generative layout recommendation. A number of each type of design element from the set of design elements is determined. A set of recommended layouts are generated using a trained generative layout model and the number and type of design elements. The set of recommended layouts are output.
-
公开(公告)号:US20240404144A1
公开(公告)日:2024-12-05
申请号:US18329111
申请日:2023-06-05
Applicant: ADOBE INC.
Inventor: Pranav Vineet Aggarwal , Venkata Naveen Kumar Yadav Marri , Midhun Harikumar , Sachin Madhav Kelkar , Hareesh Ravi , Ajinkya Gorakhnath Kale
Abstract: Systems and methods for image processing are described. Embodiments of the present disclosure, via a multi-modal encoder of an image processing apparatus, encodes a text prompt to obtain a text embedding. A color encoder of the image processing apparatus encodes a color prompt to obtain a color embedding. A diffusion prior model of the image processing apparatus generates an image embedding based on the text embedding and the color embedding. A latent diffusion model of the image processing apparatus generates an image based on the image embedding, where the image includes an element from the text prompt and a color from the color prompt.
-
-
-
-
-
-
-
-
-