-
Publication No.: US11604822B2
Publication date: 2023-03-14
Application No.: US16426369
Filing date: 2019-05-30
Applicant: Adobe Inc.
IPC classification: G06F7/00, G06F16/583, G06N3/084, G06N20/00, G06F16/538, G06F16/532, G06F16/33, G06F3/04855
Abstract: Multi-modal differential search with real-time focus adaptation techniques are described that overcome the challenges of conventional techniques in a variety of ways. In one example, a model is trained to support a visually guided machine-learning embedding space that supports visual intuition as to “what” is represented by text. The visually guided language embedding space supported by the model, once trained, may then be used to support visual intuition as part of a variety of functionality. In one such example, the visually guided language embedding space as implemented by the model may be leveraged as part of a multi-modal differential search to support search of digital images and other digital content with real-time focus adaptation, which overcomes the challenges of conventional techniques.
-
Publication No.: US20240037881A1
Publication date: 2024-02-01
Application No.: US17814940
Filing date: 2022-07-26
Applicant: ADOBE INC.
CPC classification: G06T19/20, G06T7/11, G06T5/005, G06T5/50, G06T7/194, G06T2207/20224, G06T2207/20021, G06T2210/62, G06T2200/24, G06T2207/20092, G06T2207/20084, G06T2219/2024, G06T2219/2004
Abstract: Systems and methods for image processing are described. Embodiments of the present disclosure receive a first image depicting a scene and a second image that includes a style; segment the first image to obtain a first segment and a second segment, wherein the first segment has a shape of an object in the scene; apply a style transfer network to the first segment and the second image to obtain a first image part, wherein the first image part has the shape of the object and the style from the second image; combine the first image part with a second image part corresponding to the second segment to obtain a combined image; and apply a lenticular effect to the combined image to obtain an output image.
-
Publication No.: US20230360294A1
Publication date: 2023-11-09
Application No.: US17662560
Filing date: 2022-05-09
Applicant: ADOBE INC.
CPC classification: G06T11/40, G06N3/0454, G06N3/088, G06T7/13, G06T2207/20081, G06T2207/20084
Abstract: Systems and methods for image processing are described. Embodiments of the present disclosure identify target style attributes and target structure attributes for a composite image; generate a matrix of composite feature tokens based on the target style attributes and the target structure attributes, wherein subsequent feature tokens of the matrix of composite feature tokens are sequentially generated based on previous feature tokens of the matrix of composite feature tokens according to a linear ordering of the matrix of composite feature tokens; and generate the composite image based on the matrix of composite feature tokens, wherein the composite image includes the target style attributes and the target structure attributes.
-
Publication No.: US20220391450A1
Publication date: 2022-12-08
Application No.: US17887694
Filing date: 2022-08-15
Applicant: Adobe Inc.
IPC classification: G06F16/903, G06N20/00, G06F16/908, G06K9/62
Abstract: Technology is disclosed herein for enhanced similarity search. In an implementation, a search environment includes one or more computing hardware, software, and/or firmware components in support of enhanced similarity search. The one or more components identify a modality for a similarity search with respect to a query object. The components generate an embedding for the query object based on the modality and based on connections between the query object and neighboring nodes in a graph. The embedding for the query object provides the basis for the search for similar objects.
-
Publication No.: US20200380027A1
Publication date: 2020-12-03
Application No.: US16426369
Filing date: 2019-05-30
Applicant: Adobe Inc.
IPC classification: G06F16/583, G06N3/08, G06N20/00, G06F16/33, G06F16/538, G06F16/532
Abstract: Multi-modal differential search with real-time focus adaptation techniques are described that overcome the challenges of conventional techniques in a variety of ways. In one example, a model is trained to support a visually guided machine-learning embedding space that supports visual intuition as to “what” is represented by text. The visually guided language embedding space supported by the model, once trained, may then be used to support visual intuition as part of a variety of functionality. In one such example, the visually guided language embedding space as implemented by the model may be leveraged as part of a multi-modal differential search to support search of digital images and other digital content with real-time focus adaptation, which overcomes the challenges of conventional techniques.
-
Publication No.: US20230137774A1
Publication date: 2023-05-04
Application No.: US17453595
Filing date: 2021-11-04
Applicant: ADOBE INC.
Inventors: Baldo Faieta, Ajinkya Gorakhnath Kale, Pranav Vineet Aggarwal, Naveen Marri, Saeid Motiian, Tracy Holloway King, Alex Filipkowski, Shabnam Ghadar
IPC classification: G06F16/583, G06F16/58, G06F16/538, G06F40/295, G06F16/535, G06N3/08
Abstract: Systems and methods for image retrieval are described. Embodiments of the present disclosure receive a search query from a user; extract an entity and a color phrase describing the entity from the search query; generate an entity color embedding in a color embedding space from the color phrase using a multi-modal color encoder; identify an image in a database based on metadata for the image including an object label corresponding to the extracted entity and an object color embedding in the color embedding space corresponding to the object label; and provide image information for the image to the user based on the metadata.
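The retrieval step in this abstract (match on object label, then compare color embeddings) can be illustrated with a minimal Python sketch. Everything named here is an assumption for illustration only: `COLOR_TABLE` is a toy lookup standing in for the multi-modal color encoder, RGB triples stand in for the color embedding space, `IMAGES` is invented metadata, and query parsing (extracting the entity and color phrase) is assumed to have already happened. The abstract does not specify any of these details.

```python
import math

# Hypothetical stand-in for the multi-modal color encoder:
# maps a color phrase to a point in a toy 3-D (RGB) embedding space.
COLOR_TABLE = {
    "red": (1.0, 0.0, 0.0),
    "blue": (0.0, 0.0, 1.0),
    "green": (0.0, 1.0, 0.0),
}

# Hypothetical image metadata: object label -> object color embedding.
IMAGES = [
    {"id": "img1", "objects": {"car": (0.9, 0.1, 0.1), "sky": (0.2, 0.4, 0.9)}},
    {"id": "img2", "objects": {"car": (0.1, 0.1, 0.9)}},
    {"id": "img3", "objects": {"shirt": (1.0, 0.0, 0.0)}},
]


def encode_color(phrase):
    """Return the toy color embedding for a color phrase."""
    return COLOR_TABLE[phrase.lower()]


def search(entity, color_phrase, images=IMAGES):
    """Rank images that contain `entity` by color-embedding distance."""
    query = encode_color(color_phrase)
    candidates = [
        (math.dist(query, img["objects"][entity]), img["id"])
        for img in images
        if entity in img["objects"]  # keep only images labeled with the entity
    ]
    # Smaller distance means the object's color is closer to the queried color.
    return [image_id for _, image_id in sorted(candidates)]


print(search("car", "red"))
```

Filtering on the object label before comparing colors keeps an image with a red shirt from matching a query for a red car, which appears to be the point of storing a color embedding per object label rather than per image.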
-
Publication No.: US20210365727A1
Publication date: 2021-11-25
Application No.: US17398317
Filing date: 2021-08-10
Applicant: Adobe Inc.
IPC classification: G06K9/62, G06F16/535, G06N20/00, G06K9/72
Abstract: Text-to-visual machine learning embedding techniques are described that overcome the challenges of conventional techniques in a variety of ways. These techniques include use of query-based training data which may expand availability and types of training data usable to train a model. Generation of negative digital image samples is also described that may increase accuracy in training the model using machine learning. A loss function is also described that supports increased accuracy and computational efficiency by treating losses separately, e.g., between positive or negative sample embeddings and a text embedding.
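As a rough illustration of handling the positive and negative losses separately, the Python sketch below computes one term that pulls the positive image embedding toward the text embedding and a second, independent term that pushes the negative image embedding away. The squared-distance measure, the hinge form, and the margin parameters `pos_margin` and `neg_margin` are assumptions made for this sketch; the abstract does not give the actual formulation.

```python
def squared_distance(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def separate_losses(text_emb, pos_emb, neg_emb, pos_margin=0.0, neg_margin=1.0):
    """Return (positive_loss, negative_loss) as two independent terms.

    Assumed form (not taken from the patent):
      positive term penalizes text/positive distance above pos_margin;
      negative term penalizes text/negative distance below neg_margin.
    """
    pos_loss = max(0.0, squared_distance(text_emb, pos_emb) - pos_margin)
    neg_loss = max(0.0, neg_margin - squared_distance(text_emb, neg_emb))
    return pos_loss, neg_loss


text = [0.0, 0.0]
positive = [0.1, 0.0]   # close to the text embedding
negative = [0.5, 0.0]   # too close: should be pushed away

pos_loss, neg_loss = separate_losses(text, positive, negative)
total = pos_loss + neg_loss
print(pos_loss, neg_loss, total)
```

In a conventional triplet loss the two distances sit inside a single hinge, so a very distant negative can mask a poorly placed positive. Keeping the terms apart, as sketched here, lets each one contribute to the total on its own.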
-
Publication No.: US11144784B2
Publication date: 2021-10-12
Application No.: US16426264
Filing date: 2019-05-30
Applicant: Adobe Inc.
IPC classification: G06K9/62, G06K9/72, G06F16/535, G06N20/00, G06F3/0482
Abstract: Text-to-visual machine learning embedding techniques are described that overcome the challenges of conventional techniques in a variety of ways. These techniques include use of query-based training data which may expand availability and types of training data usable to train a model. Generation of negative digital image samples is also described that may increase accuracy in training the model using machine learning. A loss function is also described that supports increased accuracy and computational efficiency by treating losses separately, e.g., between positive or negative sample embeddings and a text embedding.
-
Publication No.: US20200380298A1
Publication date: 2020-12-03
Application No.: US16426264
Filing date: 2019-05-30
Applicant: Adobe Inc.
IPC classification: G06K9/62, G06K9/72, G06F16/535, G06N20/00
Abstract: Text-to-visual machine learning embedding techniques are described that overcome the challenges of conventional techniques in a variety of ways. These techniques include use of query-based training data which may expand availability and types of training data usable to train a model. Generation of negative digital image samples is also described that may increase accuracy in training the model using machine learning. A loss function is also described that supports increased accuracy and computational efficiency by treating losses separately, e.g., between positive or negative sample embeddings and a text embedding.
-
Publication No.: US20240346629A1
Publication date: 2024-10-17
Application No.: US18301671
Filing date: 2023-04-17
Applicant: ADOBE INC.
Inventors: Midhun Harikumar, Venkata Naveen Kumar Yadav Marri, Ajinkya Gorakhnath Kale, Pranav Vineet Aggarwal, Vinh Ngoc Khuc
IPC classification: G06T5/00, G06F40/279, G06T5/50
CPC classification: G06T5/73, G06F40/279, G06T5/50
Abstract: Systems and methods for image processing are described. Embodiments of the present disclosure obtain a text prompt for text guided image generation. A multi-modal encoder of an image processing apparatus encodes the text prompt to obtain a text embedding. A diffusion prior model of the image processing apparatus converts the text embedding to an image embedding. A latent diffusion model of the image processing apparatus generates an image based on the image embedding, wherein the image includes an element described by the text prompt.