-
公开(公告)号:US20240355018A1
公开(公告)日:2024-10-24
申请号:US18303898
申请日:2023-04-20
申请人: Adobe Inc.
发明人: Pranav Aggarwal , Hareesh Ravi , Midhun Harikumar , Ajinkya Gorakhnath Kale , Fengbin Chen , Venkata Naveen Kumar Yadav Marri
CPC分类号: G06T11/60 , G06T5/50 , G06T5/70 , G06T7/11 , G06T7/50 , G06T13/00 , G06T2200/24 , G06T2207/20021 , G06T2207/20081 , G06T2207/20084 , G06T2207/20092 , G06T2207/20212
摘要: The present disclosure relates to systems, methods, and non-transitory computer readable media for utilizing a diffusion neural network for mask aware image and typography editing. For example, in one or more embodiments the disclosed systems utilize a text-image encoder to generate a base image embedding from a base digital image. Moreover, the disclosed systems generate a mask-segmented image by combining a shape mask with the base digital image. In one or more implementations, the disclosed systems utilize noising steps of a diffusion noising model to generate a mask-segmented image noise map from the mask-segmented image. Furthermore, the disclosed systems utilize a diffusion neural network to create a stylized image corresponding to the shape mask from the base image embedding and the mask-segmented image noise map.
-
公开(公告)号:US20240354895A1
公开(公告)日:2024-10-24
申请号:US18303271
申请日:2023-04-19
申请人: ADOBE INC.
IPC分类号: G06T5/50 , G06T5/00 , G06T11/60 , G06V10/764
CPC分类号: G06T5/50 , G06T5/00 , G06T11/60 , G06V10/764 , G06T2200/24 , G06T2207/20076 , G06T2207/20081 , G06T2207/20084 , G06T2207/20092 , G06T2207/20212
摘要: Systems and methods for image processing are described. Embodiments of the present disclosure include an image generation network configured to encode a plurality of abstract images using a style encoder to obtain a plurality of abstract style encodings, wherein the style encoder is trained to represent image style separately from image content. A clustering component clusters the plurality of abstract style encodings to obtain an abstract style cluster comprising a subset of the plurality of abstract style encodings. A preset component generates an abstract style transfer preset representing the abstract style cluster.
-
公开(公告)号:US12056453B2
公开(公告)日:2024-08-06
申请号:US17658855
申请日:2022-04-12
申请人: ADOBE INC.
IPC分类号: G06F40/20 , G06F40/253 , G06F40/284 , G06F40/35
CPC分类号: G06F40/284 , G06F40/253 , G06F40/35
摘要: Systems and methods for natural language processing are described. Embodiments of the present disclosure receive plain text comprising a sequence of text entities; generate a sequence of entity embeddings based on the plain text, wherein each entity embedding in the sequence of entity embeddings is generated based on a text entity in the sequence of text entities; generate style information for the text entity based on the sequence of entity embeddings; and generate a document based on the style information.
-
公开(公告)号:US20240104131A1
公开(公告)日:2024-03-28
申请号:US17934690
申请日:2022-09-23
申请人: ADOBE INC.
IPC分类号: G06F16/532 , G06F40/284 , G06F40/289
CPC分类号: G06F16/532 , G06F40/284 , G06F40/289
摘要: Systems and methods for query processing are described. Embodiments of the present disclosure identify a target phrase in an original query, wherein the target phrase comprises a phrase to be replaced in the original query; replace the target phrase with a mask token to obtain a modified query; generate an alternative query based on the modified query using a masked language model (MLM), wherein the alternative query includes an alternative phrase in place of the target phrase that is consistent with a context of the target phrase; and retrieve a search result based on the alternative query.
-
公开(公告)号:US20230326178A1
公开(公告)日:2023-10-12
申请号:US17656147
申请日:2022-03-23
申请人: ADOBE INC.
IPC分类号: G06N3/08 , G06V10/74 , G06V10/771 , G06V10/77 , G06V10/82 , G06V10/774
CPC分类号: G06V10/761 , G06N3/088 , G06V10/771 , G06V10/7715 , G06V10/774 , G06V10/82
摘要: Systems and methods for image processing are described. Embodiments of the present disclosure identify a plurality of candidate concepts in a knowledge graph (KG) that correspond to an image tag of an image; generate an image embedding of the image using a multi-modal encoder; generate a concept embedding for each of the plurality of candidate concepts using the multi-modal encoder; select a matching concept from the plurality of candidate concepts based on the image embedding and the concept embedding; and generate association data between the image and the matching concept.
-
公开(公告)号:US20230137774A1
公开(公告)日:2023-05-04
申请号:US17453595
申请日:2021-11-04
申请人: ADOBE INC.
发明人: Baldo Faieta , Ajinkya Gorakhnath Kale , Pranav Vineet Aggarwal , Naveen Marri , Saeid Motiian , Tracy Holloway King , Alex Filipkowski , Shabnam Ghadar
IPC分类号: G06F16/583 , G06F16/58 , G06F16/538 , G06F40/295 , G06F16/535 , G06N3/08
摘要: Systems and methods for image retrieval are described. Embodiments of the present disclosure receive a search query from a user; extract an entity and a color phrase describing the entity from the search query; generate an entity color embedding in a color embedding space from the color phrase using a multi-modal color encoder; identify an image in a database based on metadata for the image including an object label corresponding to the extracted entity and an object color embedding in the color embedding space corresponding to the object label; and provide image information for the image to the user based on the metadata.
-
公开(公告)号:US20240037881A1
公开(公告)日:2024-02-01
申请号:US17814940
申请日:2022-07-26
申请人: ADOBE INC.
CPC分类号: G06T19/20 , G06T7/11 , G06T5/005 , G06T5/50 , G06T7/194 , G06T2207/20224 , G06T2207/20021 , G06T2210/62 , G06T2200/24 , G06T2207/20092 , G06T2207/20084 , G06T2219/2024 , G06T2219/2004
摘要: Systems and methods for image processing are described. Embodiments of the present disclosure receive a first image depicting a scene and a second image that includes a style; segment the first image to obtain a first segment and a second segment, wherein the first segment has a shape of an object in the scene; apply a style transfer network to the first segment and the second image to obtain a first image part, wherein the first image part has the shape of the object and the style from the second image; combine the first image part with a second image part corresponding to the second segment to obtain a combined image; and apply a lenticular effect to the combined image to obtain an output image.
-
公开(公告)号:US20240020954A1
公开(公告)日:2024-01-18
申请号:US17812596
申请日:2022-07-14
申请人: ADOBE INC.
IPC分类号: G06V10/774 , G06T5/00 , G06T7/194 , G06V10/771 , G06V10/776 , G06V10/26 , G06V10/75 , G06F16/532
CPC分类号: G06V10/774 , G06T5/005 , G06T7/194 , G06V10/771 , G06V10/776 , G06V10/267 , G06V10/759 , G06F16/532 , G06T2207/20081 , G06V2201/10
摘要: Systems and methods for image processing, and specifically for generating object-agnostic image representations, are described. Embodiments of the present disclosure receive a training image including a foreground object and a background, remove the foreground object from the training image to obtain a modified training image, inpaint a portion of the modified training image corresponding to the foreground object to obtain an inpainted training image, encode the training image and the inpainted training image using a machine learning model to obtain an encoded training image and an encoded inpainted training image, and update parameters of the machine learning model based on the encoded training image and the encoded inpainted training image.
-
公开(公告)号:US20230360294A1
公开(公告)日:2023-11-09
申请号:US17662560
申请日:2022-05-09
申请人: ADOBE INC.
CPC分类号: G06T11/40 , G06N3/0454 , G06N3/088 , G06T7/13 , G06T2207/20081 , G06T2207/20084
摘要: Systems and methods for image processing are configured. Embodiments of the present disclosure identify target style attributes and target structure attributes for a composite image; generate a matrix of composite feature tokens based on the target style attributes and the target structure attributes, wherein subsequent feature tokens of the matrix of composite feature tokens are sequentially generated based on previous feature tokens of the matrix of composite feature tokens according to a linear ordering of the matrix of composite feature tokens; and generate the composite image based on the matrix of composite feature tokens, wherein the composite image includes the target style attributes and the target structure attributes.
-
公开(公告)号:US20230153522A1
公开(公告)日:2023-05-18
申请号:US17455533
申请日:2021-11-18
申请人: ADOBE INC.
IPC分类号: G06F40/253 , G06K9/62 , G06F16/583
CPC分类号: G06F40/253 , G06K9/6256 , G06K9/6262 , G06F16/583
摘要: Systems and methods for image captioning are described. One or more aspects of the systems and methods include generating a training caption for a training image using an image captioning network; encoding the training caption using a multi-modal encoder to obtain an encoded training caption; encoding the training image using the multi-modal encoder to obtain an encoded training image; computing a reward function based on the encoded training caption and the encoded training image; and updating parameters of the image captioning network based on the reward function.
-
-
-
-
-
-
-
-
-