-
公开(公告)号:US20220343561A1
公开(公告)日:2022-10-27
申请号:US17240030
申请日:2021-04-26
Applicant: ADOBE INC.
Inventor: Pranav Aggarwal , Ajinkya Kale
Abstract: Systems and methods for color replacement are described. Embodiments of the disclosure include a color replacement system that adjusts an image based on a user-input source color and target color. For example, the source color may be replaced with the target color throughout the entire image. In some embodiments, a user provides a speech or text input that identifies a source color to be replaced. The user may then provide a speech or text input identifying the target color, replacing the source color. A color replacement system creates and embedding of the source color, segments the image based on the source color embedding, and then replaces the color of segmented portion of the image with the target color.
-
公开(公告)号:US20240355018A1
公开(公告)日:2024-10-24
申请号:US18303898
申请日:2023-04-20
Applicant: Adobe Inc.
Inventor: Pranav Aggarwal , Hareesh Ravi , Midhun Harikumar , Ajinkya Gorakhnath Kale , Fengbin Chen , Venkata Naveen Kumar Yadav Marri
CPC classification number: G06T11/60 , G06T5/50 , G06T5/70 , G06T7/11 , G06T7/50 , G06T13/00 , G06T2200/24 , G06T2207/20021 , G06T2207/20081 , G06T2207/20084 , G06T2207/20092 , G06T2207/20212
Abstract: The present disclosure relates to systems, methods, and non-transitory computer readable media for utilizing a diffusion neural network for mask aware image and typography editing. For example, in one or more embodiments the disclosed systems utilize a text-image encoder to generate a base image embedding from a base digital image. Moreover, the disclosed systems generate a mask-segmented image by combining a shape mask with the base digital image. In one or more implementations, the disclosed systems utilize noising steps of a diffusion noising model to generate a mask-segmented image noise map from the mask-segmented image. Furthermore, the disclosed systems utilize a diffusion neural network to create a stylized image corresponding to the shape mask from the base image embedding and the mask-segmented image noise map.
-
公开(公告)号:US11756239B2
公开(公告)日:2023-09-12
申请号:US17240030
申请日:2021-04-26
Applicant: ADOBE INC.
Inventor: Pranav Aggarwal , Ajinkya Kale
CPC classification number: G06T11/001 , G06T7/11 , G06T7/90
Abstract: Systems and methods for color replacement are described. Embodiments of the disclosure include a color replacement system that adjusts an image based on a user-input source color and target color. For example, the source color may be replaced with the target color throughout the entire image. In some embodiments, a user provides a speech or text input that identifies a source color to be replaced. The user may then provide a speech or text input identifying the target color, replacing the source color. A color replacement system creates and embedding of the source color, segments the image based on the source color embedding, and then replaces the color of segmented portion of the image with the target color.
-
公开(公告)号:US11645478B2
公开(公告)日:2023-05-09
申请号:US17088847
申请日:2020-11-04
Applicant: Adobe Inc.
Inventor: Ritiz Tambi , Pranav Aggarwal , Ajinkya Kale
IPC: G06F40/58 , G06F40/117
CPC classification number: G06F40/58 , G06F40/117
Abstract: Introduced here is an approach to translating tags assigned to digital images. As an example, embeddings may be extracted from a tag to be translated and the digital image with which the tag is associated by a multimodal model. These embeddings can be compared to embeddings extracted from a set of target tags associated with a target language by the multimodal model. Such an approach allows similarity to be established along two dimensions, which ensures the obstacles associated with direct translation can be avoided.
-
公开(公告)号:US11615567B2
公开(公告)日:2023-03-28
申请号:US16952008
申请日:2020-11-18
Applicant: Adobe Inc.
Inventor: Midhun Harikumar , Pranav Aggarwal , Baldo Faieta , Ajinkya Kale , Zhe Lin
Abstract: A non-transitory computer-readable medium includes program code that is stored thereon. The program code is executable by one or more processing devices for performing operations including generating, by a model that includes trainable components, a learned image representation of a target image. The operations further include generating, by a text embedding model, a text embedding of a text query. The text embedding and the learned image representation of the target image are in a same embedding space. Additionally, the operations include generating a class activation map of the target image by, at least, convolving the learned image representation of the target image with the text embedding of the text query. Moreover, the operations include generating an object-segmented image using the class activation map of the target image.
-
-
-
-