MULTI-MODAL IMAGE COLOR SEGMENTER AND EDITOR

    公开(公告)号:US20220343561A1

    公开(公告)日:2022-10-27

    申请号:US17240030

    申请日:2021-04-26

    Applicant: ADOBE INC.

    Abstract: Systems and methods for color replacement are described. Embodiments of the disclosure include a color replacement system that adjusts an image based on a user-input source color and target color. For example, the source color may be replaced with the target color throughout the entire image. In some embodiments, a user provides a speech or text input that identifies a source color to be replaced. The user may then provide a speech or text input identifying the target color, replacing the source color. A color replacement system creates and embedding of the source color, segments the image based on the source color embedding, and then replaces the color of segmented portion of the image with the target color.

    Multi-modal image color segmenter and editor

    公开(公告)号:US11756239B2

    公开(公告)日:2023-09-12

    申请号:US17240030

    申请日:2021-04-26

    Applicant: ADOBE INC.

    CPC classification number: G06T11/001 G06T7/11 G06T7/90

    Abstract: Systems and methods for color replacement are described. Embodiments of the disclosure include a color replacement system that adjusts an image based on a user-input source color and target color. For example, the source color may be replaced with the target color throughout the entire image. In some embodiments, a user provides a speech or text input that identifies a source color to be replaced. The user may then provide a speech or text input identifying the target color, replacing the source color. A color replacement system creates and embedding of the source color, segments the image based on the source color embedding, and then replaces the color of segmented portion of the image with the target color.

    Multi-lingual tagging for digital images

    公开(公告)号:US11645478B2

    公开(公告)日:2023-05-09

    申请号:US17088847

    申请日:2020-11-04

    Applicant: Adobe Inc.

    CPC classification number: G06F40/58 G06F40/117

    Abstract: Introduced here is an approach to translating tags assigned to digital images. As an example, embeddings may be extracted from a tag to be translated and the digital image with which the tag is associated by a multimodal model. These embeddings can be compared to embeddings extracted from a set of target tags associated with a target language by the multimodal model. Such an approach allows similarity to be established along two dimensions, which ensures the obstacles associated with direct translation can be avoided.

    Image segmentation using text embedding

    公开(公告)号:US11615567B2

    公开(公告)日:2023-03-28

    申请号:US16952008

    申请日:2020-11-18

    Applicant: Adobe Inc.

    Abstract: A non-transitory computer-readable medium includes program code that is stored thereon. The program code is executable by one or more processing devices for performing operations including generating, by a model that includes trainable components, a learned image representation of a target image. The operations further include generating, by a text embedding model, a text embedding of a text query. The text embedding and the learned image representation of the target image are in a same embedding space. Additionally, the operations include generating a class activation map of the target image by, at least, convolving the learned image representation of the target image with the text embedding of the text query. Moreover, the operations include generating an object-segmented image using the class activation map of the target image.

Patent Agency Ranking