Image segmentation using text embedding

    公开(公告)号:US12008698B2

    公开(公告)日:2024-06-11

    申请号:US18117155

    申请日:2023-03-03

    Applicant: Adobe Inc.

    Abstract: A non-transitory computer-readable medium includes program code that is stored thereon. The program code is executable by one or more processing devices for performing operations including generating, using a model, a learned image representation of a target image. The operations further include generating, using a text embedding model, a text embedding of a text query. The text embedding and the learned image representation of the target image are in a same embedding space. Additionally, the operations include convolving the learned image representation of the target image with the text embedding of the text query. Moreover, the operations include generating an object-segmented image based on the convolving of the learned image representation of the target image with the text embedding.

    DETECTING AND MODIFYING OBJECT ATTRIBUTES
    14.
    发明公开

    公开(公告)号:US20240168617A1

    公开(公告)日:2024-05-23

    申请号:US18058622

    申请日:2022-11-23

    Applicant: Adobe Inc.

    Abstract: The present disclosure relates to systems, methods, and non-transitory computer-readable media that modify digital images via scene-based editing using image understanding facilitated by artificial intelligence. For instance, in one or more embodiments, the disclosed systems detect a selection of an object portrayed in a digital image displayed within a graphical user interface of a client device. The disclosed systems provide, for display within the graphical user interface in response to detecting the selection of the object, an interactive window displaying one or more attributes of the object. The disclosed systems receive, via the interactive window, a user interaction to change an attribute from the one or more attributes. The disclosed systems modify the digital image by changing the attribute of the object in accordance with the user interaction.

    Context-based image tag translation

    公开(公告)号:US11842165B2

    公开(公告)日:2023-12-12

    申请号:US16553305

    申请日:2019-08-28

    Applicant: Adobe Inc.

    Inventor: Yang Yang Zhe Lin

    CPC classification number: G06F40/58 G06F16/53 G06F16/5866

    Abstract: In some embodiments, a context-based translation application generates a co-occurrence data structure for a target language describing co-occurrences of target language words and source language words. The context-based translation application receives an input tag for an input image in the source language to be translated into the target language. The context-based translation application obtains multiple candidate translations in the target language for the input tag and determines a translated tag from the multiple candidate translations based on the co-occurrence data structure. The context-based translation application further associates the translated tag with the input image.

    GENERATING MODIFIED DIGITAL IMAGES VIA IMAGE INPAINTING USING MULTI-GUIDED PATCH MATCH AND INTELLIGENT CURATION

    公开(公告)号:US20230385992A1

    公开(公告)日:2023-11-30

    申请号:US17664991

    申请日:2022-05-25

    Applicant: Adobe Inc.

    Abstract: The present disclosure relates to systems, methods, and non-transitory computer readable media that implement an inpainting framework having computer-implemented machine learning models to generate high-resolution inpainting results. For instance, in one or more embodiments, the disclosed systems generate an inpainted digital image utilizing a deep inpainting neural network from a digital image having a replacement region. The disclosed systems further generate, utilizing a visual guide algorithm, at least one deep visual guide from the inpainted digital image. Using a patch match model and the at least one deep visual guide, the disclosed systems generate a plurality of modified digital images from the digital image by replacing the region of pixels of the digital image with replacement pixels. Additionally, the disclosed systems select, utilizing an inpainting curation model, a modified digital image from the plurality of modified digital images to provide to a client device.

Patent Agency Ranking