Auto-tags with object detection and crops

    公开(公告)号:US11790045B2

    公开(公告)日:2023-10-17

    申请号:US17240246

    申请日:2021-04-26

    Applicant: ADOBE INC.

    Abstract: Systems and methods for image tagging are described. In some embodiments, images with problematic tags are identified after applying an auto-tagger. The images with problematic tags are then sent to an object detection network. In some cases, the object detection network is trained using a training set selected to improve detection of objects associated with the problematic tags. The output of the object detection network can be merged with the output of the auto-tagger to provide a combined image tagging output. In some cases, the output of the object detection network also includes a bounding box, which can be used to crop the image around a relevant object so that the auto-tagger can be reapplied to a portion of the image.

    Identity Preserved Controllable Facial Image Manipulation

    公开(公告)号:US20230316591A1

    公开(公告)日:2023-10-05

    申请号:US17709895

    申请日:2022-03-31

    Applicant: Adobe Inc.

    CPC classification number: G06T11/00 G06V10/40 G06V10/7747

    Abstract: Techniques for identity preserved controllable facial image manipulation are described that support generation of a manipulated digital image based on a facial image and a render image. For instance, a facial image depicting a facial representation of an individual is received as input. A feature space including an identity parameter and at least one other visual parameter is extracted from the facial image. An editing module edits one or more of the visual parameters and preserves the identity parameter. A renderer generates a render image depicting a morphable model reconstruction of the facial image based on the edit. The render image and facial image are encoded, and a generator of a neural network is implemented to generate a manipulated digital image based on the encoded facial image and the encoded render image.

    Shaping a neural network architecture utilizing learnable sampling layers

    公开(公告)号:US11710042B2

    公开(公告)日:2023-07-25

    申请号:US16782793

    申请日:2020-02-05

    Applicant: Adobe Inc.

    CPC classification number: G06N3/082 G06N3/04

    Abstract: The present disclosure relates to shaping the architecture of a neural network. For example, the disclosed systems can provide a neural network shaping mechanism for at least one sampling layer of a neural network. The neural network shaping mechanism can include a learnable scaling factor between a sampling rate of the at least one sampling layer and an additional sampling function. The disclosed systems can learn the scaling factor based on a dataset while jointly learning the network weights of the neural network. Based on the learned scaling factor, the disclosed systems can shape the architecture of the neural network by modifying the sampling rate of the at least one sampling layer.

    IMAGE SEGMENTATION USING TEXT EMBEDDING
    338.
    发明公开

    公开(公告)号:US20230206525A1

    公开(公告)日:2023-06-29

    申请号:US18117155

    申请日:2023-03-03

    Applicant: Adobe Inc.

    Abstract: A non-transitory computer-readable medium includes program code that is stored thereon. The program code is executable by one or more processing devices for performing operations including generating, using a model, a learned image representation of a target image. The operations further include generating, using a text embedding model, a text embedding of a text query. The text embedding and the learned image representation of the target image are in a same embedding space. Additionally, the operations include convolving the learned image representation of the target image with the text embedding of the text query. Moreover, the operations include generating an object-segmented image based on the convolving of the learned image representation of the target image with the text embedding.

Patent Agency Ranking