Robust training of large-scale object detectors with a noisy dataset

    公开(公告)号:US11126890B2

    公开(公告)日:2021-09-21

    申请号:US16388115

    申请日:2019-04-18

    Applicant: ADOBE INC.

    Abstract: Systems and methods are described for object detection within a digital image using a hierarchical softmax function. The method may include applying a first softmax function of a softmax hierarchy on a digital image based on a first set of object classes that are children of a root node of a class hierarchy, then apply a second (and subsequent) softmax functions to the digital image based on a second (and subsequent) set of object classes, where the second (and subsequent) object classes are children nodes of an object class from the first (or parent) object classes. The methods may then include generating an object recognition output using a convolutional neural network (CNN) based at least in part on applying the first and second (and subsequent) softmax functions. In some cases, the hierarchical softmax function is the loss function for the CNN.

    Generating Descriptions of Image Relationships

    公开(公告)号:US20210232850A1

    公开(公告)日:2021-07-29

    申请号:US16750478

    申请日:2020-01-23

    Applicant: Adobe Inc.

    Abstract: In implementations of generating descriptions of image relationships, a computing device implements a description system which receives a source digital image and a target digital image. The description system generates a source feature sequence from the source digital image and a target feature sequence from the target digital image. A visual relationship between the source digital image and the target digital image is determined by using cross-attention between the source feature sequence and the target feature sequence. The system generates a description of a visual transformation between the source digital image and the target digital image based on the visual relationship.

    GENERATING MODIFIED DIGITAL IMAGES UTILIZING A DISPERSED MULTIMODAL SELECTION MODEL

    公开(公告)号:US20210004576A1

    公开(公告)日:2021-01-07

    申请号:US17025477

    申请日:2020-09-18

    Applicant: Adobe Inc.

    Abstract: The present disclosure relates to systems, methods, and non-transitory computer readable media for generating modified digital images based on verbal and/or gesture input by utilizing a natural language processing neural network and one or more computer vision neural networks. The disclosed systems can receive verbal input together with gesture input. The disclosed systems can further utilize a natural language processing neural network to generate a verbal command based on verbal input. The disclosed systems can select a particular computer vision neural network based on the verbal input and/or the gesture input. The disclosed systems can apply the selected computer vision neural network to identify pixels within a digital image that correspond to an object indicated by the verbal input and/or gesture input. Utilizing the identified pixels, the disclosed systems can generate a modified digital image by performing one or more editing actions indicated by the verbal input and/or gesture input.

    Custom auto tagging of multiple objects

    公开(公告)号:US10853700B2

    公开(公告)日:2020-12-01

    申请号:US16928949

    申请日:2020-07-14

    Applicant: Adobe Inc.

    Abstract: There is described a computing device and method in a digital medium environment for custom auto tagging of multiple objects. The computing device includes an object detection network and multiple image classification networks. An image is received at the object detection network and includes multiple visual objects. First feature maps are applied to the image at the object detection network and generate object regions associated with the visual objects. The object regions are assigned to the multiple image classification networks, and each image classification network is assigned to a particular object region. The second feature maps are applied to each object region at each image classification network, and each image classification network outputs one or more classes associated with a visual object corresponding to each object region.

    Meta-learning for facial recognition

    公开(公告)号:US10832036B2

    公开(公告)日:2020-11-10

    申请号:US16036757

    申请日:2018-07-16

    Applicant: ADOBE INC.

    Abstract: Methods and systems are provided for generating a facial recognition system. A facial recognition system can be implemented using a meta-model based on a trained neural network. A neural network can be trained as multiple classifiers that identify individuals using a small number of images of the individual's face. A meta-model can learn from the neural networks to be capable to identify an individual based on a small number of images. In this way, the facial recognition system uses the meta-model that learns from the neural network trained to identify an individual based on a small number of images. Such a facial recognition system is tested to determine any misidentification for fine-tuning the system. A facial recognition system implemented using such a meta-model is capable of adapting the model to learn identities entered into the system using only a small number of images to enroll an identity into the system.

Patent Agency Ranking