IDENTIFYING VISUALLY SIMILAR DIGITAL IMAGES UTILIZING DEEP LEARNING

    公开(公告)号:US20200210763A1

    公开(公告)日:2020-07-02

    申请号:US16817234

    申请日:2020-03-12

    Applicant: ADOBE INC.

    Abstract: The present disclosure relates to systems, methods, and non-transitory computer readable media for utilizing a deep neural network-based model to identify similar digital images for query digital images. For example, the disclosed systems utilize a deep neural network-based model to analyze query digital images to generate deep neural network-based representations of the query digital images. In addition, the disclosed systems can generate results of visually-similar digital images for the query digital images based on comparing the deep neural network-based representations with representations of candidate digital images. Furthermore, the disclosed systems can identify visually similar digital images based on user-defined attributes and image masks to emphasize specific attributes or portions of query digital images.

    Multi-scale distillation for low-resolution detection

    公开(公告)号:US12136185B2

    公开(公告)日:2024-11-05

    申请号:US17455134

    申请日:2021-11-16

    Applicant: ADOBE INC.

    Abstract: Systems and methods for image processing are described. The systems and methods include receiving a low-resolution image; generating a feature map based on the low-resolution image using an encoder of a student network, wherein the encoder of the student network is trained based on comparing a predicted feature map from the encoder of the student network and a fused feature map from a teacher network, and wherein the fused feature map represents a combination of first feature map from a high-resolution encoder of the teacher network and a second feature map from a low-resolution encoder of the teacher network; and decoding the feature map to obtain prediction information for the low-resolution image.

    Self-supervised document representation learning

    公开(公告)号:US11886815B2

    公开(公告)日:2024-01-30

    申请号:US17333892

    申请日:2021-05-28

    Applicant: Adobe Inc.

    Abstract: One example method involves operations for a processing device that include receiving, by a machine learning model trained to generate a search result, a search query for a text input. The machine learning model is trained by receiving pre-training data that includes multiple documents. Pre-training the machine learning model by generating, using an encoder, feature embeddings for each of the documents included in the pre-training data. The feature embeddings are generated by applying a masking function to visual and textual features in the documents. Training the machine learning model also includes generating, using the feature embeddings, output features for the documents by concatenating the feature embeddings and applying a non-linear mapping to the feature embeddings. Training the machine learning model further includes applying a linear classifier to the output features. Additionally, operations include generating, for display, a search result using the machine learning model based on the input.

    MULTI-SCALE DISTILLATION FOR LOW-RESOLUTION DETECTION

    公开(公告)号:US20230153943A1

    公开(公告)日:2023-05-18

    申请号:US17455134

    申请日:2021-11-16

    Applicant: ADOBE INC.

    CPC classification number: G06T3/4046 G06K9/6202 G06N3/08 G06N3/0454

    Abstract: Systems and methods for image processing are described. The systems and methods include receiving a low-resolution image; generating a feature map based on the low-resolution image using an encoder of a student network, wherein the encoder of the student network is trained based on comparing a predicted feature map from the encoder of the student network and a fused feature map from a teacher network, and wherein the fused feature map represents a combination of first feature map from a high-resolution encoder of the teacher network and a second feature map from a low-resolution encoder of the teacher network; and decoding the feature map to obtain prediction information for the low-resolution image.

    SELF-SUPERVISED DOCUMENT REPRESENTATION LEARNING

    公开(公告)号:US20220382975A1

    公开(公告)日:2022-12-01

    申请号:US17333892

    申请日:2021-05-28

    Applicant: Adobe Inc.

    Abstract: One example method involves operations for a processing device that include receiving, by a machine learning model trained to generate a search result, a search query for a text input. The machine learning model is trained by receiving pre-training data that includes multiple documents. Pre-training the machine learning model by generating, using an encoder, feature embeddings for each of the documents included in the pre-training data. The feature embeddings are generated by applying a masking function to visual and textual features in the documents. Training the machine learning model also includes generating, using the feature embeddings, output features for the documents by concatenating the feature embeddings and applying a non-linear mapping to the feature embeddings. Training the machine learning model further includes applying a linear classifier to the output features. Additionally, operations include generating, for display, a search result using the machine learning model based on the input.

    Identifying visually similar digital images utilizing deep learning

    公开(公告)号:US11227185B2

    公开(公告)日:2022-01-18

    申请号:US16817234

    申请日:2020-03-12

    Applicant: ADOBE INC.

    Abstract: The present disclosure relates to systems, methods, and non-transitory computer readable media for utilizing a deep neural network-based model to identify similar digital images for query digital images. For example, the disclosed systems utilize a deep neural network-based model to analyze query digital images to generate deep neural network-based representations of the query digital images. In addition, the disclosed systems can generate results of visually-similar digital images for the query digital images based on comparing the deep neural network-based representations with representations of candidate digital images. Furthermore, the disclosed systems can identify visually similar digital images based on user-defined attributes and image masks to emphasize specific attributes or portions of query digital images.

    Robust training of large-scale object detectors with a noisy dataset

    公开(公告)号:US11126890B2

    公开(公告)日:2021-09-21

    申请号:US16388115

    申请日:2019-04-18

    Applicant: ADOBE INC.

    Abstract: Systems and methods are described for object detection within a digital image using a hierarchical softmax function. The method may include applying a first softmax function of a softmax hierarchy on a digital image based on a first set of object classes that are children of a root node of a class hierarchy, then apply a second (and subsequent) softmax functions to the digital image based on a second (and subsequent) set of object classes, where the second (and subsequent) object classes are children nodes of an object class from the first (or parent) object classes. The methods may then include generating an object recognition output using a convolutional neural network (CNN) based at least in part on applying the first and second (and subsequent) softmax functions. In some cases, the hierarchical softmax function is the loss function for the CNN.

    UTILIZING A DEEP NEURAL NETWORK-BASED MODEL TO IDENTIFY VISUALLY SIMILAR DIGITAL IMAGES BASED ON USER-SELECTED VISUAL ATTRIBUTES

    公开(公告)号:US20190354802A1

    公开(公告)日:2019-11-21

    申请号:US15983949

    申请日:2018-05-18

    Applicant: Adobe Inc.

    Abstract: The present disclosure relates to systems, methods, and non-transitory computer readable media for utilizing a deep neural network-based model to identify similar digital images for query digital images. For example, the disclosed systems utilize a deep neural network-based model to analyze query digital images to generate deep neural network-based representations of the query digital images. In addition, the disclosed systems can generate results of visually-similar digital images for the query digital images based on comparing the deep neural network-based representations with representations of candidate digital images. Furthermore, the disclosed systems can identify visually similar digital images based on user-defined attributes and image masks to emphasize specific attributes or portions of query digital images.

Patent Agency Ranking