METHOD, APPARATUS, DEVICE AND STORAGE MEDIUM FOR FEATURE AGGREGATION

    公开(公告)号:US20240346811A1

    公开(公告)日:2024-10-17

    申请号:US18609182

    申请日:2024-03-19

    CPC classification number: G06V10/806 G06V10/26 G06V10/764 G06V10/7715

    Abstract: Embodiments of the disclosure provide a method, apparatus, device and storage medium for feature aggregation. The method comprises: extracting, with an image encoder, an image feature representation of an input image; for each image feature element set of a plurality of image feature element sets divided along a predetermined dimension of the plurality of dimensions in the image feature representation, selecting a first number of image feature elements from the image feature element set based on a ranking of corresponding image feature elements in the image feature element set, and determining an aggregated image feature element by aggregating the selected first number of image feature elements; and determining an aggregated image feature representation of the input image based on a plurality of aggregated image feature elements determined for the plurality of image feature element sets, respectively.

    METHOD, APPARATUS, DEVICE AND MEDIUM FOR MANAGING MODEL BASED ON DISTANCE BETWEEN SAMPLES

    公开(公告)号:US20240346318A1

    公开(公告)日:2024-10-17

    申请号:US18504220

    申请日:2023-11-08

    Inventor: Hao WU Cheng Yang

    CPC classification number: G06N3/084 G06N3/094

    Abstract: A method, apparatus, device, and medium for managing a model based on a distance between samples. In one method, a basic sample for training a contrastive learning model and a plurality of negative samples associated with the basic sample is obtained; a sequence of the plurality of negative samples is generated based on distances between the plurality of negative samples and the basic sample; the sequence of the plurality of negative samples is divided into a first set of negative samples and a second set of negative samples; an update parameter for updating the contrastive learning model is determined based on the basic sample, the first set of negative samples and a first weight of the first set of negative samples, and the second set of negative samples and a second weight of the second set of negative samples, the first weight is greater than the second weight.

    METHOD, APPARATUS, DEVICE AND MEDIUM FOR TRAINING AND APPLYING A CONTRASTIVE LEARNING MODEL

    公开(公告)号:US20240152760A1

    公开(公告)日:2024-05-09

    申请号:US18472601

    申请日:2023-09-22

    CPC classification number: G06N3/088

    Abstract: A method of training and applying contrastive learning model. The method includes obtaining a sample set and label information for training contrastive learning model, the sample set including a plurality of first samples of a first modality and a plurality of second samples of a second modality, the label information indicating a correlation between samples of the plurality of first samples and samples of the plurality of second samples; determining whether sample mixing is to be performed on the first modality or the second modality; in accordance with a determination that sample mixing is to be performed on the first modality, generating at least one first mixed sample of the first modality by mixing at least one pair of first samples among the plurality of first samples; and training the contrastive learning model at least based on the at least one first mixed sample and first mixed label information.

    METHOD, APPARATUS, DEVICE AND MEDIUM FOR TRAINING CONTRASTIVE LEARNING MODEL

    公开(公告)号:US20240144100A1

    公开(公告)日:2024-05-02

    申请号:US18496769

    申请日:2023-10-27

    CPC classification number: G06N20/00

    Abstract: Methods, apparatuses, a device, and a medium for training a contrastive learning model are provided. In a method, a plurality of sample sets for training the contrastive learning model are obtained, and the plurality of sample sets comprises a first sample set and a second sample set. A first target sample set is selected from the first sample set and the second sample set according to a predetermined rule. A first set of samples are determined based on the first target sample set according to a predefined batch size. The contrastive learning model is trained using the first set of samples. In this way, on the one hand, performance degradation of the contrastive learning model due to sample set bias may be avoided; on the other hand, a forgetting problem in the training process may be alleviated.

    METHODS, APPARATUSES, DEVICE, AND MEDIUM FOR CONTRASTIVE LEARNING

    公开(公告)号:US20240144007A1

    公开(公告)日:2024-05-02

    申请号:US18472605

    申请日:2023-09-22

    CPC classification number: G06N3/08

    Abstract: A method of contrastive learning comprises: determining, based on a model construction criterion, a first encoder for a first modality and a second encoder for a second modality; constructing a first contrastive learning model, the first contrastive learning model comprising the first encoder and a third encoder for the second modality, and a model capacity of the third encoder being greater than a model capacity of the second encoder; performing pre-training of the first contrastive learning model based on a first training dataset for the first modality and the second modality; and providing the pre-trained first encoder in the pre-trained first contrastive learning model for a downstream task. Because only the model capacity of one encoder is increased in the pre-training stage, model performance may be improved without increasing model training overhead during downstream task fine-tuning and model running overhead during model application.

    METHOD, APPARATUS, DEVICE, AND MEDIUM FOR DETERMINING UPDATE GRADIENT FOR CONTRASTIVE LEARNING MODEL

    公开(公告)号:US20240160925A1

    公开(公告)日:2024-05-16

    申请号:US18472973

    申请日:2023-09-22

    CPC classification number: G06N3/08

    Abstract: There are provided method, apparatus, device, and medium for determining update gradient for contrastive learning model. In the method, a gradient factor of a first type for the contrastive learning model is determined based on a first group of training data and a second group of training data for training the contrastive learning model. The gradient factor of the first type is not used for backpropagation during a training process. In a first stage of the training process, a gradient factor of a second type associated with the first group of training data is determined based on the contrastive learning model. The gradient factor of the second type is used for backpropagation during the training process. Gradient is obtained for updating the contrastive learning model based on the gradient factor of the first type and the gradient factor of the second type associated with the first group of training data.

Patent Agency Ranking