-
公开(公告)号:US20240346799A1
公开(公告)日:2024-10-17
申请号:US18634560
申请日:2024-04-12
Inventor: Quan Cui , Muyang Yi , Hao Wu , Cheng Yang
IPC: G06V10/26 , G06T3/40 , G06V10/50 , G06V10/764 , G06V10/774 , G06V20/70
CPC classification number: G06V10/26 , G06T3/40 , G06V10/50 , G06V10/764 , G06V20/70 , G06V10/7753
Abstract: Embodiments of the disclosure provides technologies for image segmentation. The method includes: extracting an image feature representation of a target image using a trained image encoder; for each of a plurality of classes, generating, using a trained text encoder, a text feature representation corresponding to a name of the class, and determining a candidate segmentation map for the target image and a class confidence of the class based on the image feature representation and the text feature representation; selecting, from the plurality of classes, at least one class related to the target image based on a plurality of class confidences determined respectively for the plurality of classes; and determining a target segmentation map for the target image based on the at least one candidate segmentation map and the at least one class confidence determined for the at least one selected class.
-
公开(公告)号:US20240346811A1
公开(公告)日:2024-10-17
申请号:US18609182
申请日:2024-03-19
Inventor: Quan Cui , Muyang Yi , Hao Wu , Cheng Yang
IPC: G06V10/80 , G06V10/26 , G06V10/764 , G06V10/77
CPC classification number: G06V10/806 , G06V10/26 , G06V10/764 , G06V10/7715
Abstract: Embodiments of the disclosure provide a method, apparatus, device and storage medium for feature aggregation. The method comprises: extracting, with an image encoder, an image feature representation of an input image; for each image feature element set of a plurality of image feature element sets divided along a predetermined dimension of the plurality of dimensions in the image feature representation, selecting a first number of image feature elements from the image feature element set based on a ranking of corresponding image feature elements in the image feature element set, and determining an aggregated image feature element by aggregating the selected first number of image feature elements; and determining an aggregated image feature representation of the input image based on a plurality of aggregated image feature elements determined for the plurality of image feature element sets, respectively.
-
3.
公开(公告)号:US20240346318A1
公开(公告)日:2024-10-17
申请号:US18504220
申请日:2023-11-08
Inventor: Hao WU , Cheng Yang
Abstract: A method, apparatus, device, and medium for managing a model based on a distance between samples. In one method, a basic sample for training a contrastive learning model and a plurality of negative samples associated with the basic sample is obtained; a sequence of the plurality of negative samples is generated based on distances between the plurality of negative samples and the basic sample; the sequence of the plurality of negative samples is divided into a first set of negative samples and a second set of negative samples; an update parameter for updating the contrastive learning model is determined based on the basic sample, the first set of negative samples and a first weight of the first set of negative samples, and the second set of negative samples and a second weight of the second set of negative samples, the first weight is greater than the second weight.
-
4.
公开(公告)号:US20240152760A1
公开(公告)日:2024-05-09
申请号:US18472601
申请日:2023-09-22
Inventor: Hao Wu , Quan Cui , Boyan Zhou , Cheng Yang
IPC: G06N3/088
CPC classification number: G06N3/088
Abstract: A method of training and applying contrastive learning model. The method includes obtaining a sample set and label information for training contrastive learning model, the sample set including a plurality of first samples of a first modality and a plurality of second samples of a second modality, the label information indicating a correlation between samples of the plurality of first samples and samples of the plurality of second samples; determining whether sample mixing is to be performed on the first modality or the second modality; in accordance with a determination that sample mixing is to be performed on the first modality, generating at least one first mixed sample of the first modality by mixing at least one pair of first samples among the plurality of first samples; and training the contrastive learning model at least based on the at least one first mixed sample and first mixed label information.
-
公开(公告)号:US20240185578A1
公开(公告)日:2024-06-06
申请号:US18398945
申请日:2023-12-28
Inventor: Quan Cui , Hao Wu , Cheng Yang
IPC: G06V10/774 , G06V10/50 , G06V10/82 , G06V10/98 , G06V20/70
CPC classification number: G06V10/774 , G06V10/50 , G06V10/82 , G06V10/98 , G06V20/70
Abstract: Embodiments of the present disclosure provide a solution for image encoding learning and application. A method for image encoding learning comprises: extracting an image feature representation of a sample image using an image encoder to be trained; extracting a text feature representation of a sample text sequence using a text encoder, the sample text sequency being associated with the sample image; generating, using the text encoder, a predicted text sequence based on the text feature representation and the image feature representation; and training the image encoder at least based on a text error between the predicted text sequence and the sample text sequence.
-
公开(公告)号:US20240144100A1
公开(公告)日:2024-05-02
申请号:US18496769
申请日:2023-10-27
Inventor: Hao Wu , Boyan Zhou , Quan Cui , Cheng Yang
IPC: G06N20/00
CPC classification number: G06N20/00
Abstract: Methods, apparatuses, a device, and a medium for training a contrastive learning model are provided. In a method, a plurality of sample sets for training the contrastive learning model are obtained, and the plurality of sample sets comprises a first sample set and a second sample set. A first target sample set is selected from the first sample set and the second sample set according to a predetermined rule. A first set of samples are determined based on the first target sample set according to a predefined batch size. The contrastive learning model is trained using the first set of samples. In this way, on the one hand, performance degradation of the contrastive learning model due to sample set bias may be avoided; on the other hand, a forgetting problem in the training process may be alleviated.
-
公开(公告)号:US20240144007A1
公开(公告)日:2024-05-02
申请号:US18472605
申请日:2023-09-22
Inventor: Hao Wu , Boyan Zhou , Quan Cui , Cheng Yang
IPC: G06N3/08
CPC classification number: G06N3/08
Abstract: A method of contrastive learning comprises: determining, based on a model construction criterion, a first encoder for a first modality and a second encoder for a second modality; constructing a first contrastive learning model, the first contrastive learning model comprising the first encoder and a third encoder for the second modality, and a model capacity of the third encoder being greater than a model capacity of the second encoder; performing pre-training of the first contrastive learning model based on a first training dataset for the first modality and the second modality; and providing the pre-trained first encoder in the pre-trained first contrastive learning model for a downstream task. Because only the model capacity of one encoder is increased in the pre-training stage, model performance may be improved without increasing model training overhead during downstream task fine-tuning and model running overhead during model application.
-
8.
公开(公告)号:US20240160925A1
公开(公告)日:2024-05-16
申请号:US18472973
申请日:2023-09-22
Inventor: Hao Wu , Yu Guo , Quan Cui , Boyan Zhou , Cheng Yang
IPC: G06N3/08
CPC classification number: G06N3/08
Abstract: There are provided method, apparatus, device, and medium for determining update gradient for contrastive learning model. In the method, a gradient factor of a first type for the contrastive learning model is determined based on a first group of training data and a second group of training data for training the contrastive learning model. The gradient factor of the first type is not used for backpropagation during a training process. In a first stage of the training process, a gradient factor of a second type associated with the first group of training data is determined based on the contrastive learning model. The gradient factor of the second type is used for backpropagation during the training process. Gradient is obtained for updating the contrastive learning model based on the gradient factor of the first type and the gradient factor of the second type associated with the first group of training data.
-
-
-
-
-
-
-