Invention Grant
- Patent Title: System and method for supervised contrastive learning for multi-modal tasks
-
Application No.: US17589535Application Date: 2022-01-31
-
Publication No.: US12183062B2Publication Date: 2024-12-31
- Inventor: Changsheng Zhao , Burak Uzkent , Yilin Shen , Hongxia Jin
- Applicant: Samsung Electronics Co., Ltd.
- Applicant Address: KR Suwon-si
- Assignee: Samsung Electronics Co., Ltd.
- Current Assignee: Samsung Electronics Co., Ltd.
- Current Assignee Address: KR Suwon-si
- Main IPC: G06V10/80
- IPC: G06V10/80 ; G06F40/279 ; G06V10/774 ; G06V10/778

Abstract:
A method includes obtaining a batch of training data including multiple paired image-text pairs and multiple unpaired image-text pairs, where each paired image-text pair and each unpaired image-text pair includes an image and a text. The method also includes training a machine learning model using the training data based on an optimization of a combination of losses. The losses include, for each paired image-text pair, (i) a first multi-modal representation loss based on the paired image-text pair and (ii) a second multi-modal representation loss based on two or more unpaired image-text pairs, selected from among the multiple unpaired image-text pairs, wherein each of the two or more unpaired image-text pairs includes either the image or the text of the paired image-text pair.
Public/Granted literature
- US20230245435A1 SYSTEM AND METHOD FOR SUPERVISED CONTRASTIVE LEARNING FOR MULTI-MODAL TASKS Public/Granted day:2023-08-03
Information query