METHOD FOR CREATING MULTIMODAL TRAINING DATASETS FOR PREDICTING USER CHARACTERISTICS USING PSEUDO-LABELING
Abstract:
There is provided a method for creating multimodal training datasets for predicting characteristics of a user by using pseudo-labeling. According to an embodiment, the method may acquire a labelled dataset in which an image of a user is labelled with personality information and may extract a multimodal feature vector from the image of the acquired labelled dataset, may acquire an un-labelled dataset in which an image of a user is not labelled with personality information and may extract a multimodal feature vector from the image of the acquired un-labelled dataset, may measure a similarity between the extracted multimodal feature vector of the labelled dataset and the multimodal feature vector of the un-labelled dataset, and may label the un-labelled dataset based on the measured similarity. Accordingly, by creating multimodal training datasets for predicting a user personality by using pseudo-labeling, training datasets may be obtained rapidly, economically and effectively.
Information query
Patent Agency Ranking
0/0