EXCEEDING THE LIMITS OF VISUAL-LINGUISTIC MULTI-TASK LEARNING
Abstract:
Methods, computer readable media, and devices for exceeding the limits of visual-linguistic multi-task learning are disclosed. One method may include identifying a multi-modal multi-task classification dataset including a plurality of data examples, creating a transformer machine learning model to predict a plurality of categorical attributes of a product, and training the transformer machine learning model based on the multi-modal multi-task classification dataset using an alpha decay schedule and dynamically allocating task-specific parameters for at least one of the plurality of task-specific classification heads based on task complexity.
Public/Granted literature
Information query
Patent Agency Ranking
0/0