Apparatus for Training Model, Method and Computer Readable Recording Medium Thereof

    Publication number: US20230135163A1

    Publication date: 2023-05-04

    Application number: US17933821

    Filing date: 2022-09-20

    Abstract: Provided is a method for training a model, including generating a plurality of attention maps by inputting training data into a previously trained teacher model, generating a set of attention weights of the teacher model based on the plurality of attention maps, generating a set of attention weights of a student model by inputting the training data into the student model, calculating a value of a first loss function based on the set of attention weights of the teacher model and the set of attention weights of the student model, calculating a value of a second loss function according to an inference of the student model with respect to the training data, and training the student model based on the value of the first loss function and the value of the second loss function.
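
    The two-loss scheme described in the abstract can be illustrated with a minimal sketch. The abstract does not say how the attention maps are aggregated, which distance the first loss uses, what form the second loss takes, or how the two values are combined. The element-wise mean, the mean squared error, the cross-entropy, the weighting factor `alpha`, and all function names below are therefore assumptions chosen for illustration, not the claimed method.

```python
import math


def aggregate_attention(attention_maps):
    """Combine several same-shaped attention maps into one set of weights.

    Assumption: an element-wise mean; the abstract does not name the rule.
    """
    n = len(attention_maps)
    rows = len(attention_maps[0])
    cols = len(attention_maps[0][0])
    return [
        [sum(m[r][c] for m in attention_maps) / n for c in range(cols)]
        for r in range(rows)
    ]


def attention_loss(teacher_weights, student_weights):
    """First loss: distance between teacher and student attention weights.

    Assumption: mean squared error over all entries.
    """
    diffs = [
        (t - s) ** 2
        for t_row, s_row in zip(teacher_weights, student_weights)
        for t, s in zip(t_row, s_row)
    ]
    return sum(diffs) / len(diffs)


def task_loss(student_probs, label):
    """Second loss: computed from the student's inference on the training data.

    Assumption: cross-entropy of the predicted class distribution.
    """
    return -math.log(student_probs[label])


def total_loss(first, second, alpha=0.5):
    """Combine both values; the weighting factor alpha is hypothetical."""
    return alpha * first + second


# Toy values standing in for real model outputs.
teacher_maps = [
    [[1.0, 0.0], [0.0, 1.0]],
    [[0.0, 1.0], [1.0, 0.0]],
]
teacher_weights = aggregate_attention(teacher_maps)
student_weights = [[0.5, 0.5], [0.5, 0.5]]

first = attention_loss(teacher_weights, student_weights)
second = task_loss([0.5, 0.5], label=0)
loss = total_loss(first, second)
```

    In an actual training loop, the combined value would be minimized with respect to the student model's parameters only, with the previously trained teacher model held fixed.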
