Apparatus for Training Model, Method and Computer Readable Recording Medium Thereof

    公开(公告)号:US20230135163A1

    公开(公告)日:2023-05-04

    申请号:US17933821

    申请日:2022-09-20

    Abstract: Provided is a method for training a model, including generating a plurality of attention maps by inputting training data into a previously trained teacher model, generating a set of attention weights of the teacher model based on the plurality of attention maps, generating a set of attention weights of a student model by inputting the training data into the student model, calculating a value of a first loss function based on the set of attention weights of the teacher model and the set of attention weights of the student model, calculating a value of a second loss function according to an inference of the student model with respect to the training data, and training the student model based on the value of the first loss function and the value of the second loss function.

    Dialogue Model Training Method and Device Therefor

    公开(公告)号:US20230080930A1

    公开(公告)日:2023-03-16

    申请号:US17807653

    申请日:2022-06-17

    Abstract: Disclosed is a method of training a dialogue model in an electronic device, the method including selecting a first context from a first dialogue data set including at least one pair of a context and a response corresponding to the context, generating a first response corresponding to the first context through a first dialogue model, generating an augmented dialogue dataset by incorporating a pair of the first context and the first response corresponding to the first context into the first dialogue data set, and training a second dialogue model based on the augmented dialogue dataset.

Patent Agency Ranking