Apparatus for Training Model, Method and Computer Readable Recording Medium Thereof

    公开(公告)号:US20230135163A1

    公开(公告)日:2023-05-04

    申请号:US17933821

    申请日:2022-09-20

    Abstract: Provided is a method for training a model, including generating a plurality of attention maps by inputting training data into a previously trained teacher model, generating a set of attention weights of the teacher model based on the plurality of attention maps, generating a set of attention weights of a student model by inputting the training data into the student model, calculating a value of a first loss function based on the set of attention weights of the teacher model and the set of attention weights of the student model, calculating a value of a second loss function according to an inference of the student model with respect to the training data, and training the student model based on the value of the first loss function and the value of the second loss function.

    Processor for accelerating convolutional operation in convolutional neural network and operating method thereof

    公开(公告)号:US11443134B2

    公开(公告)日:2022-09-13

    申请号:US17004733

    申请日:2020-08-27

    Abstract: A method of performing a convolutional operation in a convolutional neural network includes: obtaining input activation data quantized with a first bit from an input image; obtaining weight data quantized with a second bit representing a value of a parameter learned through the convolutional neural network; binarizing each of the input activation data and the weight data to obtain a binarization input activation vector and a binarization weight vector; performing an inner operation of the input activation data and weight data based on a binary operation with respect to the binarization input activation vector and the binarization weight vector and distance vectors having the same length as each of the first bit and the second bit, respectively; and storing a result obtained by the inner operation as output activation data.

    PROCESSOR FOR ACCELERATING CONVOLUTIONAL OPERATION IN CONVOLUTIONAL NEURAL NETWORK AND OPERATING METHOD THEREOF

    公开(公告)号:US20210064920A1

    公开(公告)日:2021-03-04

    申请号:US17004733

    申请日:2020-08-27

    Abstract: A method of performing a convolutional operation in a convolutional neural network includes: obtaining input activation data quantized with a first bit from an input image; obtaining weight data quantized with a second bit representing a value of a parameter learned through the convolutional neural network; binarizing each of the input activation data and the weight data to obtain a binarization input activation vector and a binarization weight vector; performing an inner operation of the input activation data and weight data based on a binary operation with respect to the binarization input activation vector and the binarization weight vector and distance vectors having the same length as each of the first bit and the second bit, respectively; and storing a result obtained by the inner operation as output activation data.

Patent Agency Ranking