METHOD, ELECTRONIC DEVICE, AND STORAGE MEDIUM FOR DISTILLING MODEL

    Publication No.: US20210383233A1

    Publication Date: 2021-12-09

    Application No.: US17101748

    Application Date: 2020-11-23

    Abstract: The disclosure provides a method for distilling a model, an electronic device, and a storage medium, and relates to the field of deep learning technologies. A teacher model and a student model are obtained. Based on a first data processing capacity of a first intermediate fully connected layer of the teacher model and a second data processing capacity of a second intermediate fully connected layer of the student model, the second intermediate fully connected layer is transformed into an enlarged fully connected layer and a reduced fully connected layer. The second intermediate fully connected layer is then replaced with the enlarged fully connected layer and the reduced fully connected layer to generate a training student model. Finally, the training student model is distilled based on the teacher model.
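
The layer replacement described in the abstract can be illustrated with a minimal, dependency-free sketch. It rests on assumptions the abstract does not state: "data processing capacity" is read here as layer width, the enlarged layer is given the teacher's width so that its output can be compared directly with the teacher's intermediate output, and mean squared error is used as a stand-in distillation loss. All names and sizes (`split_student_layer`, `TEACHER_WIDTH`, and so on) are hypothetical.

```python
import random


def make_linear(in_dim, out_dim, rng):
    """Create a fully connected layer as (weight matrix out_dim x in_dim, bias)."""
    weight = [[rng.uniform(-0.1, 0.1) for _ in range(in_dim)] for _ in range(out_dim)]
    bias = [0.0] * out_dim
    return weight, bias


def linear(layer, x):
    """Apply a fully connected layer to an input vector x."""
    weight, bias = layer
    return [sum(w * v for w, v in zip(row, x)) + b for row, b in zip(weight, bias)]


def split_student_layer(student_in, student_out, teacher_width, rng):
    """Build the two layers that replace the student's intermediate layer.

    Assumption: the enlarged layer maps up to the teacher's width and the
    reduced layer maps back down to the student's original output width.
    """
    enlarged = make_linear(student_in, teacher_width, rng)
    reduced = make_linear(teacher_width, student_out, rng)
    return enlarged, reduced


def mse(a, b):
    """Mean squared error between two equal-length vectors (assumed loss)."""
    return sum((p - q) ** 2 for p, q in zip(a, b)) / len(a)


rng = random.Random(0)
STUDENT_IN, STUDENT_OUT, TEACHER_WIDTH = 4, 4, 8  # hypothetical sizes

enlarged, reduced = split_student_layer(STUDENT_IN, STUDENT_OUT, TEACHER_WIDTH, rng)

x = [rng.uniform(-1.0, 1.0) for _ in range(STUDENT_IN)]
hidden = linear(enlarged, x)          # now has the teacher's width
student_out = linear(reduced, hidden)  # back to the student's width

# Stand-in for the teacher's intermediate output on the same input.
teacher_hidden = [rng.uniform(-1.0, 1.0) for _ in range(TEACHER_WIDTH)]
loss = mse(hidden, teacher_hidden)

print(len(hidden), len(student_out))
```

In this sketch the rest of the student model is unaffected, because the reduced layer restores the original output width; only the intermediate representation is widened so it can be matched against the teacher during training.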
