METHOD AND APPARATUS WITH NEURAL NETWORK PRUNING

    公开(公告)号:US20220114453A1

    公开(公告)日:2022-04-14

    申请号:US17236503

    申请日:2021-04-21

    Abstract: A neural network pruning method includes: acquiring a first task accuracy of an inference task processed by a pretrained neural network; pruning, based on a channel unit, the neural network by adjusting weights between nodes of channels based on a preset learning weight and based on a channel-by-channel pruning parameter corresponding to a channel of each of a plurality of layers of the pretrained neural network; updating the learning weight based on the first task accuracy and a task accuracy of the pruned neural network; updating the channel-by-channel pruning parameter based on the updated learning weight and the task accuracy of the pruned neural network; and repruning, based on the channel unit, the pruned neural network based on the updated learning weight and based on the updated channel-by-channel pruning parameter.

Patent Agency Ranking