-
公开(公告)号:US20220114453A1
公开(公告)日:2022-04-14
申请号:US17236503
申请日:2021-04-21
Applicant: SAMSUNG ELECTRONICS CO., LTD.
Inventor: Won-Jo LEE , Youngmin OH , Minkyoung CHO
Abstract: A neural network pruning method includes: acquiring a first task accuracy of an inference task processed by a pretrained neural network; pruning, based on a channel unit, the neural network by adjusting weights between nodes of channels based on a preset learning weight and based on a channel-by-channel pruning parameter corresponding to a channel of each of a plurality of layers of the pretrained neural network; updating the learning weight based on the first task accuracy and a task accuracy of the pruned neural network; updating the channel-by-channel pruning parameter based on the updated learning weight and the task accuracy of the pruned neural network; and repruning, based on the channel unit, the pruned neural network based on the updated learning weight and based on the updated channel-by-channel pruning parameter.