Invention Application
- Patent Title: ADAPTIVE HIGH-PRECISION COMPRESSION METHOD AND SYSTEM BASED ON CONVOLUTIONAL NEURAL NETWORK MODEL
-
Application No.: US17448934Application Date: 2021-09-27
-
Publication No.: US20220351043A1Publication Date: 2022-11-03
- Inventor: Yongduan Song , Feng Yang , Rui Li , Shengtao Pan , Siyu Li , Yiwen Zhang , Jian Zhang , Zhengtao Yu , Shichun Wang
- Applicant: Chongqing University , University of Electronic Science and Technology of China , Dibi (Chongqing) Intelligent Technology Research Institute Co., Ltd. , Star Institute of Intelligent Systems
- Applicant Address: CN Chongqing; CN Chengdu City; CN Chongqing; CN Chongqing
- Assignee: Chongqing University,University of Electronic Science and Technology of China,Dibi (Chongqing) Intelligent Technology Research Institute Co., Ltd.,Star Institute of Intelligent Systems
- Current Assignee: Chongqing University,University of Electronic Science and Technology of China,Dibi (Chongqing) Intelligent Technology Research Institute Co., Ltd.,Star Institute of Intelligent Systems
- Current Assignee Address: CN Chongqing; CN Chengdu City; CN Chongqing; CN Chongqing
- Priority: CN202110482445.8 20210430
- Main IPC: G06N3/08
- IPC: G06N3/08 ; G06N3/04

Abstract:
The present disclosure discloses an adaptive high-precision compression method and system based on a convolutional neural network model, and belongs to the fields of artificial intelligence, computer vision, and image processing. According to the method of the present disclosure, coarse-grained pruning is performed on a neural network model by using a differential evolution algorithm first, and the coarse-grained space is quickly searched through an entropy importance criterion and an objective function with good guidance to obtain a near-optimal neural network structure. Then fine-grained search space is built on the basis of an optimal individual obtained from the coarse-grained search, and fine-grained pruning is performed on the neural network model by a differential evolution algorithm to obtain a network model with an optimal structure. Finally, the performance of the optimal model is restored by using a multi-teacher multi-step knowledge distillation network to reach the precision of an original model.
Information query