Invention Grant
- Patent Title: Deep convolutional neural network acceleration and compression method based on parameter quantification
-
Application No.: US15753520Application Date: 2015-08-21
-
Publication No.: US10970617B2Publication Date: 2021-04-06
- Inventor: Jian Cheng , Jiaxiang Wu , Cong Leng , Hanqing Lu
- Applicant: INSTITUTE OF AUTOMATION, CHINESE ACADEMY OF SCIENCES
- Applicant Address: CN Beijing
- Assignee: INSTITUTE OF AUTOMATION, CHINESE ACADEMY OF SCIENCES
- Current Assignee: INSTITUTE OF AUTOMATION, CHINESE ACADEMY OF SCIENCES
- Current Assignee Address: CN Beijing
- Agency: Maier & Maier, PLLC
- International Application: PCT/CN2015/087792 WO 20150821
- International Announcement: WO2017/031630 WO 20170302
- Main IPC: G06N3/04
- IPC: G06N3/04 ; G06K9/62 ; G06K9/46 ; G06N20/00 ; G06F17/16 ; G06N3/08

Abstract:
An acceleration and compression method for a deep convolutional neural network based on quantization of a parameter provided by the present application comprises: quantizing the parameter of the deep convolutional neural network to obtain a plurality of subcode books and respective corresponding index values of the plurality of subcode books; acquiring an output feature map of the deep convolutional neural network according to the plurality of subcode books and respective corresponding index values of the plurality of subcode books. The present application may implement the acceleration and compression for a deep convolutional neural network.
Public/Granted literature
- US20180247180A1 DEEP CONVOLUTIONAL NEURAL NETWORK ACCELERATION AND COMPRESSION METHOD BASED ON PARAMETER QUANTIFICATION Public/Granted day:2018-08-30
Information query