Deep convolutional neural network acceleration and compression method based on parameter quantification

Invention Grant

US10970617B2 Deep convolutional neural network acceleration and compression method based on parameter quantification (in force)

Patent Title: Deep convolutional neural network acceleration and compression method based on parameter quantification
Application No.: US15753520

Application Date: 2015-08-21
Publication No.: US10970617B2

Publication Date: 2021-04-06
Inventor: Jian Cheng, Jiaxiang Wu, Cong Leng, Hanqing Lu
Applicant: INSTITUTE OF AUTOMATION, CHINESE ACADEMY OF SCIENCES
Applicant Address: CN Beijing
Assignee: INSTITUTE OF AUTOMATION, CHINESE ACADEMY OF SCIENCES
Current Assignee: INSTITUTE OF AUTOMATION, CHINESE ACADEMY OF SCIENCES
Current Assignee Address: CN Beijing
Agency: Maier & Maier, PLLC
International Application: PCT/CN2015/087792 WO 20150821
International Announcement: WO2017/031630 WO 20170302
Main IPC: G06N3/04
IPC: G06N3/04; G06K9/62; G06K9/46; G06N20/00; G06F17/16; G06N3/08

Abstract:

The present application provides an acceleration and compression method for a deep convolutional neural network based on parameter quantization. The method comprises: quantizing the parameters of the deep convolutional neural network to obtain a plurality of sub-codebooks and the index values corresponding to each of the sub-codebooks; and obtaining an output feature map of the deep convolutional neural network from the plurality of sub-codebooks and their corresponding index values. The method thereby both accelerates and compresses the deep convolutional neural network.
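
The two steps named in the abstract can be illustrated with a minimal sketch for a single fully connected layer. It is not the claimed implementation: it assumes the quantization is product-quantization-style (the input dimension is split into equal subspaces and each subspace is clustered with plain k-means), and the function names `kmeans`, `quantize_weights`, `forward_quantized`, and `reconstruct`, along with all sizes, are hypothetical choices made for illustration.

```python
import numpy as np


def kmeans(points, k, iters=20, seed=0):
    """Plain Lloyd's k-means. Returns (centroids, assignments)."""
    rng = np.random.default_rng(seed)
    # Initialise centroids from k distinct data points.
    centroids = points[rng.choice(len(points), size=k, replace=False)].copy()
    for _ in range(iters):
        dists = ((points[:, None, :] - centroids[None, :, :]) ** 2).sum(axis=2)
        assign = dists.argmin(axis=1)
        for c in range(k):
            members = points[assign == c]
            if len(members):  # keep the old centroid if a cluster is empty
                centroids[c] = members.mean(axis=0)
    dists = ((points[:, None, :] - centroids[None, :, :]) ** 2).sum(axis=2)
    return centroids, dists.argmin(axis=1)


def quantize_weights(W, num_subspaces, num_codewords):
    """Step 1 (assumed form): W of shape (in_dim, out_dim) becomes
    sub-codebooks of shape (M, K, d) and index values of shape (M, out_dim)."""
    in_dim, out_dim = W.shape
    assert in_dim % num_subspaces == 0
    d = in_dim // num_subspaces
    codebooks = np.empty((num_subspaces, num_codewords, d))
    indices = np.empty((num_subspaces, out_dim), dtype=np.int64)
    for m in range(num_subspaces):
        block = W[m * d:(m + 1) * d, :].T  # one d-dim sub-vector per output unit
        codebooks[m], indices[m] = kmeans(block, num_codewords)
    return codebooks, indices


def forward_quantized(x, codebooks, indices):
    """Step 2 (assumed form): compute the layer output from the sub-codebooks
    and index values only, via a table of input/codeword inner products."""
    M, K, d = codebooks.shape
    x_sub = x.reshape(M, d)
    table = np.einsum("mkd,md->mk", codebooks, x_sub)  # (M, K) inner products
    # Each output sums one table entry per subspace, selected by its index.
    return table[np.arange(M)[:, None], indices].sum(axis=0)


def reconstruct(codebooks, indices):
    """Rebuild the approximate dense weight matrix (for checking only)."""
    M = codebooks.shape[0]
    return np.concatenate([codebooks[m][indices[m]].T for m in range(M)], axis=0)


if __name__ == "__main__":
    rng = np.random.default_rng(1)
    W = rng.standard_normal((8, 16))  # toy layer: 8 inputs, 16 outputs
    x = rng.standard_normal(8)
    codebooks, indices = quantize_weights(W, num_subspaces=4, num_codewords=4)
    y_fast = forward_quantized(x, codebooks, indices)
    y_dense = x @ reconstruct(codebooks, indices)
    print("lookup result matches dense product on quantized weights:",
          np.allclose(y_fast, y_dense))
```

Under these assumptions, compression comes from storing only the small sub-codebooks and integer indices instead of the full weight matrix, and acceleration comes from computing M × K inner products once per input and then replacing each output's multiply-accumulate chain with M table lookups and additions. The output equals the dense product with the quantized weights, so the only error relative to the original layer is the quantization error itself.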

Public/Granted literature

US20180247180A1 DEEP CONVOLUTIONAL NEURAL NETWORK ACCELERATION AND COMPRESSION METHOD BASED ON PARAMETER QUANTIFICATION Public/Granted day: 2018-08-30

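IPC classification:

G	Physics
G06	Computing; calculating or counting
G06N	Computer systems based on specific computational models
G06N3/00	Computer systems based on biological models
G06N3/02	. using neural network models
G06N3/04	.. Architectures, e.g. interconnection topology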