Method and apparatus for universal pruning and compression of deep convolutional neural networks under joint sparsity constraints

Invention Grant

US11423312B2 Method and apparatus for universal pruning and compression of deep convolutional neural networks under joint sparsity constraints (In force)

Patent Title: Method and apparatus for universal pruning and compression of deep convolutional neural networks under joint sparsity constraints
Application No.: US16141035

Application Date: 2018-09-25
Publication No.: US11423312B2

Publication Date: 2022-08-23
Inventor: Yoo Jin Choi, Mostafa El-Khamy, Jungwon Lee
Applicant: Samsung Electronics Co., Ltd.
Applicant Address: KR Gyeonggi-do
Assignee: Samsung Electronics Co., Ltd.
Current Assignee: Samsung Electronics Co., Ltd.
Current Assignee Address: KR Gyeonggi-do
Agency: The Farrell Law Firm, P.C.
Main IPC: G06N3/08
IPC: G06N3/08; G06F17/15; G06F7/02; G06N3/04

Abstract:

A method and system for constructing a convolutional neural network (CNN) model are herein disclosed. The method includes regularizing spatial domain weights, providing quantization of the spatial domain weights, pruning small or zero weights in a spatial domain, fine-tuning a quantization codebook, compressing a quantization output from the quantization codebook, and decompressing the spatial domain weights and using either sparse spatial domain convolution or sparse Winograd convolution after pruning Winograd-domain weights.
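
Illustrative sketch (not part of the patent record): the prune, quantize, compress and decompress steps named in the abstract can be pictured with a toy example. Everything here is an assumption made for illustration only: the function names, the magnitude threshold, the uniform quantization step, and the use of zlib as a stand-in for the compressor, which the abstract does not specify. Regularization, codebook fine-tuning and the Winograd-domain steps are omitted.

```python
import struct
import zlib


def prune(weights, threshold):
    """Set weights whose magnitude is below `threshold` to exactly zero."""
    return [0.0 if abs(w) < threshold else w for w in weights]


def quantize(weights, step):
    """Uniform quantization (an assumption): map each weight to an integer bin index."""
    return [round(w / step) for w in weights]


def compress(indices):
    """Pack the indices as signed 8-bit integers and deflate them with zlib."""
    return zlib.compress(struct.pack(f"{len(indices)}b", *indices))


def decompress(blob, step):
    """Inflate the blob and map each bin index back to its reconstruction value."""
    raw = zlib.decompress(blob)
    indices = struct.unpack(f"{len(raw)}b", raw)
    return [i * step for i in indices]


# Hypothetical spatial-domain weights of a single small filter.
weights = [0.02, -0.51, 0.0, 0.26, -0.01, 0.74, 0.03, -0.24]

pruned = prune(weights, threshold=0.05)
indices = quantize(pruned, step=0.25)
blob = compress(indices)
restored = decompress(blob, step=0.25)

print(indices)
print(restored)
```

In this toy setting the restored weights are sparse (half of them are exactly zero), which is the property a sparse convolution routine would exploit after decompression.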

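IPC Classification:

G	Physics
G06	Computing; calculating or counting
G06N	Computer systems based on specific computational models
G06N3/00	Computer systems based on biological models
G06N3/02	.Using neural network models
G06N3/08	..Learning methods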