Compression of deep neural networks

Invention Grant

US11966837B2 Compression of deep neural networks 有权

Please log in to see more content

Patent Title: Compression of deep neural networks
Application No.: US16351712

Application Date: 2019-03-13
Publication No.: US11966837B2

Publication Date: 2024-04-23
Inventor: Dzung Phan , Lam Nguyen , Nam H. Nguyen , Jayant R. Kalagnanam
Applicant: International Business Machines Corporation
Applicant Address: US NY Armonk
Assignee: International Business Machines Corporation
Current Assignee: International Business Machines Corporation
Current Assignee Address: US NY Armonk
Agent Stephanie L. Carusillo
Main IPC: G06N3/08
IPC: G06N3/08 ; G06N3/047 ; H03M7/30

Abstract:

In an approach for compressing a neural network, a processor receives a neural network, wherein the neural network has been trained on a set of training data. A processor receives a compression ratio. A processor compresses the neural network based on the compression ratio using an optimization model to solve for sparse weights. A processor re-trains the compressed neural network with the sparse weights. A processor outputs the re-trained neural network.

Public/Granted literature

US20200293876A1 COMPRESSION OF DEEP NEURAL NETWORKS Public/Granted day:2020-09-17

Information query

Espacenet

IPC分类:

G	物理
G06	计算；推算或计数
G06N	基于特定计算模型的计算机系统
G06N3/00	基于生物学模型的计算机系统
G06N3/02	.采用神经网络模型
G06N3/08	..学习方法