Bit Sparse Neural Network Optimization

Invention Publication

US20240013052A1 Bit Sparse Neural Network Optimization Under Examination - Published

Patent Title: Bit Sparse Neural Network Optimization
Application No.: US17861824

Application Date: 2022-07-11
Publication No.: US20240013052A1

Publication Date: 2024-01-11
Inventor: Zhi-Gang Liu, Paul Nicholas Whatmough, John Fremont Brown, III
Applicant: Arm Limited
Applicant Address: GB Cambridge
Assignee: Arm Limited
Current Assignee: Arm Limited
Current Assignee Address: GB Cambridge
Main IPC: G06N3/08
IPC: G06N3/08

Abstract:

A method, system and apparatus provide bit-sparse neural network optimization. Rather than quantizing and pruning weight and activation elements at the word level, weight and activation elements are pruned at the bit level. This reduces the density of effective “set” bits in weight and activation data, which advantageously reduces the power consumption of the neural network inference process by reducing the degree of bit-level switching during inference.
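
To make the idea of bit-level pruning concrete, the following is a minimal sketch, not the method claimed in the application. It assumes integer-quantized values whose magnitude bits are the "set" bits of interest, and it uses a simple illustrative heuristic (keep only the k most significant set bits of each value). The function names `popcount`, `prune_to_k_bits` and `bit_density`, the parameter `k`, and the 8-bit width are all hypothetical and do not come from the abstract.

```python
def popcount(x: int) -> int:
    """Number of set (1) bits in a non-negative integer."""
    return bin(x).count("1")


def prune_to_k_bits(value: int, k: int) -> int:
    """Keep at most k of the most significant set bits of |value|.

    Illustrative heuristic only (an assumption, not taken from the source):
    lower-order set bits are cleared and the sign is preserved.
    """
    sign = -1 if value < 0 else 1
    magnitude = abs(value)
    kept = 0
    remaining = k
    # Walk from the most significant bit down to bit 0.
    for bit in range(magnitude.bit_length() - 1, -1, -1):
        if remaining == 0:
            break
        if magnitude & (1 << bit):
            kept |= 1 << bit
            remaining -= 1
    return sign * kept


def bit_density(values, width: int = 8) -> float:
    """Fraction of set magnitude bits across values, for an assumed bit width."""
    total_set = sum(popcount(abs(v)) for v in values)
    return total_set / (len(values) * width)


weights = [109, 7, 0, 255]                       # example 8-bit magnitudes
pruned = [prune_to_k_bits(w, 2) for w in weights]
print(pruned, bit_density(weights), bit_density(pruned))
```

In this sketch no value is removed outright, as word-level pruning would do. Each value instead keeps a coarse approximation with fewer set bits, so the overall density of set bits drops.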

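IPC Classification:

G	Physics
G06	Computing; calculating or counting
G06N	Computing arrangements based on specific computational models
G06N3/00	Computing arrangements based on biological models
G06N3/02	.Neural networks
G06N3/08	..Learning methods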