Invention Publication
- Patent Title: Bit Sparse Neural Network Optimization
- Application No.: US17861824
- Application Date: 2022-07-11
- Publication No.: US20240013052A1
- Publication Date: 2024-01-11
- Inventor: Zhi-Gang Liu, Paul Nicholas Whatmough, John Fremont Brown, III
- Applicant: Arm Limited
- Applicant Address: GB Cambridge
- Assignee: Arm Limited
- Current Assignee: Arm Limited
- Current Assignee Address: GB Cambridge
- Main IPC: G06N3/08
- IPC: G06N3/08

Abstract:
A method, system and apparatus provide bit-sparse neural network optimization. Rather than quantizing and pruning weight and activation elements at the word level, the elements are pruned at the bit level. This reduces the density of effective “set” bits in weight and activation data, which advantageously lowers the power consumption of neural network inference by reducing the degree of bit-level switching.
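
The difference between word-level and bit-level pruning can be illustrated with a small sketch. This is not the method claimed in the publication, whose details are not given in the abstract. It assumes 8-bit integer weights in a sign-magnitude view and a simple hypothetical rule that keeps only the most significant set bits of each magnitude. The names `prune_bits`, `bit_density` and the parameter `max_set_bits` are illustrative assumptions.

```python
def popcount(x: int) -> int:
    """Number of set ("1") bits in a non-negative integer."""
    return bin(x).count("1")


def prune_bits(value: int, max_set_bits: int) -> int:
    """Keep at most `max_set_bits` of the most significant set bits of |value|.

    Hypothetical pruning rule for illustration only; the sign is preserved.
    """
    sign = -1 if value < 0 else 1
    magnitude = abs(value)
    kept = 0
    remaining = max_set_bits
    # Walk from the most significant bit down, copying set bits until the budget is spent.
    for bit in range(magnitude.bit_length() - 1, -1, -1):
        if remaining == 0:
            break
        if magnitude & (1 << bit):
            kept |= 1 << bit
            remaining -= 1
    return sign * kept


def bit_density(values, word_bits: int = 8) -> float:
    """Fraction of set magnitude bits across all words (assumed 8-bit words)."""
    total_set = sum(popcount(abs(v)) for v in values)
    return total_set / (len(values) * word_bits)


# Example weights (made up for illustration, not taken from the publication).
weights = [127, -85, 6, 0, 51, -100]
pruned = [prune_bits(w, max_set_bits=2) for w in weights]

print("original:", weights, "density:", bit_density(weights))
print("pruned:  ", pruned, "density:", bit_density(pruned))
```

Under this rule every weight stays non-zero unless it was already zero, so word-level sparsity is unchanged, while the share of set bits drops. Fewer set bits is the property the abstract associates with reduced bit-level switching during inference.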