Invention Grant
- Patent Title: Method and apparatus for neural network quantization
-
Application No.: US15433531Application Date: 2017-02-15
-
Publication No.: US11321609B2Publication Date: 2022-05-03
- Inventor: Yoo Jin Choi , Mostafa El-Khamy , Jungwon Lee
- Applicant: Samsung Electronics Co., Ltd.
- Applicant Address: KR Gyeonggi-do
- Assignee: Samsung Electronics Co., Ltd.
- Current Assignee: Samsung Electronics Co., Ltd.
- Current Assignee Address: KR Gyeonggi-do
- Agency: The Farrell Law Firm, P.C.
- Main IPC: G06N3/08
- IPC: G06N3/08 ; G06N3/063 ; G06F17/16

Abstract:
Apparatuses and methods of manufacturing same, systems, and methods for performing network parameter quantization in deep neural networks are described. In one aspect, diagonals of a second-order partial derivative matrix (a Hessian matrix) of a loss function of network parameters of a neural network are determined and then used to weight (Hessian-weighting) the network parameters as part of quantizing the network parameters. In another aspect, the neural network is trained using first and second moment estimates of gradients of the network parameters and then the second moment estimates are used to weight the network parameters as part of quantizing the network parameters. In yet another aspect, network parameter quantization is performed by using an entropy-constrained scalar quantization (ECSQ) iterative algorithm. In yet another aspect, network parameter quantization is performed by quantizing the network parameters of all layers of a deep neural network together at once.
Public/Granted literature
- US20180107925A1 METHOD AND APPARATUS FOR NEURAL NETWORK QUANTIZATION Public/Granted day:2018-04-19
Information query