Method and apparatus for neural network quantization

Invention Grant

US11321609B2 Method and apparatus for neural network quantization 有权

Please log in to see more content

Patent Title: Method and apparatus for neural network quantization
Application No.: US15433531

Application Date: 2017-02-15
Publication No.: US11321609B2

Publication Date: 2022-05-03
Inventor: Yoo Jin Choi , Mostafa El-Khamy , Jungwon Lee
Applicant: Samsung Electronics Co., Ltd.
Applicant Address: KR Gyeonggi-do
Assignee: Samsung Electronics Co., Ltd.
Current Assignee: Samsung Electronics Co., Ltd.
Current Assignee Address: KR Gyeonggi-do
Agency: The Farrell Law Firm, P.C.
Main IPC: G06N3/08
IPC: G06N3/08 ; G06N3/063 ; G06F17/16

Method and apparatus for neural network quantization

Abstract:

Apparatuses and methods of manufacturing same, systems, and methods for performing network parameter quantization in deep neural networks are described. In one aspect, diagonals of a second-order partial derivative matrix (a Hessian matrix) of a loss function of network parameters of a neural network are determined and then used to weight (Hessian-weighting) the network parameters as part of quantizing the network parameters. In another aspect, the neural network is trained using first and second moment estimates of gradients of the network parameters and then the second moment estimates are used to weight the network parameters as part of quantizing the network parameters. In yet another aspect, network parameter quantization is performed by using an entropy-constrained scalar quantization (ECSQ) iterative algorithm. In yet another aspect, network parameter quantization is performed by quantizing the network parameters of all layers of a deep neural network together at once.

Public/Granted literature

US20180107925A1 METHOD AND APPARATUS FOR NEURAL NETWORK QUANTIZATION Public/Granted day:2018-04-19

Information query

Espacenet

IPC分类:

G	物理
G06	计算；推算或计数
G06N	基于特定计算模型的计算机系统
G06N3/00	基于生物学模型的计算机系统
G06N3/02	.采用神经网络模型
G06N3/08	..学习方法