Non-uniform quantization of pre-trained deep neural network

Invention Grant

US11348009B2 Non-uniform quantization of pre-trained deep neural network 有权

Please log in to see more content

Patent Title: Non-uniform quantization of pre-trained deep neural network
Application No.: US16181326

Application Date: 2018-11-05
Publication No.: US11348009B2

Publication Date: 2022-05-31
Inventor: Hui Chen , Ilia Ovsiannikov
Applicant: Samsung Electronics Co., Ltd.
Applicant Address: KR Suwon-si
Assignee: Samsung Electronics Co., Ltd.
Current Assignee: Samsung Electronics Co., Ltd.
Current Assignee Address: KR Suwon-si
Agency: Renaissance IP Law Group LLP
Main IPC: G06N3/08
IPC: G06N3/08 ; G06N3/04

Non-uniform quantization of pre-trained deep neural network

Abstract:

A system and a method of quantizing a pre-trained neural network, includes determining by a layer/channel bit-width determiner for each layer or channel of the pre-trained neural network a minimum quantization noise for the layer or the channel for each master bit-width value in a predetermined set of master bit-width values; and selecting by a bit-width selector for the layer or the channel the master bit-width value having the minimum quantization noise for the layer or the channel. In one embodiment, the minimum quantization noise for the layer or the channel is based on a square of a range of weights for the layer or the channel that is multiplied by a constant to a negative power of a current master bit-width value.

Public/Granted literature

US20200097823A1 NON-UNIFORM QUANTIZATION OF PRE-TRAINED DEEP NEURAL NETWORK Public/Granted day:2020-03-26

Information query

Espacenet

IPC分类:

G	物理
G06	计算；推算或计数
G06N	基于特定计算模型的计算机系统
G06N3/00	基于生物学模型的计算机系统
G06N3/02	.采用神经网络模型
G06N3/08	..学习方法