Invention Application
- Patent Title: NEURAL NETWORK OPERATION APPARATUS AND QUANTIZATION METHOD
- Application No.: US17368470
- Application Date: 2021-07-06
- Publication No.: US20220284262A1
- Publication Date: 2022-09-08
- Inventor: Sehwan LEE, Hyeonuk SIM, Jongeun LEE
- Applicant: SAMSUNG ELECTRONICS CO., LTD., UNIST (ULSAN NATIONAL INSTITUTE OF SCIENCE AND TECHNOLOGY)
- Applicant Address: KR Suwon-si; KR Ulsan
- Assignee: SAMSUNG ELECTRONICS CO., LTD., UNIST (ULSAN NATIONAL INSTITUTE OF SCIENCE AND TECHNOLOGY)
- Current Assignee: SAMSUNG ELECTRONICS CO., LTD., UNIST (ULSAN NATIONAL INSTITUTE OF SCIENCE AND TECHNOLOGY)
- Current Assignee Address: KR Suwon-si; KR Ulsan
- Priority: KR10-2021-0028636 20210304, KR10-2021-0031354 20210310
- Main IPC: G06N3/04
- IPC: G06N3/04

Abstract:
A neural network operation apparatus and method implementing quantization are disclosed. The neural network operation method may include receiving a weight of a neural network, a candidate set of quantization points, and a bitwidth for representing the weight; extracting a subset of quantization points from the candidate set of quantization points based on the bitwidth; calculating a quantization loss based on the weight of the neural network and the subset of quantization points; and generating a target subset of quantization points based on the quantization loss.
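
The four steps in the abstract can be illustrated with a minimal Python sketch. It is not the claimed implementation: the abstract does not say how subsets are extracted, how the loss is defined, or how the target subset is derived from it. The sketch assumes that a bitwidth of b allows 2^b quantization points, that every subset of that size is enumerated exhaustively, that the loss is the mean squared error between each weight and its nearest point, and that the target subset is the one with the lowest loss. The names `quantize`, `quantization_loss`, and `select_target_subset` are hypothetical.

```python
from itertools import combinations


def quantize(weights, points):
    """Map each weight to its nearest quantization point."""
    return [min(points, key=lambda p: abs(w - p)) for w in weights]


def quantization_loss(weights, points):
    """Mean squared error between weights and their quantized values.

    Assumption: the abstract does not define the loss; MSE is used here.
    """
    quantized = quantize(weights, points)
    return sum((w - q) ** 2 for w, q in zip(weights, quantized)) / len(weights)


def select_target_subset(weights, candidates, bitwidth):
    """Return the (subset, loss) pair with the lowest quantization loss.

    Assumption: a bitwidth of b permits 2**b points, and all subsets of
    that size are searched exhaustively.
    """
    subset_size = 2 ** bitwidth
    if subset_size > len(candidates):
        raise ValueError("candidate set is smaller than 2**bitwidth")

    best_subset, best_loss = None, float("inf")
    for subset in combinations(sorted(candidates), subset_size):
        loss = quantization_loss(weights, subset)
        if loss < best_loss:
            best_subset, best_loss = subset, loss
    return best_subset, best_loss


if __name__ == "__main__":
    weights = [-0.9, -0.1, 0.05, 0.4, 0.95]
    candidates = [-1.0, -0.5, 0.0, 0.5, 1.0]
    subset, loss = select_target_subset(weights, candidates, bitwidth=2)
    print(subset, loss)
```

Exhaustive enumeration grows combinatorially with the size of the candidate set, so it is practical only for small sets; it is used here purely to make the steps concrete.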