Invention Application
- Patent Title: NEURAL NETWORK-BASED INFERENCE METHOD AND APPARATUS
-
Application No.: US17343001Application Date: 2021-06-09
-
Publication No.: US20220261649A1Publication Date: 2022-08-18
- Inventor: Chang GAO , Shih-Chii LIU , Tobi DELBRUCK , Xi CHEN
- Applicant: Samsung Electronics Co., Ltd. , University of Zurich
- Applicant Address: KR Suwon-si; CH Zurich
- Assignee: Samsung Electronics Co., Ltd.,University of Zurich
- Current Assignee: Samsung Electronics Co., Ltd.,University of Zurich
- Current Assignee Address: KR Suwon-si; CH Zurich
- Priority: KR10-2021-0019755 20210215
- Main IPC: G06N3/08
- IPC: G06N3/08 ; G06N3/04

Abstract:
Disclosed is a neural network-based inference method and apparatus. The neural network-based inference method includes compressing a matrix comprising processing elements corresponding to an operation of a neural network, balancing workloads related to the operation by reordering the compressed matrix based on the workloads, and performing inference based on the reordered matrix.
Public/Granted literature
- US12299576B2 Neural network-based inference method and apparatus Public/Granted day:2025-05-13
Information query