Invention Application
- Patent Title: METHOD AND APPARATUS WITH NEURAL NETWORK INFERENCE OPTIMIZATION IMPLEMENTATION
-
Application No.: US17244006Application Date: 2021-04-29
-
Publication No.: US20220083838A1Publication Date: 2022-03-17
- Inventor: UISEOK SONG , SANGGYU SHIN
- Applicant: Samsung Electronics Co., Ltd.
- Applicant Address: KR Suwon-si
- Assignee: Samsung Electronics Co., Ltd.
- Current Assignee: Samsung Electronics Co., Ltd.
- Current Assignee Address: KR Suwon-si
- Priority: KR10-2020-0118759 20200916
- Main IPC: G06N3/04
- IPC: G06N3/04 ; G06N3/10

Abstract:
A method includes predicting, for sets of input data, an input data number of a subsequent interval of a first interval using an input data number of the first interval and an input data number of a previous interval of the first interval set in a neural network inference optimization, determining the predicted input data number to be a batch size of the subsequent interval, determining whether pipelining is to be performed in a target device based on a resource state of the target device, and applying, to the target device, an inference policy including the determined batch size and a result of the determining of whether the pipelining is to be performed.
Information query