Invention Application
- Patent Title: RUNTIME OPTIMIZATION OF COMPUTATIONS OF AN ARTIFICIAL NEURAL NETWORK COMPILED FOR EXECUTION ON A DEEP LEARNING ACCELERATOR
-
Application No.: US17092044Application Date: 2020-11-06
-
Publication No.: US20220147813A1Publication Date: 2022-05-12
- Inventor: Andre Xian Ming Chang , Aliasger Tayeb Zaidy , Marko Vitez , Eugenio Culurciello
- Applicant: Micron Technology, Inc.
- Applicant Address: US ID Boise
- Assignee: Micron Technology, Inc.
- Current Assignee: Micron Technology, Inc.
- Current Assignee Address: US ID Boise
- Main IPC: G06N3/08
- IPC: G06N3/08 ; G06F8/41 ; G06N3/04

Abstract:
Systems, devices, and methods related to a Deep Learning Accelerator and memory are described. For example, an integrated circuit device may be configured to execute instructions with matrix operands and configured with random access memory (RAM). A compiler is configured to generate instructions executable by the Deep Learning Accelerator from a description of a target artificial neural network. The instructions may call routines in a runtime library that has an embedded artificial neural network configured to predict optimized execution options available to implement the routines. The prediction is based at least in part on a pattern of data being processed in the target artificial neural network and/or a pattern of usages of the routines by the instructions.
Information query