Invention Application
- Patent Title: MEMORY BANDWIDTH REDUCTION TECHNIQUES FOR LOW POWER CONVOLUTIONAL NEURAL NETWORK INFERENCE APPLICATIONS
-
Application No.: US15812336Application Date: 2017-11-14
-
Publication No.: US20190147332A1Publication Date: 2019-05-16
- Inventor: Sateesh Lagudu , Lei Zhang , Allen Rush
- Applicant: Advanced Micro Devices, Inc. , ATI Technologies ULC
- Main IPC: G06N3/08
- IPC: G06N3/08 ; G06F1/32

Abstract:
Systems, apparatuses, and methods for implementing memory bandwidth reduction techniques for low power convolutional neural network inference applications are disclosed. A system includes at least a processing unit and an external memory coupled to the processing unit. The system detects a request to perform a convolution operation on input data from a plurality of channels. Responsive to detecting the request, the system partitions the input data from the plurality of channels into 3D blocks so as to minimize the external memory bandwidth utilization for the convolution operation being performed. Next, the system loads a selected 3D block from external memory into internal memory and then generates convolution output data for the selected 3D block for one or more features. Then, for each feature, the system adds convolution output data together across channels prior to writing the convolution output data to the external memory.
Public/Granted literature
- US11227214B2 Memory bandwidth reduction techniques for low power convolutional neural network inference applications Public/Granted day:2022-01-18
Information query