Invention Grant
- Patent Title: Memory bandwidth reduction techniques for low power convolutional neural network inference applications
-
Application No.: US15812336Application Date: 2017-11-14
-
Publication No.: US11227214B2Publication Date: 2022-01-18
- Inventor: Sateesh Lagudu , Lei Zhang , Allen Rush
- Applicant: Advanced Micro Devices, Inc. , ATI Technologies ULC
- Applicant Address: US CA Sunnyvale; CA Markham
- Assignee: Advanced Micro Devices, Inc.,ATI Technologies ULC
- Current Assignee: Advanced Micro Devices, Inc.,ATI Technologies ULC
- Current Assignee Address: US CA Sunnyvale; CA Markham
- Agency: Kowert, Hood, Munyon, Rankin & Goetzel, P.C.
- Agent Rory D. Rankin
- Main IPC: G06N3/08
- IPC: G06N3/08 ; G06F1/3296 ; G06N3/04 ; G06N3/063

Abstract:
Systems, apparatuses, and methods for implementing memory bandwidth reduction techniques for low power convolutional neural network inference applications are disclosed. A system includes at least a processing unit and an external memory coupled to the processing unit. The system detects a request to perform a convolution operation on input data from a plurality of channels. Responsive to detecting the request, the system partitions the input data from the plurality of channels into 3D blocks so as to minimize the external memory bandwidth utilization for the convolution operation being performed. Next, the system loads a selected 3D block from external memory into internal memory and then generates convolution output data for the selected 3D block for one or more features. Then, for each feature, the system adds convolution output data together across channels prior to writing the convolution output data to the external memory.
Public/Granted literature
- US20190147332A1 MEMORY BANDWIDTH REDUCTION TECHNIQUES FOR LOW POWER CONVOLUTIONAL NEURAL NETWORK INFERENCE APPLICATIONS Public/Granted day:2019-05-16
Information query