Memory bandwidth reduction techniques for low power convolutional neural network inference applications

Invention Grant

US11227214B2 Memory bandwidth reduction techniques for low power convolutional neural network inference applications (Active)

Patent Title: Memory bandwidth reduction techniques for low power convolutional neural network inference applications
Application No.: US15812336

Application Date: 2017-11-14
Publication No.: US11227214B2

Publication Date: 2022-01-18
Inventor: Sateesh Lagudu, Lei Zhang, Allen Rush
Applicant: Advanced Micro Devices, Inc., ATI Technologies ULC
Applicant Address: US CA Sunnyvale; CA Markham
Assignee: Advanced Micro Devices, Inc., ATI Technologies ULC
Current Assignee: Advanced Micro Devices, Inc., ATI Technologies ULC
Current Assignee Address: US CA Sunnyvale; CA Markham
Agency: Kowert, Hood, Munyon, Rankin & Goetzel, P.C.
Agent: Rory D. Rankin
Main IPC: G06N3/08
IPC: G06N3/08; G06F1/3296; G06N3/04; G06N3/063

Abstract:

Systems, apparatuses, and methods for implementing memory bandwidth reduction techniques for low power convolutional neural network inference applications are disclosed. A system includes at least a processing unit and an external memory coupled to the processing unit. The system detects a request to perform a convolution operation on input data from a plurality of channels. Responsive to detecting the request, the system partitions the input data from the plurality of channels into 3D blocks so as to minimize the external memory bandwidth utilization for the convolution operation being performed. Next, the system loads a selected 3D block from external memory into internal memory and then generates convolution output data for the selected 3D block for one or more features. Then, for each feature, the system adds convolution output data together across channels prior to writing the convolution output data to the external memory.
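
The flow described in the abstract can be illustrated with a minimal sketch. It is not the patented implementation: the function names, the tile parameters, and the choice to let each 3D block span all channels of one spatial tile are assumptions made here for illustration, and the tile size is passed in rather than chosen to minimize bandwidth. The sketch loads one 3D block into a local buffer, sums each feature's output across channels in a local accumulator, and writes each output element to "external" memory only once.

```python
def conv_reference(inp, weights):
    """Direct 'valid' convolution. inp[c][y][x], weights[f][c][ky][kx]."""
    C, H, W = len(inp), len(inp[0]), len(inp[0][0])
    F, K = len(weights), len(weights[0][0])
    OH, OW = H - K + 1, W - K + 1
    out = [[[0] * OW for _ in range(OH)] for _ in range(F)]
    for f in range(F):
        for c in range(C):
            for y in range(OH):
                for x in range(OW):
                    for ky in range(K):
                        for kx in range(K):
                            out[f][y][x] += inp[c][y + ky][x + kx] * weights[f][c][ky][kx]
    return out


def conv_blocked(inp, weights, tile_h, tile_w):
    """Hypothetical blocked variant: returns (output, external write count).

    Assumption: each 3D block covers all channels of one spatial tile
    (plus a K-1 halo), so channel sums never leave the local accumulator.
    """
    C, H, W = len(inp), len(inp[0]), len(inp[0][0])
    F, K = len(weights), len(weights[0][0])
    OH, OW = H - K + 1, W - K + 1
    external_out = [[[0] * OW for _ in range(OH)] for _ in range(F)]
    external_writes = 0
    for y0 in range(0, OH, tile_h):
        for x0 in range(0, OW, tile_w):
            th, tw = min(tile_h, OH - y0), min(tile_w, OW - x0)
            # "Load" the selected 3D block from external into internal memory.
            block = [[row[x0:x0 + tw + K - 1]
                      for row in inp[c][y0:y0 + th + K - 1]]
                     for c in range(C)]
            for f in range(F):
                # Internal accumulator: add outputs together across channels.
                acc = [[0] * tw for _ in range(th)]
                for c in range(C):
                    for y in range(th):
                        for x in range(tw):
                            for ky in range(K):
                                for kx in range(K):
                                    acc[y][x] += block[c][y + ky][x + kx] * weights[f][c][ky][kx]
                # Write the already-summed result to external memory once.
                for y in range(th):
                    for x in range(tw):
                        external_out[f][y0 + y][x0 + x] = acc[y][x]
                        external_writes += 1
    return external_out, external_writes
```

In this sketch the number of external writes is F × OH × OW (features times output height times output width). A scheme that wrote each channel's partial result separately and summed it later would need C times as many writes, plus the reads to fetch the partial sums back.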

Public/Granted literature

US20190147332A1 MEMORY BANDWIDTH REDUCTION TECHNIQUES FOR LOW POWER CONVOLUTIONAL NEURAL NETWORK INFERENCE APPLICATIONS
Public/Granted day: 2019-05-16

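IPC classification:

G	Physics
G06	Computing; calculating or counting
G06N	Computer systems based on specific computational models
G06N3/00	Computer systems based on biological models
G06N3/02	.Using neural network models
G06N3/08	..Learning methods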