- 专利标题: MEMORY BANDWIDTH REDUCTION TECHNIQUES FOR LOW POWER CONVOLUTIONAL NEURAL NETWORK INFERENCE APPLICATIONS
-
申请号: US15812336申请日: 2017-11-14
-
公开(公告)号: US20190147332A1公开(公告)日: 2019-05-16
- 发明人: Sateesh Lagudu , Lei Zhang , Allen Rush
- 申请人: Advanced Micro Devices, Inc. , ATI Technologies ULC
- 主分类号: G06N3/08
- IPC分类号: G06N3/08 ; G06F1/32
摘要:
Systems, apparatuses, and methods for implementing memory bandwidth reduction techniques for low power convolutional neural network inference applications are disclosed. A system includes at least a processing unit and an external memory coupled to the processing unit. The system detects a request to perform a convolution operation on input data from a plurality of channels. Responsive to detecting the request, the system partitions the input data from the plurality of channels into 3D blocks so as to minimize the external memory bandwidth utilization for the convolution operation being performed. Next, the system loads a selected 3D block from external memory into internal memory and then generates convolution output data for the selected 3D block for one or more features. Then, for each feature, the system adds convolution output data together across channels prior to writing the convolution output data to the external memory.
公开/授权文献
信息查询