Publication No.: US20190303757A1
Publication Date: 2019-10-03
Application No.: US16221295
Filing Date: 2018-12-14
Applicant: MediaTek Inc.
Inventors: Wei-Ting Wang, Han-Lin Li, Chih Chung Cheng, Shao-Yu Wang
Abstract: A deep learning accelerator (DLA) includes processing elements (PEs) grouped into PE groups to perform convolutional neural network (CNN) computations by applying multi-dimensional weights to an input activation to produce an output activation. The DLA also includes a dispatcher, which dispatches input data in the input activation and non-zero weights in the multi-dimensional weights to the processing elements according to a control mask, and a buffer memory, which stores the control mask. The control mask specifies the positions of zero weights in the multi-dimensional weights. The PE groups generate output data of respective output channels in the output activation and share the same control mask, so the zero weights occupy the same positions for every group.
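The shared-mask idea in the abstract can be illustrated in software. The following is a minimal sketch, not the patented hardware design: it assumes the control mask is a boolean array with one entry per weight position (True marking a zero weight), a stride-1 convolution without padding, and one loop iteration standing in for each PE group. The names `nonzero_positions` and `sparse_conv` and the data layouts are hypothetical; the abstract does not specify a mask encoding.

```python
def nonzero_positions(control_mask):
    """Return the (c, r, s) positions the mask does NOT flag as zero weights."""
    return [
        (c, r, s)
        for c, plane in enumerate(control_mask)
        for r, row in enumerate(plane)
        for s, is_zero in enumerate(row)
        if not is_zero
    ]


def sparse_conv(activation, filters, control_mask):
    """Stride-1, unpadded convolution that skips positions flagged in the mask.

    activation:   [C][H][W] input activation
    filters:      [K][C][R][S] weights, one filter per output channel
    control_mask: [C][R][S] booleans, True where every filter holds a zero
                  (assumed encoding, shared by all output channels)
    """
    height, width = len(activation[0]), len(activation[0][0])
    rows, cols = len(control_mask[0]), len(control_mask[0][0])
    # Decoded once and reused for every output channel, because the mask is shared.
    positions = nonzero_positions(control_mask)
    output = []
    for weights in filters:  # one iteration stands in for one PE group
        # Only the non-zero weights are handed to this group.
        packed = [weights[c][r][s] for c, r, s in positions]
        channel = [
            [
                sum(
                    w * activation[c][y + r][x + s]
                    for w, (c, r, s) in zip(packed, positions)
                )
                for x in range(width - cols + 1)
            ]
            for y in range(height - rows + 1)
        ]
        output.append(channel)
    return output


# One input channel, 3x3 activation, two 2x2 filters with zeros on the anti-diagonal.
activation = [[[1, 2, 3], [4, 5, 6], [7, 8, 9]]]
control_mask = [[[False, True], [True, False]]]
filters = [
    [[[1, 0], [0, 2]]],   # output channel 0
    [[[3, 0], [0, -1]]],  # output channel 1
]
result = sparse_conv(activation, filters, control_mask)
print(result)  # [[[11, 14], [20, 23]], [[-2, 0], [4, 6]]]
```

Because both filters have zeros at the same positions, the list of positions to fetch from the input activation is computed once and reused for every output channel; each channel performs two multiply-accumulates per output value instead of four.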