Invention Grant
- Patent Title: Fused convolution and batch normalization for neural networks
- Application No.: US16219154
- Application Date: 2018-12-13
- Publication No.: US11573765B2
- Publication Date: 2023-02-07
- Inventor: Milind N. Nemlekar, Prerit Dak
- Applicant: ADVANCED MICRO DEVICES, INC.
- Applicant Address: US CA Santa Clara
- Assignee: ADVANCED MICRO DEVICES, INC.
- Current Assignee: ADVANCED MICRO DEVICES, INC.
- Current Assignee Address: US CA Santa Clara
- Main IPC: G06N3/04
- IPC: G06N3/04; G06F5/01; G06F17/16; G06N3/08

Abstract:
A processing unit implements a convolutional neural network (CNN) by fusing at least a portion of a convolution phase of the CNN with at least a portion of a batch normalization phase. The processing unit convolves two input matrices representing inputs and weights of a portion of the CNN to generate an output matrix. The processing unit performs the convolution via a series of multiplication operations, with each multiplication operation generating a corresponding submatrix (or “tile”) of the output matrix at an output register of the processing unit. While an output submatrix is stored at the output register, the processing unit performs a reduction phase and an update phase of the batch normalization phase for the CNN. The processing unit thus fuses at least a portion of the batch normalization phase of the CNN with a portion of the convolution.
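The fusion described in the abstract can be illustrated with a minimal sketch, which is not the patented implementation. It rests on several assumptions the abstract does not state: the convolution is expressed as a plain matrix product of an input matrix and a weight matrix, a local variable stands in for the output register, the reduction phase is taken to mean computing per-column partial sums and sums of squares for a tile, and the update phase is taken to mean folding those partial sums into running per-column accumulators. The function names `matmul_tile` and `fused_conv_batchnorm_stats` and the `tile` parameter are hypothetical.

```python
def matmul_tile(a, b, row0, row1, col0, col1):
    """Compute one output tile C[row0:row1, col0:col1] of the product A @ B."""
    inner = len(b)
    return [
        [sum(a[i][k] * b[k][j] for k in range(inner)) for j in range(col0, col1)]
        for i in range(row0, row1)
    ]


def fused_conv_batchnorm_stats(a, b, tile=2):
    """Return (C, mean, var), with per-column statistics gathered tile by tile.

    Assumption: each column of C corresponds to one output channel, and the
    batch-normalization statistics are taken over the rows of C.
    """
    rows, cols = len(a), len(b[0])
    c = [[0.0] * cols for _ in range(rows)]
    col_sum = [0.0] * cols      # running sum per column
    col_sumsq = [0.0] * cols    # running sum of squares per column

    for r0 in range(0, rows, tile):
        r1 = min(r0 + tile, rows)
        for c0 in range(0, cols, tile):
            c1 = min(c0 + tile, cols)
            # The tile lives in a local variable, standing in for the output register.
            out_tile = matmul_tile(a, b, r0, r1, c0, c1)

            # Reduction (assumed meaning): partial sums over the tile's rows.
            part_sum = [sum(row[j] for row in out_tile) for j in range(c1 - c0)]
            part_sumsq = [sum(row[j] ** 2 for row in out_tile) for j in range(c1 - c0)]

            # Update (assumed meaning): fold partial sums into the running totals.
            for j in range(c1 - c0):
                col_sum[c0 + j] += part_sum[j]
                col_sumsq[c0 + j] += part_sumsq[j]

            # Write the tile back to the output matrix only after its statistics are taken.
            for i, row in enumerate(out_tile):
                c[r0 + i][c0:c1] = row

    mean = [s / rows for s in col_sum]
    var = [sq / rows - m * m for sq, m in zip(col_sumsq, mean)]  # biased variance
    return c, mean, var


if __name__ == "__main__":
    inputs = [[1, 2], [3, 4], [5, 6]]
    weights = [[1, 0], [0, 1]]
    print(fused_conv_batchnorm_stats(inputs, weights, tile=2))
```

In this sketch the output matrix is never re-read to gather statistics: each tile contributes to the sums while it is still held locally, which is the memory-traffic saving the fusion is aimed at. Normalizing the output with the resulting mean and variance would still require a later pass.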