Invention Grant
- Patent Title: Host-directed multi-layer neural network processing via per-layer work requests
- Application No.: US15786102
- Application Date: 2017-10-17
- Publication No.: US11429848B2
- Publication Date: 2022-08-30
- Inventor: Aaron Ng, Elliott Delaye, Jindrich Zejda, Ashish Sirasao
- Applicant: Xilinx, Inc.
- Applicant Address: US CA San Jose
- Assignee: Xilinx, Inc.
- Current Assignee: Xilinx, Inc.
- Current Assignee Address: US CA San Jose
- Agency: Crawford Maunu PLLC
- Main IPC: G06N3/04
- IPC: G06N3/04; G06N3/063

Abstract:
In disclosed approaches of neural network processing, a host computer system copies an input data matrix from host memory to a shared memory for performing neural network operations of a first layer of a neural network by a neural network accelerator. The host instructs the neural network accelerator to perform neural network operations of each layer of the neural network beginning with the input data matrix. The neural network accelerator performs neural network operations of each layer in response to the instruction from the host. The host waits until the neural network accelerator signals completion of performing neural network operations of layer i before instructing the neural network accelerator to commence performing neural network operations of layer i+1, for i≥1. The host instructs the neural network accelerator to use a results data matrix in the shared memory from layer i as an input data matrix for layer i+1 for i≥1.
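The host-side control flow described in the abstract can be sketched in a few lines of Python. This is only an illustrative simulation under stated assumptions, not the patented implementation: the accelerator is modeled as a worker thread, the shared memory as a plain dictionary, and each layer as a toy matrix-vector product with ReLU. All names (`ToyAccelerator`, `host_run`, `matvec_relu`, the `buf{i}` keys) are hypothetical and do not come from the patent.

```python
import queue
import threading


def matvec_relu(weights, x):
    """Toy layer (assumption): matrix-vector product followed by ReLU."""
    return [max(0.0, sum(w * v for w, v in zip(row, x))) for row in weights]


class ToyAccelerator(threading.Thread):
    """Hypothetical stand-in for the accelerator; serves one work request at a time."""

    def __init__(self, shared):
        super().__init__(daemon=True)
        self.shared = shared            # dict standing in for the shared memory
        self.requests = queue.Queue()   # per-layer work requests issued by the host

    def run(self):
        while True:
            req = self.requests.get()
            if req is None:             # shutdown sentinel
                break
            weights, in_key, out_key, done = req
            self.shared[out_key] = matvec_relu(weights, self.shared[in_key])
            done.set()                  # signal completion of this layer to the host


def host_run(layers, host_input):
    """Host loop: one work request per layer, waiting for completion each time."""
    shared = {}
    acc = ToyAccelerator(shared)
    acc.start()
    shared["buf0"] = list(host_input)   # copy the input from host memory to shared memory
    for i, weights in enumerate(layers):
        done = threading.Event()
        # The results buffer of layer i is named as the input buffer of layer i+1.
        acc.requests.put((weights, f"buf{i}", f"buf{i + 1}", done))
        done.wait()                     # do not issue the next layer until this one completes
    acc.requests.put(None)
    acc.join()
    return list(shared[f"buf{len(layers)}"])  # copy the final results back to the host
```

For example, `host_run([[[1, 0], [0, -1]], [[1, 1]]], [2, 3])` runs two layers in sequence. The intermediate results stay in the simulated shared memory between layers, and the host only passes buffer names in each request.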
Public/Granted literature
- US20190114538A1 HOST-DIRECTED MULTI-LAYER NEURAL NETWORK PROCESSING VIA PER-LAYER WORK REQUESTS Public/Granted day: 2019-04-18