Invention Grant
- Patent Title: Neural network operation reordering for parallel execution
-
Application No.: US16453478Application Date: 2019-06-26
-
Publication No.: US11016775B2Publication Date: 2021-05-25
- Inventor: Jeffrey T. Huynh , Drazen Borkovic , Jindrich Zejda , Randy Renfu Huang , Ron Diamant
- Applicant: Amazon Technologies, Inc.
- Applicant Address: US WA Seattle
- Assignee: Amazon Technologies, Inc.
- Current Assignee: Amazon Technologies, Inc.
- Current Assignee Address: US WA Seattle
- Agency: Kilpatrick Townsend & Stockton LLP
- Main IPC: G06F8/00
- IPC: G06F8/00 ; G06F9/38 ; G06F9/50 ; G06N3/04 ; G06N3/08

Abstract:
Techniques are disclosed for reordering operations of a neural network to improve runtime efficiency. In some examples, a compiler receives a description of the neural network comprising a plurality of operations. The compiler may determine which execution engine of a plurality of execution engines is to perform each of the plurality of operations. The compiler may determine an order of performance associated with the plurality of operations. The compiler may identify a runtime inefficiency based on the order of performance and a hardware usage for each of the plurality of operations. An operation may be reordered to reduce the runtime inefficiency. Instructions may be compiled based on the plurality of operations, which include the reordered operation.
Information query