Neural network operation reordering for parallel execution

Invention Grant

US11016775B2 Neural network operation reordering for parallel execution (Status: Active)

Patent Title: Neural network operation reordering for parallel execution
Application No.: US16453478

Application Date: 2019-06-26
Publication No.: US11016775B2

Publication Date: 2021-05-25
Inventor: Jeffrey T. Huynh, Drazen Borkovic, Jindrich Zejda, Randy Renfu Huang, Ron Diamant
Applicant: Amazon Technologies, Inc.
Applicant Address: US WA Seattle
Assignee: Amazon Technologies, Inc.
Current Assignee: Amazon Technologies, Inc.
Current Assignee Address: US WA Seattle
Agency: Kilpatrick Townsend & Stockton LLP
Main IPC: G06F8/00
IPC: G06F8/00; G06F9/38; G06F9/50; G06N3/04; G06N3/08

Abstract:

Techniques are disclosed for reordering operations of a neural network to improve runtime efficiency. In some examples, a compiler receives a description of the neural network comprising a plurality of operations. The compiler may determine which execution engine of a plurality of execution engines is to perform each of the plurality of operations. The compiler may determine an order of performance associated with the plurality of operations. The compiler may identify a runtime inefficiency based on the order of performance and a hardware usage for each of the plurality of operations. An operation may be reordered to reduce the runtime inefficiency. Instructions may be compiled based on the plurality of operations, which include the reordered operation.
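
The flow in the abstract (assign each operation to an execution engine, fix an order, spot an inefficiency, reorder) can be illustrated with a toy model. The following is a minimal sketch and not the patented method: it assumes each engine runs its operations in program order, uses a greedy hoisting search as a stand-in for the compiler's reordering step, and treats total finish time as the measure of inefficiency. The engine labels (`pe`, `act`), operation names, and durations are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Op:
    name: str
    engine: str        # hypothetical engine label, e.g. "pe" or "act"
    duration: int      # assumed cost in arbitrary time units
    deps: tuple = ()   # names of operations that must finish first


def makespan(order):
    """Simulate in-order issue per engine and return the total finish time."""
    engine_free = {}   # engine -> time at which it becomes idle
    finish = {}        # operation name -> finish time
    for op in order:
        ready = max((finish[d] for d in op.deps), default=0)
        start = max(ready, engine_free.get(op.engine, 0))
        finish[op.name] = start + op.duration
        engine_free[op.engine] = finish[op.name]
    return max(finish.values(), default=0)


def is_valid(order):
    """Every operation must appear after all of its dependencies."""
    seen = set()
    for op in order:
        if any(d not in seen for d in op.deps):
            return False
        seen.add(op.name)
    return True


def reorder(order):
    """Greedily hoist one operation earlier whenever that shortens the makespan."""
    best = list(order)
    improved = True
    while improved:
        improved = False
        for i in range(1, len(best)):
            for j in range(i):
                # Move the operation at position i to position j.
                cand = best[:j] + [best[i]] + best[j:i] + best[i + 1:]
                if is_valid(cand) and makespan(cand) < makespan(best):
                    best, improved = cand, True
                    break
            if improved:
                break
    return best


# Hypothetical program: "relu" waits on "matmul", so the "act" engine sits
# idle while "scale", which has no dependencies, is queued behind "relu".
program = [
    Op("matmul", "pe", 4),
    Op("relu", "act", 3, deps=("matmul",)),
    Op("scale", "act", 3),
]
optimized = reorder(program)
print(makespan(program), makespan(optimized))
print([op.name for op in optimized])
```

In this toy case the idle gap on the `act` engine is the inefficiency: hoisting `scale` ahead of `relu` lets it run while `matmul` is still executing on `pe`. A real compiler would work from measured hardware usage per operation rather than the fixed durations assumed here.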

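IPC Classification:

G	Physics
G06	Computing; calculating or counting
G06F	Electric digital data processing (computer systems based on specific computational models: see G06N)
G06F8/00	Arrangements for software engineering (testing or debugging: see G06F 11/36; software project management, planning or organisation: see G06Q 10/06)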