Compiling models for dedicated hardware

Invention Grant

US12175375B2 Compiling models for dedicated hardware (Active)

Patent Title: Compiling models for dedicated hardware
Application No.: US17903991

Application Date: 2022-09-06
Publication No.: US12175375B2

Publication Date: 2024-12-24
Inventor: Gaurav Kapoor, Cecile M. Foret, Francesco Rossi, Kit-Man Wan, Umesh S. Vaishampayan, Etienne Belanger, Albert Antony, Alexey Marinichev, Marco Zuliani, Xiaojin Shi
Applicant: Apple Inc.
Applicant Address: US CA Cupertino
Assignee: Apple Inc.
Current Assignee: Apple Inc.
Current Assignee Address: US CA Cupertino
Agency: BAKERHOSTETLER
Main IPC: G06N3/10
IPC: G06N3/10 ; G06F8/41 ; G06F9/50 ; G06N3/04 ; G06N3/063 ; G06N3/08

Abstract:

The subject technology provides receiving a neural network (NN) model to be executed on a target platform, the NN model including multiple layers that include operations and some of the operations being executable on multiple processors of the target platform. The subject technology further sorts the operations from the multiple layers in a particular order based at least in part on grouping the operations that are executable by a particular processor of the multiple processors. The subject technology determines, based at least in part on a cost of transferring the operations between the multiple processors, an assignment of one of the multiple processors for each of the sorted operations of each of the layers in a manner that minimizes a total cost of executing the operations. Further, for each layer of the NN model, the subject technology includes an annotation to indicate the processor assigned for each of the operations.
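The assignment step described in the abstract can be illustrated with a small sketch. The abstract does not say which optimization method is used, so the dynamic program below is an assumption chosen for illustration, not the patented method. It treats the sorted operations as a linear sequence, charges a transfer cost whenever consecutive operations run on different processors, and picks the processor for each operation that minimizes the total cost. The names `Op`, `assign_processors`, and `transfer_cost`, and all cost values, are hypothetical.

```python
from typing import Dict, List, Tuple

# Hypothetical representation: each operation maps a processor name to the
# cost of running the operation there. A processor that is absent from the
# dict cannot execute that operation.
Op = Dict[str, float]


def assign_processors(
    ops: List[Op],
    transfer_cost: Dict[Tuple[str, str], float],
) -> Tuple[float, List[str]]:
    """Return (minimum total cost, processor chosen for each operation)."""
    if not ops:
        return 0.0, []

    # best[p] = cheapest cost of running ops[0..i] with ops[i] on processor p
    best: Dict[str, float] = dict(ops[0])
    back: List[Dict[str, str]] = []  # back-pointers, one dict per later op

    for op in ops[1:]:
        nxt: Dict[str, float] = {}
        choice: Dict[str, str] = {}
        for proc, run_cost in op.items():
            # Cost of arriving at `proc` from predecessor processor q.
            def arrive(q: str, proc: str = proc) -> float:
                hop = 0.0 if q == proc else transfer_cost[(q, proc)]
                return best[q] + hop

            prev = min(best, key=arrive)
            nxt[proc] = arrive(prev) + run_cost
            choice[proc] = prev
        back.append(choice)
        best = nxt

    # Walk the back-pointers from the cheapest final processor.
    last = min(best, key=lambda p: best[p])
    total = best[last]
    plan = [last]
    for choice in reversed(back):
        plan.append(choice[plan[-1]])
    plan.reverse()
    return total, plan
```

Calling `assign_processors(ops, transfer_cost)` returns the total cost and one processor name per operation. Attaching each name to its operation, for example as `{"op": i, "processor": name}`, would play the role of the per-layer annotation the abstract mentions. A real compiler would work on a graph of operations rather than a flat list, which this sketch does not attempt.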

Public/Granted literature

US12051006B2 Compiling models for dedicated hardware Public/Granted day: 2024-07-30

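IPC Classification:

G	Physics
G06	Computing; calculating or counting
G06N	Computer systems based on specific computational models
G06N3/00	Computer systems based on biological models
G06N3/02	.Using neural network models
G06N3/10	..Simulation on general-purpose computers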