Invention Grant
- Patent Title: Compiling models for dedicated hardware
-
Application No.: US17903991Application Date: 2022-09-06
-
Publication No.: US12175375B2Publication Date: 2024-12-24
- Inventor: Gaurav Kapoor , Cecile M. Foret , Francesco Rossi , Kit-Man Wan , Umesh S. Vaishampayan , Etienne Belanger , Albert Antony , Alexey Marinichev , Marco Zuliani , Xiaojin Shi
- Applicant: Apple Inc.
- Applicant Address: US CA Cupertino
- Assignee: Apple Inc.
- Current Assignee: Apple Inc.
- Current Assignee Address: US CA Cupertino
- Agency: BAKERHOSTETLER
- Main IPC: G06N3/10
- IPC: G06N3/10 ; G06F8/41 ; G06F9/50 ; G06N3/04 ; G06N3/063 ; G06N3/08

Abstract:
The subject technology provides receiving a neural network (NN) model to be executed on a target platform, the NN model including multiple layers that include operations and some of the operations being executable on multiple processors of the target platform. The subject technology further sorts the operations from the multiple layers in a particular order based at least in part on grouping the operations that are executable by a particular processor of the multiple processors. The subject technology determines, based at least in part on a cost of transferring the operations between the multiple processors, an assignment of one of the multiple processors for each of the sorted operations of each of the layers in a manner that minimizes a total cost of executing the operations. Further, for each layer of the NN model, the subject technology includes an annotation to indicate the processor assigned for each of the operations.
Public/Granted literature
- US12051006B2 Compiling models for dedicated hardware Public/Granted day:2024-07-30
Information query