COMPILING MODELS FOR DEDICATED HARDWARE
    3.
    发明申请

    公开(公告)号:US20200082274A1

    公开(公告)日:2020-03-12

    申请号:US16262809

    申请日:2019-01-30

    Applicant: Apple Inc.

    Abstract: The subject technology provides receiving a neural network (NN) model to be executed on a target platform, the NN model including multiple layers that include operations and some of the operations being executable on multiple processors of the target platform. The subject technology further sorts the operations from the multiple layers in a particular order based at least in part on grouping the operations that are executable by a particular processor of the multiple processors. The subject technology determines, based at least in part on a cost of transferring the operations between the multiple processors, an assignment of one of the multiple processors for each of the sorted operations of each of the layers in a manner that minimizes a total cost of executing the operations. Further, for each layer of the NN model, the subject technology includes an annotation to indicate the processor assigned for each of the operations.

Patent Agency Ranking