METHOD AND APPARATUS FOR ALLOCATING COMPUTING TASK OF NEURAL NETWORK IN HETEROGENEOUS RESOURCES, AND DEVICE

    公开(公告)号:US20240311193A1

    公开(公告)日:2024-09-19

    申请号:US18571650

    申请日:2022-04-28

    IPC分类号: G06F9/50

    CPC分类号: G06F9/5027 G06F2209/5017

    摘要: A method and apparatus for allocating a computing task of a neural network in heterogeneous resources, a computer device, and a storage medium. The method includes: acquiring task information of the computing task of the neural network and resource information of the heterogeneous resources; determining, according to the task information and the resource information, an allocation mode for allocating each subtask to the heterogeneous resources for execution and a task processing cost corresponding to each allocation mode; constructing a directed acyclic graph according to each allocation mode and each task processing cost; obtaining a value of a loss function corresponding to each allocation path according to the task processing cost corresponding to each subtask in an allocation path of the directed acyclic graph; and selecting a target allocation path according to the value of each loss function.