APPARATUS AND METHOD WITH SCHEDULING
    2.
    发明公开

    公开(公告)号:US20230143270A1

    公开(公告)日:2023-05-11

    申请号:US17887968

    申请日:2022-08-15

    IPC分类号: G06F9/48

    CPC分类号: G06F9/4887

    摘要: A processor-implemented method with scheduling includes: receiving one or more execution requests for a plurality of models executed independently of each other in an accelerator; predicting, for each of the plurality of models, quality of service (QoS) information corresponding to the model; and scheduling the plurality of models in units of layers of the plurality of models based on, for each of the plurality of models, either one or both of the QoS information and an idle time occurring in response to a candidate layer to be scheduled in the model being executed in the accelerator.