PARTITIONING AND PLACEMENT OF MODELS

    公开(公告)号:US20230053575A1

    公开(公告)日:2023-02-23

    申请号:US17578872

    申请日:2022-01-19

    Abstract: This disclosure describes techniques and mechanisms for enabling a user to run heavy deep learning workloads on standard edge networks without off-loading computation to a cloud, leveraging the available edge computing resources, and efficiently partitioning and distributing a Deep Neural Network (DNN) over a network. The techniques enable the user to split a workload into multiple parts and process the workload on a set of smaller, less capable compute nodes in a distributed manner, without compromising on performance, and while meeting a Service Level Objective (SLO).

Patent Agency Ranking