SERVICE-LEVEL OBJECTIVE (SLO) AWARE EXECUTION OF CONCURRENCE INFERENCE REQUESTS ON A FOG-CLOUD NETWORK

    公开(公告)号:EP4398102A1

    公开(公告)日:2024-07-10

    申请号:EP23217957.2

    申请日:2023-12-19

    IPC分类号: G06F9/50

    摘要: Cloud and Fog computing are complementary technologies used for complex Internet of Things (IoT) based deployment of applications. With an increase in the number of internet-connected devices, the volume of data generated and processed at higher speeds has increased substantially. Serving a large amount of data and workloads for predictive decisions in real-time using fog computing without Service-Level Objective (SLO) violation is a challenge. Present disclosure provides systems and method for inference management wherein a suitable execution workflow is automatically generated to execute machine learning (ML)/deep learning (DL) inference requests using fog with various type of instances (e.g., Function-as-a-Service (FaaS) instance, Machine Learning-as-a-service (MLaaS) instance, and the like) provided by cloud vendors/platforms. Generated workflow minimizes the cost of deployment as well as SLO violations.