USING CONTAINER INFORMATION TO SELECT CONTAINERS FOR EXECUTING MODELS

    公开(公告)号:US20220237505A1

    公开(公告)日:2022-07-28

    申请号:US17159639

    申请日:2021-01-27

    Abstract: Using container information to select containers for executing models is described. A system receives a request from an application and identifies a version of a machine-learning model associated with the request. The system identifies a set of each serving container corresponding to the machine-learning model from a cluster of available serving containers associated with the version of the machine-learning model. The system selects a serving container from the set of each serving container corresponding to the machine-learning model. If the machine-learning model is not loaded in the serving container, the system loads the machine-learning model in the serving container. If the machine-learning model is loaded in the serving container, the system executes, in the serving container, the machine-learning model on behalf of the request. The system responds to the request based on executing the machine-learning model on behalf of the request.

Patent Agency Ranking