USING CONTAINER INFORMATION TO SELECT CONTAINERS FOR EXECUTING MODELS

    公开(公告)号:US20220237505A1

    公开(公告)日:2022-07-28

    申请号:US17159639

    申请日:2021-01-27

    Abstract: Using container information to select containers for executing models is described. A system receives a request from an application and identifies a version of a machine-learning model associated with the request. The system identifies a set of each serving container corresponding to the machine-learning model from a cluster of available serving containers associated with the version of the machine-learning model. The system selects a serving container from the set of each serving container corresponding to the machine-learning model. If the machine-learning model is not loaded in the serving container, the system loads the machine-learning model in the serving container. If the machine-learning model is loaded in the serving container, the system executes, in the serving container, the machine-learning model on behalf of the request. The system responds to the request based on executing the machine-learning model on behalf of the request.

    MACHINE LEARNING INFERENCING BASED ON DIRECTED ACYCLIC GRAPHS

    公开(公告)号:US20220414547A1

    公开(公告)日:2022-12-29

    申请号:US17357312

    申请日:2021-06-24

    Abstract: Methods and systems for machine learning inferencing based on directed acyclic graphs are presented. A request for a machine learning application is received from a tenant application. A tenant identifier that identifies one of the tenants is determined from the request. Based on the tenant identifier and a type of the machine learning application, configuration parameters and a graph structure are determined. The graph structure defines a flow of operations for the machine learning application. Nodes of the graph structure are executed based on the configuration parameters to obtain a scoring result. Execution of a node causes a machine learning model generated for the first tenant to be applied to data related to the request. The scoring result is returned in response to the request.

Patent Agency Ranking