MULTIPLE MODEL INJECTION FOR A DEPLOYMENT CLUSTER

    Publication No.: US20220078264A1

    Publication date: 2022-03-10

    Application No.: US17531068

    Filing date: 2021-11-19

    Inventor: Kartik Mathur

    Abstract: Systems and methods are provided for servicing an inference request by one of multiple machine learning models attached to a deployment cluster. The API server of the deployment cluster is not tightly coupled to any of the multiple machine learning models attached to the deployment cluster. Upon receiving an inference request, the deployment cluster can retrieve the configuration parameters, including serialization formatting, for a target model identified in the inference request. The deployment cluster can utilize the retrieved parameters to service the inference request and return the results to a business system application.
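
The loose coupling described in the abstract above can be illustrated with a minimal Python sketch. It is not the patented implementation: the names `ModelConfig`, `ModelRegistry`, and `handle_inference_request`, and the two serialization formats (`json` and `csv`), are hypothetical choices made for illustration. The point it shows is that the request handler knows nothing about any particular model and instead looks up the target model's configuration, including its serialization format, at request time.

```python
import json
from dataclasses import dataclass
from typing import Any, Callable, Dict


@dataclass
class ModelConfig:
    """Hypothetical per-model configuration stored by the cluster."""
    name: str
    serialization: str              # assumed formats: "json" or "csv"
    predict: Callable[[Any], Any]   # stand-in for the loaded model


# Assumed serialization handlers, keyed by format name.
DESERIALIZERS: Dict[str, Callable[[bytes], Any]] = {
    "json": lambda raw: json.loads(raw.decode("utf-8")),
    "csv": lambda raw: [float(v) for v in raw.decode("utf-8").split(",")],
}
SERIALIZERS: Dict[str, Callable[[Any], bytes]] = {
    "json": lambda obj: json.dumps(obj).encode("utf-8"),
    "csv": lambda obj: ",".join(str(v) for v in obj).encode("utf-8"),
}


class ModelRegistry:
    """Holds the configurations of all models attached to the cluster."""

    def __init__(self) -> None:
        self._configs: Dict[str, ModelConfig] = {}

    def attach(self, config: ModelConfig) -> None:
        self._configs[config.name] = config

    def lookup(self, name: str) -> ModelConfig:
        if name not in self._configs:
            raise KeyError(f"no model named {name!r} is attached")
        return self._configs[name]


def handle_inference_request(
    registry: ModelRegistry, model_name: str, payload: bytes
) -> bytes:
    """Service a request using only parameters retrieved for the target model."""
    config = registry.lookup(model_name)
    features = DESERIALIZERS[config.serialization](payload)
    result = config.predict(features)
    return SERIALIZERS[config.serialization](result)
```

Under these assumptions, attaching another model is a single `registry.attach(...)` call, and the handler itself does not change.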

    Multiple model injection for a deployment cluster

    Publication No.: US11206316B2

    Publication date: 2021-12-21

    Application No.: US16809414

    Filing date: 2020-03-04

    Inventor: Kartik Mathur

    Abstract: Systems and methods are provided for servicing an inference request by one of multiple machine learning models attached to a deployment cluster. The API server of the deployment cluster is not tightly coupled to any of the multiple machine learning models attached to the deployment cluster. Upon receiving an inference request, the deployment cluster can retrieve the configuration parameters, including serialization formatting, for a target model identified in the inference request. The deployment cluster can utilize the retrieved parameters to service the inference request and return the results to a business system application.

    ENHANCED MIGRATION OF CLUSTERS BASED ON DATA ACCESSIBILITY

    Publication No.: US20210019162A1

    Publication date: 2021-01-21

    Application No.: US16514434

    Filing date: 2019-07-17

    Abstract: Described herein are systems, methods, and software to migrate virtual nodes of a data processing cluster. In one implementation, a management system monitors an executing data processing cluster on one or more first hosts to determine when the data processing cluster satisfies migration criteria. Once satisfied, the management system selects one or more second hosts to support the data processing cluster based on accommodation data associated with the hosts. After selection, the management system may initiate operations to migrate the data processing cluster from the one or more first hosts to the one or more second hosts.
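
The monitor, select, and migrate flow in the abstract above can be sketched roughly as follows. This is an illustrative Python sketch only: the abstract does not say what the migration criteria or the accommodation data contain, so the latency threshold, the `free_cpus` and `data_latency_ms` fields, and all class and function names here are assumptions. It also simplifies "one or more second hosts" to a single target host.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Host:
    """Hypothetical accommodation data for a candidate host."""
    name: str
    free_cpus: int
    data_latency_ms: float  # assumed measure of data accessibility


@dataclass
class ClusterStatus:
    """Hypothetical snapshot of the monitored cluster."""
    required_cpus: int
    current_latency_ms: float


def satisfies_migration_criteria(
    status: ClusterStatus, latency_threshold_ms: float
) -> bool:
    # Assumed criterion: data access from the current hosts is too slow.
    return status.current_latency_ms > latency_threshold_ms


def select_target_host(
    status: ClusterStatus, candidates: List[Host]
) -> Optional[Host]:
    # Keep hosts that can fit the cluster and improve data access,
    # then prefer the one with the lowest latency to the data.
    eligible = [
        h for h in candidates
        if h.free_cpus >= status.required_cpus
        and h.data_latency_ms < status.current_latency_ms
    ]
    return min(eligible, key=lambda h: h.data_latency_ms, default=None)


def plan_migration(
    status: ClusterStatus,
    candidates: List[Host],
    latency_threshold_ms: float = 50.0,
) -> Optional[Host]:
    """Return the host to migrate to, or None if no migration is planned."""
    if not satisfies_migration_criteria(status, latency_threshold_ms):
        return None
    return select_target_host(status, candidates)
```

A real management system would then initiate the migration to the selected host; that step is left out of the sketch.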

    Multiple model injection for a deployment cluster

    Publication No.: US12192292B2

    Publication date: 2025-01-07

    Application No.: US18610561

    Filing date: 2024-03-20

    Inventor: Kartik Mathur

    Abstract: Systems and methods are provided for servicing an inference request by one of multiple machine learning models attached to a deployment cluster. The API server of the deployment cluster is not tightly coupled to any of the multiple machine learning models attached to the deployment cluster. Upon receiving an inference request, the deployment cluster can retrieve the configuration parameters, including serialization formatting, for a target model identified in the inference request. The deployment cluster can utilize the retrieved parameters to service the inference request and return the results to a business system application.
