Invention Application
- Patent Title: ORCHESTRATION OF WORKLOADS INVOLVING AN AI MODEL
-
Application No.: US18483849Application Date: 2023-10-10
-
Publication No.: US20250071069A1Publication Date: 2025-02-27
- Inventor: Aladin Djuhera , Alecio Pedro Delazari Binotto , Fernando Luiz Koch , Rob High
- Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
- Applicant Address: US NY ARMONK
- Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
- Current Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
- Current Assignee Address: US NY ARMONK
- Priority: GBGB2312781.4 20230822
- Main IPC: H04L47/70
- IPC: H04L47/70 ; H04L41/16 ; H04L47/80

Abstract:
The present disclosure relates to a method comprising receiving a request to execute a workload using an artificial intelligence model. A current resource utilization status in the distributed system may be determined. The current resource utilization status may be used to define a deployment configuration of the artificial intelligence model, wherein the deployment configuration is defined by: a number and structure of input blocks, a number and structure of output blocks and the intermediate block of the artificial intelligence model, a second computer system to execute the intermediate block, and one or more first computer systems to execute the input and output blocks. The artificial intelligence model may be deployed in accordance with the defined deployment configuration and the workload may be executed.
Information query