Invention Publication
- Patent Title: ELASTIC PROVISIONING OF CONTAINER-BASED GRAPHICS PROCESSING UNIT (GPU) NODES
- Application No.: US18142041
- Application Date: 2023-05-02
- Publication No.: US20240241760A1
- Publication Date: 2024-07-18
- Inventor: Yisan ZHAO, Xiaoyu HU, Robert RIEMER, Aidan CULLY
- Applicant: VMware, Inc.
- Applicant Address: US CA Palo Alto
- Assignee: VMware, Inc.
- Current Assignee: VMware, Inc.
- Current Assignee Address: US CA Palo Alto
- Priority: WO TCN2023000007 2023.01.12
- Main IPC: G06F9/50
- IPC: G06F9/50; G06F11/34

Abstract:
Example methods and systems for elastic provisioning of container-based graphics processing unit (GPU) nodes are described. In one example, a computer system may monitor usage information associated with a pool of multiple container-based GPU nodes. Based on the usage information, the computer system may apply rule(s) to determine whether capacity adjustment is required. In response to determination that capacity expansion is required, the computer system may configure the pool to expand by adding (a) at least one container-based GPU node to the pool, or (b) at least one container pod to one of the multiple container-based GPU nodes. Otherwise, in response to determination that capacity shrinkage is required, the computer system may configure the pool to shrink by removing (a) at least one container-based GPU node, or (b) at least one container pod from the pool.
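The monitor, decide, and adjust flow described in the abstract can be illustrated with a minimal Python sketch. It is not the claimed implementation. The names `GpuNode`, `Pool`, `decide`, `expand`, `shrink`, and `reconcile`, the single utilization metric, the 0.8 and 0.3 thresholds, and the per-node pod limit are all hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass, field
from enum import Enum
from typing import List


class Action(Enum):
    EXPAND = "expand"
    SHRINK = "shrink"
    NONE = "none"


@dataclass
class GpuNode:
    name: str
    pods: int       # container pods currently running on this node
    max_pods: int   # hypothetical per-node pod limit


@dataclass
class Pool:
    nodes: List[GpuNode] = field(default_factory=list)


def decide(utilization: float, high: float = 0.8, low: float = 0.3) -> Action:
    """Apply a simple threshold rule to the monitored usage (assumed thresholds)."""
    if utilization > high:
        return Action.EXPAND
    if utilization < low:
        return Action.SHRINK
    return Action.NONE


def expand(pool: Pool) -> str:
    """Add a pod to a node with spare room, otherwise add a new node."""
    for node in pool.nodes:
        if node.pods < node.max_pods:
            node.pods += 1
            return f"added pod to {node.name}"
    new_node = GpuNode(name=f"gpu-node-{len(pool.nodes)}", pods=1, max_pods=2)
    pool.nodes.append(new_node)
    return f"added node {new_node.name}"


def shrink(pool: Pool, min_nodes: int = 1) -> str:
    """Remove a pod from the last node, or the node itself if it holds one pod."""
    if not pool.nodes:
        return "pool is empty"
    node = pool.nodes[-1]
    if node.pods > 1:
        node.pods -= 1
        return f"removed pod from {node.name}"
    if len(pool.nodes) > min_nodes:
        pool.nodes.pop()
        return f"removed node {node.name}"
    return "pool already at minimum size"


def reconcile(pool: Pool, utilization: float) -> str:
    """One monitoring cycle: evaluate the rule, then adjust pool capacity."""
    action = decide(utilization)
    if action is Action.EXPAND:
        return expand(pool)
    if action is Action.SHRINK:
        return shrink(pool)
    return "no adjustment"
```

Calling `reconcile(pool, utilization)` once per monitoring interval grows the pool pod-first and node-second under high load, and reverses that order under low load. A real system would likely use richer usage information and rules than this single threshold pair.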