WORKLOAD PLACEMENT FOR VIRTUAL GPU ENABLED SYSTEMS

    公开(公告)号:US20210216375A1

    公开(公告)日:2021-07-15

    申请号:US16742108

    申请日:2020-01-14

    Applicant: VMware, Inc.

    Abstract: Disclosed are aspects of workload selection and placement in systems that include graphics processing units (GPUs) that are virtual GPU (vGPU) enabled. In some aspects, workloads are assigned to virtual graphics processing unit (vGPU)-enabled graphics processing units (GPUs) based on a variety of vGPU placement models. A number of vGPU placement neural networks are trained to maximize a composite efficiency metric based on workload data and GPU data for the plurality of vGPU placement models. A combined neural network selector is generated using the vGPU placement neural networks, and utilized to assign a workload to a vGPU-enabled GPU.

    MEMORY-AWARE PLACEMENT FOR VIRTUAL GPU ENABLED SYSTEMS

    公开(公告)号:US20210011773A1

    公开(公告)日:2021-01-14

    申请号:US16550313

    申请日:2019-08-26

    Applicant: VMWARE, INC.

    Abstract: Disclosed are aspects of memory-aware placement in systems that include graphics processing units (GPUs) that are virtual GPU (vGPU) enabled. Virtual graphics processing unit (vGPU) data is identified for graphics processing units (GPUs). A configured GPU list and an unconfigured GPU list are generated using the GPU data. The configured GPU list specifies configured vGPU profiles for configured GPUs. The unconfigured GPU list specifies a total GPU memory for unconfigured GPUs. A vGPU request is assigned to a vGPU of a GPU. The GPU is a first fit, from the configured GPU list or the unconfigured GPU list that satisfies a GPU memory requirement of the vGPU request.

    VGPU SCHEDULING POLICY-AWARE MIGRATION

    公开(公告)号:US20210373972A1

    公开(公告)日:2021-12-02

    申请号:US16890156

    申请日:2020-06-02

    Applicant: VMware, Inc.

    Abstract: Disclosed are aspects of virtual graphics processing unit (vGPU) scheduling-aware virtual machine migration. Graphics processing units (GPUs) that are compatible with a current virtual GPU (vGPU) profile for a virtual machine are identified. A scheduling policy matching order for a migration of the virtual machine is determined based on a current vGPU scheduling policy for the virtual machine. A destination GPU is selected based on a vGPU scheduling policy of the destination GPU being identified as a best available vGPU scheduling policy according to the scheduling policy matching order. The virtual machine is migrated to the destination GPU.

    SYMMETRIC ACCOUNT AUTHENTICATION
    6.
    发明申请

    公开(公告)号:US20200280544A1

    公开(公告)日:2020-09-03

    申请号:US16288453

    申请日:2019-02-28

    Applicant: VMware, Inc.

    Abstract: A method for symmetric authentication is provided. This method includes generating a first challenge message containing a first string; encrypting the first challenge message; transmitting the encrypted first challenge message to a second device; receiving a first answer message from the second device; decrypting the first answer message; and authenticating the second device based on determining the decrypted first answer message contains the first string. Upon successful authentication of the second device, the method further includes receiving an encrypted second challenge message from the second device; decrypting the encrypted second challenge message; generating a second answer message containing a second string; encrypting the second answer message; and transmitting the encrypted second answer message to the second device.

    DYNAMIC KERNEL SLICING FOR VGPU SHARING IN SERVERLESS COMPUTING SYSTEMS

    公开(公告)号:US20210110506A1

    公开(公告)日:2021-04-15

    申请号:US16601831

    申请日:2019-10-15

    Applicant: VMware, Inc.

    Abstract: Various examples are disclosed for dynamic kernel slicing for virtual graphics processing unit (vGPU) sharing in serverless computing systems. A computing device is configured to provide a serverless computing service, receive a request for execution of program code in the serverless computing service in which a plurality of virtual graphics processing units (vGPUs) are used in the execution of the program code, determine a slice size to partition a compute kernel of the program code into a plurality of sub-kernels for concurrent execution by the vGPUs, the slice size being determined for individual ones of the sub-kernels based on an optimization function that considers a load on a GPU, determine an execution schedule for executing the individual ones of the sub-kernels on the vGPUs in accordance with a scheduling policy, and execute the sub-kernels on the vGPUs as partitioned in accordance with the execution schedule.

Patent Agency Ranking