-
公开(公告)号:US20240020572A1
公开(公告)日:2024-01-18
申请号:US17819077
申请日:2022-08-11
Applicant: VMware, Inc.
Inventor: Malini BHANDARU , Jia ZOU , Hai Ning ZHANG , Anthea JUNG
Abstract: The disclosure provides an approach for dynamic centralized model compilation. Embodiments include receiving, from a client, a request for a machine learning model, wherein the request indicates either one or more attributes comprising one or more of a hardware characteristic, a target precision, or a compiler characteristic, or that one or more default behaviors should be used to compile the machine learning model. Embodiments include determining a compiler for the machine learning model based on the one or more attributes or the one or more default behaviors, wherein the compiler is stored in a registry. Embodiments include compiling the machine learning model using the compiler. Embodiments include providing the compiled machine learning model to the client in response to the request.