REGISTRY ENHANCEMENTS FOR JUST-IN-TIME COMPILATION OF MACHINE LEARNING MODELS

    Publication number: US20240020572A1

    Publication date: 2024-01-18

    Application number: US17819077

    Application date: 2022-08-11

    Applicant: VMware, Inc.

    CPC classification number: G06N20/00 G06F8/41

    Abstract: The disclosure provides an approach for dynamic centralized model compilation. Embodiments include receiving, from a client, a request for a machine learning model, wherein the request indicates either one or more attributes comprising one or more of a hardware characteristic, a target precision, or a compiler characteristic, or that one or more default behaviors should be used to compile the machine learning model. Embodiments include determining a compiler for the machine learning model based on the one or more attributes or the one or more default behaviors, wherein the compiler is stored in a registry. Embodiments include compiling the machine learning model using the compiler. Embodiments include providing the compiled machine learning model to the client in response to the request.
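The flow described in the abstract (receive a request, select a compiler from the registry, compile, return the result) can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration and does not come from the disclosure: the names `ModelRequest`, `Registry`, `serve`, and `make_stub_compiler`, the default values `cpu` and `fp32`, the choice to key compilers by hardware only, and the stub compilers that simply return a descriptive string. The compiler-characteristic attribute is omitted for brevity.

```python
from dataclasses import dataclass
from typing import Callable, Dict, Optional

# Hypothetical default behaviors, applied when the request asks for defaults
# or leaves an attribute unspecified (an assumption of this sketch).
DEFAULT_HARDWARE = "cpu"
DEFAULT_PRECISION = "fp32"

# In this sketch a "compiler" is a function from (model source, precision)
# to a string that stands in for the compiled artifact.
Compiler = Callable[[str, str], str]


@dataclass(frozen=True)
class ModelRequest:
    """Client request: either explicit attributes or a request for defaults."""
    model: str
    hardware: Optional[str] = None    # hardware characteristic, e.g. "gpu"
    precision: Optional[str] = None   # target precision, e.g. "fp16"
    use_defaults: bool = False


class Registry:
    """Holds model sources and compilers, and compiles on request."""

    def __init__(self) -> None:
        self._models: Dict[str, str] = {}
        self._compilers: Dict[str, Compiler] = {}

    def add_model(self, name: str, source: str) -> None:
        self._models[name] = source

    def add_compiler(self, hardware: str, compiler: Compiler) -> None:
        self._compilers[hardware] = compiler

    def serve(self, request: ModelRequest) -> str:
        # Step 1: resolve attributes from the request or from the defaults.
        if request.use_defaults:
            hardware, precision = DEFAULT_HARDWARE, DEFAULT_PRECISION
        else:
            hardware = request.hardware or DEFAULT_HARDWARE
            precision = request.precision or DEFAULT_PRECISION
        if request.model not in self._models:
            raise KeyError(f"unknown model: {request.model}")
        # Step 2: determine the compiler stored in the registry.
        if hardware not in self._compilers:
            raise LookupError(f"no compiler registered for hardware: {hardware}")
        compiler = self._compilers[hardware]
        # Step 3: compile just in time and return the result to the client.
        return compiler(self._models[request.model], precision)


def make_stub_compiler(target: str) -> Compiler:
    """Build a placeholder compiler that only tags the source with its target."""
    def compile_model(source: str, precision: str) -> str:
        return f"{source}|{target}|{precision}"
    return compile_model


registry = Registry()
registry.add_model("resnet", "resnet-src")
registry.add_compiler("cpu", make_stub_compiler("cpu"))
registry.add_compiler("gpu", make_stub_compiler("gpu"))

print(registry.serve(ModelRequest("resnet", hardware="gpu", precision="fp16")))
# prints "resnet-src|gpu|fp16"
print(registry.serve(ModelRequest("resnet", use_defaults=True)))
# prints "resnet-src|cpu|fp32"
```

Keeping compiler selection inside the registry means the client only states what it runs on and what precision it wants. A real system would presumably invoke an actual compiler toolchain at step 3 and might cache compiled artifacts, neither of which is modeled here.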
