DEPLOYING PARALLELIZABLE DEEP LEARNING MODELS BY ADAPTING TO THE COMPUTING DEVICES

    公开(公告)号:US20220351020A1

    公开(公告)日:2022-11-03

    申请号:US17245541

    申请日:2021-04-30

    IPC分类号: G06N3/04

    摘要: In an approach to deploying parallelizable deep learning models by adapting to the computing devices, a deep learning model is split into a plurality of slices, where each slice can exchange data with related slices. Virtual models are created from the plurality of slices, where the virtual models are based on capabilities of a plurality of devices on which the one or more virtual models are to be deployed, and further where each virtual model contains each slice of the plurality of slices. The one or more virtual models are stored in a cache. Responsive to determining that the deep learning model is to be deployed on one or more devices, a candidate model is selected from the virtual models in the cache, where the selection is based on information from a device monitor about the devices.