INCREMENTAL MACHINE LEARNING MODEL DEPLOYMENT

    公开(公告)号:US20250147753A1

    公开(公告)日:2025-05-08

    申请号:US18939097

    申请日:2024-11-06

    Abstract: In accordance with example embodiments of the invention there is at least a method and apparatus to perform executing a machine learning inference loop of a currently deployed or stored at least one machine learning model, wherein the currently deployed or stored at least one machine learning model is identified based on a manifest file received from a communication network; based on determined factors, requesting from the communication network a model update to trigger the model update for use with the currently deployed or stored at least one machine learning model; based on the request, receiving information from the communication network comprising the model update; and based on the information, performing a model update to update the currently deployed or stored at least one machine learning model. Further, receiving, based on determined factors, from a user equipment a communication to trigger a machine learning model update for use with a currently deployed or stored at least one machine learning model at the user equipment; based on the communication, determining information comprising the model update; based on the determining, sending towards the client the information comprising the model update for a model update to update the currently deployed or stored at least one machine learning model.

    Compression Framework for Distributed or Federated Learning with Predictive Compression Paradigm

    公开(公告)号:US20220335269A1

    公开(公告)日:2022-10-20

    申请号:US17717729

    申请日:2022-04-11

    Abstract: An apparatus includes circuitry configured to: receive a plurality of compressed residual local weight updates from a plurality of respective institutes with a plurality of a respective first parameter, the first parameter used to determine a plurality of respective predicted local weight updates; determine a plurality of local weight updates or a plurality of adjusted local weight updates based on the plurality of compressed residual local weight updates and the plurality of respective predicted local weight updates; aggregate the plurality of determined local weight updates or the plurality of adjusted local weight updates to generate an intended global weight update, and update a model on a server based at least on the intended global weight update, the model used to perform a task; and transfer a compressed residual global weight update to the institutes with a second parameter, the second parameter used to determine a predicted global weight update.

Patent Agency Ranking