Systems and methods for knowledge transfer in machine learning

    公开(公告)号:US11620576B1

    公开(公告)日:2023-04-04

    申请号:US16908359

    申请日:2020-06-22

    Abstract: A training system may create and train a machine learning model with knowledge transfer. The knowledge transfer may transfer knowledge that is acquired by another machine learning model that has been previously trained to the machine learning model that is under training. The knowledge transfer may include a combination of representation transfer and instance transfer, the two of which may be performed alternatingly. The instance transfer may further include a filter mechanism to selectively identify instances with a satisfactory performance to implement the knowledge transfer.

    SIMULATION MODELING EXCHANGE
    6.
    发明申请

    公开(公告)号:US20200167687A1

    公开(公告)日:2020-05-28

    申请号:US16201864

    申请日:2018-11-27

    Abstract: A simulation application container executes a simulation of a system in a simulation environment, through which an agent representing the system uses a reinforcement learning model to operate within the simulation environment. The simulation application container obtains data indicating how the agent performed in the simulation environment and transmits this data to a robot application container. The robot application container uses the data to update the reinforcement learning model and provides the updated reinforcement learning model to perform another iteration of the simulation and continue training the reinforcement learning model.

    Robust multi-agent reinforcement learning

    公开(公告)号:US12265924B1

    公开(公告)日:2025-04-01

    申请号:US16908486

    申请日:2020-06-22

    Abstract: Techniques for robust multi-agent reinforcement learning (MARL) are described. An exemplary method includes initializing a plurality of parameters for a plurality of agents including at least policy parameters and action-value (Q) parameters; performing robust multi-agent reinforcement learning to learn polices for the agents, wherein in the learned polices no agent has an incentive to deviate, the agents include an implicit agent that is to select a worst-case at any given time during the learning process; and at least one agent utilizing its learned policy.

Patent Agency Ranking