SYSTEMS AND METHODS FOR MODEL-BASED META-LEARNING

    公开(公告)号:US20240119308A1

    公开(公告)日:2024-04-11

    申请号:US18159036

    申请日:2023-01-24

    CPC classification number: G06N3/0985

    Abstract: Embodiments provide a method for predicting agent actions for neural network based agents according to an intervention. The method includes obtaining a first agent action at a first time step and a first intervention generated according to an intervention policy. The method also includes generating, by the neural network based agent model, a predicted agent action conditioned on the first agent action and the first intervention. The method also includes generating, by a neural network based intervention model, a second intervention according to the intervention policy and conditioned on the first agent action, the first intervention, and the predicted agent action. The method further includes executing a second agent action according to an agent policy that incurs a reward based on the second intervention. The method further includes training the neural network based intervention model by updating parameters of the neural network based intervention model based on an expected return.

Patent Agency Ranking