Adversarial Cooperative Imitation Learning for Dynamic Treatment

    Publication number: US20230376774A1

    Publication date: 2023-11-23

    Application number: US18362166

    Filing date: 2023-07-31

    CPC classification number: G06N3/084 G16H50/30 G06N20/20 G06N5/046 G06N3/045

    Abstract: Methods and systems for responding to changing conditions include training a model, using a processor, using trajectories that resulted in a positive outcome and trajectories that resulted in a negative outcome. Training is performed using an adversarial discriminator to train the model to generate trajectories that are similar to historical trajectories that resulted in a positive outcome, and using a cooperative discriminator to train the model to generate trajectories that are dissimilar to historical trajectories that resulted in a negative outcome. A dynamic response regime is generated using the trained model and environment information. A response to changing environment conditions is performed in accordance with the dynamic response regime.
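
    Illustrative sketch (not taken from the patent text): the two training signals described in the abstract can be pictured as a pair of binary classifiers whose outputs are combined into one reward for the model. The sketch below assumes each discriminator outputs a probability per step, that both are trained with binary cross-entropy, and that the model's reward is a weighted sum of log terms. The function names, the `coop_weight` parameter, and the exact reward form are hypothetical choices made for illustration only.

```python
import math

EPS = 1e-8  # small constant to avoid log(0)


def bce(prob, label):
    """Binary cross-entropy for one predicted probability and a 0/1 label."""
    return -(label * math.log(prob + EPS) + (1 - label) * math.log(1 - prob + EPS))


def adversarial_disc_loss(d_on_positive, d_on_generated):
    """Adversarial discriminator (assumed form): positive-outcome history is
    labeled 1 and model-generated steps are labeled 0."""
    losses = [bce(p, 1) for p in d_on_positive] + [bce(p, 0) for p in d_on_generated]
    return sum(losses) / len(losses)


def cooperative_disc_loss(d_on_negative, d_on_generated):
    """Cooperative discriminator (assumed form): negative-outcome history is
    labeled 1 and model-generated steps are labeled 0."""
    losses = [bce(p, 1) for p in d_on_negative] + [bce(p, 0) for p in d_on_generated]
    return sum(losses) / len(losses)


def policy_reward(d_adv, d_coop, coop_weight=1.0):
    """Hypothetical per-step reward for the model being trained.

    d_adv:  adversarial discriminator's probability that the step comes from
            positive-outcome history; the model tries to push this up.
    d_coop: cooperative discriminator's probability that the step comes from
            negative-outcome history; the model tries to push this down.
    """
    return math.log(d_adv + EPS) + coop_weight * math.log(1.0 - d_coop + EPS)


# A step that resembles positive history and not negative history ...
good = policy_reward(d_adv=0.9, d_coop=0.1)
# ... should be rewarded more than one that resembles negative history.
bad = policy_reward(d_adv=0.2, d_coop=0.8)
```

    Under these assumptions, the first reward term pulls generated trajectories toward the positive-outcome examples, while the second pushes them away from the negative-outcome examples; `coop_weight` is a hypothetical knob for balancing the two.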
