TRAINING NEURAL NETWORKS FOR POLICY ADAPTATION

    公开(公告)号:US20250131279A1

    公开(公告)日:2025-04-24

    申请号:US18772900

    申请日:2024-07-15

    Applicant: InstaDeep Ltd

    Abstract: Systems, storage mediums comprising instructions, and methods of training a neural network to determine solutions to an optimization problem are provided. The methods involve obtaining training data representing a plurality of instances of an optimization problem, each instance being represented by a set of state parameters. For each instance of the optimization problem, a plurality of solutions are generated, each solution being generated using a neural network conditioned on an N-dimensional vector. Training the neural network conditioned on an N-dimensional vector associated with the highest performing solution is performed. Systems, storage mediums, and methods of using an neural network trained to be conditioned on an N-dimensional vector are also provided. These methods involve a search process for identifying an N-dimensional vector selected from a vector latent space to obtain a solution for the instance of the optimization problem.

Patent Agency Ranking