REINFORCEMENT LEARNING METHOD AND APPARATUS

    公开(公告)号:US20230037632A1

    公开(公告)日:2023-02-09

    申请号:US17966985

    申请日:2022-10-17

    Abstract: A reinforcement learning method and recognition apparatus includes: obtaining a structure graph, where the structure graph includes structure information that is of an environment or the intelligent agent and that is obtained through learning; inputing a current state of the environment and the structure graph to a policy function of the intelligent agent, where the policy function is used to generate an action in response to the current state and the structure graph, and the policy function of the intelligent agent is a graph neural network; outputing the action to the environment by using the intelligent agent; obtaining, from the environment by using the intelligent agent, a next state and reward data in response to the action; training the intelligent agent through reinforcement learning based on the reward data.

Patent Agency Ranking