METHOD AND APPARATUS FOR REINFORCEMENT MACHINE LEARNING

    公开(公告)号:US20210019644A1

    公开(公告)日:2021-01-21

    申请号:US16929975

    申请日:2020-07-15

    IPC分类号: G06N7/00 G06N20/00

    摘要: A method and an apparatus for exclusive reinforcement learning are provided, comprising: collecting information of states of an environment through the communication interface and performing a statistical analysis on the states using the collected information; determining a first state value of a first state among the states in a training phase and a second state value of a second state among the states in an inference phase based on analysis results of the statistical analysis; performing reinforcement learning by using one reinforcement learning unit of a plurality of reinforcement learning unit which performs reinforcement learnings from different perspectives according to the first state value; and selecting one of actions determined by the plurality of reinforcement learning unit based on the second state value and applying selected action to the environment.