METHOD FOR FAST AND BETTER TREE SEARCH FOR REINFORCEMENT LEARNING

    公开(公告)号:US20220398283A1

    公开(公告)日:2022-12-15

    申请号:US17824680

    申请日:2022-05-25

    Abstract: A method for performing a Tree-Search (TS) on an environment is provided. The method comprises generating a tree for a current state of the environment based on a TS policy, determining a corrected TS policy, and determining an action to apply to the environment based on the corrected TS policy. The tree comprises a plurality of nodes including a root node among the plurality of nodes corresponding to the current state of the environment. Each node other than the root node among the plurality of nodes corresponding to an estimated future state of the environment. The plurality of nodes in the tree are connected by a plurality of edges. Each edge among the plurality of edges is associated with an action causing a transition from a first state to a different sate of the environment.

Patent Agency Ranking