PLANNING FOR AGENT CONTROL USING LEARNED HIDDEN STATES

    公开(公告)号:US20230073326A1

    公开(公告)日:2023-03-09

    申请号:US17794797

    申请日:2021-01-28

    Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for selecting actions to be performed by an agent interacting with an environment to cause the agent to perform a task. One of the methods includes: receiving a current observation characterizing a current environment state of the environment; performing a plurality of planning iterations to generate plan data that indicates a respective value to performing the task of the agent performing each of the set of actions in the environment and starting from the current environment state, wherein performing each planning iteration comprises selecting a sequence of actions to be performed by the agent starting from the current environment state based on outputs generated by a dynamics model and a prediction model; and selecting, from the set of actions, an action to be performed by the agent in response to the current observation based on the plan data.

Patent Agency Ranking