GENERATING ENVIRONMENT MODELS USING IN-CONTEXT ADAPTATION AND EXPLORATION

    Publication No.: US20240256884A1

    Publication Date: 2024-08-01

    Application No.: US18424687

    Filing Date: 2024-01-26

    IPC Classification: G06N3/092 G06N3/042

    CPC Classification: G06N3/092 G06N3/042

    Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for controlling an agent interacting with an environment to perform a task. In one aspect, one of the methods includes: maintaining context data; receiving a current observation characterizing a current state of the environment; generating a current graph model that represents the environment; selecting, from a set of possible actions and using the current graph model, a current action to be performed by the agent in response to the current observation; controlling the agent to perform the selected current action to cause the environment to transition from the current state into a new state; and updating the context data to include (i) data identifying the selected current action and (ii) a new observation characterizing the new state of the environment.
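
The loop described in the abstract can be sketched as follows. This is a minimal illustration under stated assumptions, not the claimed implementation: the toy environment (`TRANSITIONS`, `ACTIONS`), the tuple layout of the context data, and the helpers `build_graph`, `first_step`, `select_action`, and `run_episode` are all hypothetical names introduced here. In particular, the sketch builds the graph model directly from the recorded transitions, whereas the abstract does not say how the graph model is generated, and it uses breadth-first search with a simple "try untried actions" rule as a stand-in for action selection.

```python
from collections import deque

# Hypothetical toy environment: four states in a row, deterministic moves.
TRANSITIONS = {
    0: {"left": 0, "right": 1},
    1: {"left": 0, "right": 2},
    2: {"left": 1, "right": 3},
    3: {"left": 2, "right": 3},
}
ACTIONS = ["left", "right"]


def build_graph(context):
    """Build a graph model (state -> action -> next state) from context data."""
    graph = {}
    for state, action, next_state in context:
        graph.setdefault(state, {})[action] = next_state
        graph.setdefault(next_state, {})
    return graph


def first_step(graph, start, is_target):
    """BFS over known edges; return the first action on a shortest path
    from start to a different state satisfying is_target, or None."""
    queue = deque([(start, None)])
    seen = {start}
    while queue:
        node, first = queue.popleft()
        if node != start and is_target(node):
            return first
        for action, nxt in graph.get(node, {}).items():
            if nxt not in seen:
                seen.add(nxt)
                queue.append((nxt, first if first is not None else action))
    return None


def select_action(graph, state, goal):
    """Assumed selection rule: plan to the goal if the graph already reaches it,
    otherwise explore the nearest untried action."""
    action = first_step(graph, state, lambda s: s == goal)
    if action is not None:
        return action
    untried = [a for a in ACTIONS if a not in graph.get(state, {})]
    if untried:
        return untried[0]
    has_untried = lambda s: any(a not in graph.get(s, {}) for a in ACTIONS)
    action = first_step(graph, state, has_untried)
    return action if action is not None else ACTIONS[0]


def run_episode(start=0, goal=3, max_steps=20):
    context = []  # maintained context data: (observation, action, new observation)
    state = start  # current observation
    for _ in range(max_steps):
        if state == goal:
            break
        graph = build_graph(context)                 # generate current graph model
        action = select_action(graph, state, goal)   # select current action
        new_state = TRANSITIONS[state][action]       # agent acts, environment transitions
        context.append((state, action, new_state))   # update context data
        state = new_state
    return state, context


final_state, history = run_episode()
```

Because the graph is rebuilt from the context at every step, each new transition immediately changes what the planner can reach, which mirrors the maintain, observe, model, act, update cycle in the abstract.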