Invention Application
- Patent Title: CONTROLLING AGENTS USING CAUSALLY CORRECT ENVIRONMENT MODELS
-
Application No.: US17763914Application Date: 2020-09-24
-
Publication No.: US20220366246A1Publication Date: 2022-11-17
- Inventor: Ivo Danihelka , Danilo Jimenez Rezende , Karol Gregor , Georgios Papamakarios , Theophane Guillaume Weber
- Applicant: DeepMind Technologies Limited
- Applicant Address: GB London
- Assignee: DeepMind Technologies Limited
- Current Assignee: DeepMind Technologies Limited
- Current Assignee Address: GB London
- International Application: PCT/EP2020/076664 WO 20200924
- Main IPC: G06N3/08
- IPC: G06N3/08

Abstract:
Methods, systems, and apparatus, including computer programs encoded on computer storage media, for using an environment model to simulate state transitions of an environment being interacted with by an agent that is controlled using a policy neural network. One of the methods includes initializing an internal representation of a state of the environment at a current time point; repeatedly performing the following operations: receiving an action to be performed by the agent; generating, based on the internal representation, a predicted latent representation that is a prediction of a latent representation that would have been generated by the policy neural network by processing an observation characterizing the state of the environment corresponding to the internal representation; and updating the internal representation to simulate a state transition caused by the agent performing the received action by processing the predicted latent representation and the received action using the environment model.
Information query