CONTROLLING AGENTS USING CAUSALLY CORRECT ENVIRONMENT MODELS

Invention Application

US20220366246A1 CONTROLLING AGENTS USING CAUSALLY CORRECT ENVIRONMENT MODELS (In force)

Patent Title: CONTROLLING AGENTS USING CAUSALLY CORRECT ENVIRONMENT MODELS
Application No.: US17763914

Application Date: 2020-09-24
Publication No.: US20220366246A1

Publication Date: 2022-11-17
Inventor: Ivo Danihelka, Danilo Jimenez Rezende, Karol Gregor, Georgios Papamakarios, Theophane Guillaume Weber
Applicant: DeepMind Technologies Limited
Applicant Address: GB London
Assignee: DeepMind Technologies Limited
Current Assignee: DeepMind Technologies Limited
Current Assignee Address: GB London
International Application: PCT/EP2020/076664 (WO), 2020-09-24
Main IPC: G06N3/08
IPC: G06N3/08

Abstract:

Methods, systems, and apparatus, including computer programs encoded on computer storage media, for using an environment model to simulate state transitions of an environment being interacted with by an agent that is controlled using a policy neural network. One of the methods includes initializing an internal representation of a state of the environment at a current time point; and repeatedly performing the following operations: receiving an action to be performed by the agent; generating, based on the internal representation, a predicted latent representation that is a prediction of a latent representation that would have been generated by the policy neural network by processing an observation characterizing the state of the environment corresponding to the internal representation; and updating the internal representation to simulate a state transition caused by the agent performing the received action by processing the predicted latent representation and the received action using the environment model.
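
The loop described in the abstract can be illustrated with a minimal Python sketch. Everything named here is hypothetical and chosen only for illustration: `ToyEnvironmentModel`, its random tanh layers, the zero initial state, and the one-hot action encoding are assumptions, not the architecture disclosed in the application. The sketch only shows the order of operations: initialize the internal representation, then for each received action predict the policy network's latent representation from the internal representation and feed that prediction, together with the action, into the model update.

```python
import math
import random
from typing import List, Sequence

Vector = List[float]


def _linear_tanh(weights: Sequence[Sequence[float]], x: Sequence[float]) -> Vector:
    """Bias-free linear map followed by tanh (toy stand-in for a neural network)."""
    return [math.tanh(sum(w * xi for w, xi in zip(row, x))) for row in weights]


class ToyEnvironmentModel:
    """Hypothetical environment model with fixed random weights (illustration only)."""

    def __init__(self, state_dim: int, latent_dim: int, num_actions: int, seed: int = 0) -> None:
        rng = random.Random(seed)

        def matrix(rows: int, cols: int) -> List[Vector]:
            return [[rng.uniform(-0.5, 0.5) for _ in range(cols)] for _ in range(rows)]

        self.state_dim = state_dim
        self.num_actions = num_actions
        # Maps the internal representation to a predicted policy latent.
        self.latent_head = matrix(latent_dim, state_dim)
        # Maps [internal representation, predicted latent, one-hot action] to the next state.
        self.transition = matrix(state_dim, state_dim + latent_dim + num_actions)

    def initial_state(self) -> Vector:
        # Assumption: start from an all-zero internal representation.
        return [0.0] * self.state_dim

    def predict_latent(self, h: Sequence[float]) -> Vector:
        # Prediction of the latent the policy network would have produced
        # from an observation of the state that h represents.
        return _linear_tanh(self.latent_head, h)

    def update(self, h: Sequence[float], z_hat: Sequence[float], action: int) -> Vector:
        # Simulate the state transition caused by the agent performing `action`.
        one_hot = [1.0 if i == action else 0.0 for i in range(self.num_actions)]
        return _linear_tanh(self.transition, list(h) + list(z_hat) + one_hot)


def simulate(model: ToyEnvironmentModel, actions: Sequence[int]) -> List[Vector]:
    """Roll the model forward over a sequence of received actions."""
    h = model.initial_state()
    states: List[Vector] = []
    for action in actions:
        z_hat = model.predict_latent(h)     # predicted latent representation
        h = model.update(h, z_hat, action)  # updated internal representation
        states.append(h)
    return states
```

For example, `simulate(ToyEnvironmentModel(state_dim=4, latent_dim=3, num_actions=2), actions=[0, 1, 1])` returns three simulated internal representations, one per action. No real observation is consumed during the rollout; the model conditions only on its own predicted latent and the action.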

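IPC Classification:

G	Physics
G06	Computing; calculating or counting
G06N	Computer systems based on specific computational models
G06N3/00	Computer systems based on biological models
G06N3/02	.using neural network models
G06N3/08	..learning methods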