-
公开(公告)号:US20240256883A1
公开(公告)日:2024-08-01
申请号:US18424561
申请日:2024-01-26
Applicant: DeepMind Technologies Limited
Inventor: Thomas Mesnard , Remi Munos , Alaa Saade , Yunhao Tang , Mark Daniel Rowland , Theophane Guillaume Weber , Wenqi Chen
IPC: G06N3/092
CPC classification number: G06N3/092
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for training a neural network used to select actions to be performed by an agent interacting with an environment. Implementations of the system can take into account a level of luck in the environment, and hence whilst learning can account for outcomes that were caused by external factors as well as those dependent on the actions of the agent.