REINFORCEMENT LEARNING WITH AUXILIARY TASKS

Invention Publication

US20240144015A1 REINFORCEMENT LEARNING WITH AUXILIARY TASKS 审中-公开

Please log in to see more content

Patent Title: REINFORCEMENT LEARNING WITH AUXILIARY TASKS
Application No.: US18386954

Application Date: 2023-11-03
Publication No.: US20240144015A1

Publication Date: 2024-05-02
Inventor: Volodymyr Mnih , Wojciech Czarnecki , Maxwell Elliot Jaderberg , Tom Schaul , David Silver , Koray Kavukcuoglu
Applicant: DeepMind Technologies Limited
Applicant Address: GB London
Assignee: DeepMind Technologies Limited
Current Assignee: DeepMind Technologies Limited
Current Assignee Address: GB London
Main IPC: G06N3/084
IPC: G06N3/084 ; G06N3/006 ; G06N3/044 ; G06N3/045 ; G06N20/00

REINFORCEMENT LEARNING WITH AUXILIARY TASKS

Abstract:

Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for training a reinforcement learning system. The method includes: training an action selection policy neural network, and during the training of the action selection neural network, training one or more auxiliary control neural networks and a reward prediction neural network. Each of the auxiliary control neural networks is configured to receive a respective intermediate output generated by the action selection policy neural network and generate a policy output for a corresponding auxiliary control task. The reward prediction neural network is configured to receive one or more intermediate outputs generated by the action selection policy neural network and generate a corresponding predicted reward. Training each of the auxiliary control neural networks and the reward prediction neural network comprises adjusting values of the respective auxiliary control parameters, reward prediction parameters, and the action selection policy network parameters.

Information query

Global Dossier Espacenet