Action selection for reinforcement learning using neural networks

Invention Grant

US10679126B2 Action selection for reinforcement learning using neural networks 有权

Please log in to see more content

Patent Title: Action selection for reinforcement learning using neural networks
Application No.: US16511571

Application Date: 2019-07-15
Publication No.: US10679126B2

Publication Date: 2020-06-09
Inventor: Simon Osindero , Koray Kavukcuoglu , Alexander Vezhnevets
Applicant: DeepMind Technologies Limited
Applicant Address: GB London
Assignee: DeepMind Technologies Limited
Current Assignee: DeepMind Technologies Limited
Current Assignee Address: GB London
Agency: Fish & Richardson P.C.
Main IPC: G06N3/08
IPC: G06N3/08 ; G06N3/04

Action selection for reinforcement learning using neural networks

Abstract:

Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for a system configured to select actions to be performed by an agent that interacts with an environment. The system comprises a manager neural network subsystem and a worker neural network subsystem. The manager subsystem is configured to, at each of the multiple time steps, generate a final goal vector for the time step. The worker subsystem is configured to, at each of multiple time steps, use the final goal vector generated by the manager subsystem to generate a respective action score for each action in a predetermined set of actions.

Public/Granted literature

US20190340509A1 ACTION SELECTION FOR REINFORCEMENT LEARNING USING NEURAL NETWORKS Public/Granted day:2019-11-07

Information query

Espacenet

IPC分类:

G	物理
G06	计算；推算或计数
G06N	基于特定计算模型的计算机系统
G06N3/00	基于生物学模型的计算机系统
G06N3/02	.采用神经网络模型
G06N3/08	..学习方法