Invention Grant
- Patent Title: Environment navigation using reinforcement learning
-
Application No.: US16403343Application Date: 2019-05-03
-
Publication No.: US10572776B2Publication Date: 2020-02-25
- Inventor: Fabio Viola , Piotr Wojciech Mirowski , Andrea Banino , Razvan Pascanu , Hubert Josef Soyer , Andrew James Ballard , Sudarshan Kumaran , Raia Thais Hadsell , Laurent Sifre , Rostislav Goroshin , Koray Kavukcuoglu , Misha Man Ray Denil
- Applicant: DeepMind Technologies Limited
- Applicant Address: GB London
- Assignee: DeepMind Technologies Limited
- Current Assignee: DeepMind Technologies Limited
- Current Assignee Address: GB London
- Agency: Fish & Richardson P.C.
- Main IPC: G06K9/00
- IPC: G06K9/00 ; G06K9/62 ; G06N3/04 ; G06N3/08 ; G06N3/00 ; G06T7/50 ; G06T7/70

Abstract:
Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for training a reinforcement learning system. In one aspect, a method of training an action selection policy neural network for use in selecting actions to be performed by an agent navigating through an environment to accomplish one or more goals comprises: receiving an observation image characterizing a current state of the environment; processing, using the action selection policy neural network, an input comprising the observation image to generate an action selection output; processing, using a geometry-prediction neural network, an intermediate output generated by the action selection policy neural network to predict a value of a feature of a geometry of the environment when in the current state; and backpropagating a gradient of a geometry-based auxiliary loss into the action selection policy neural network to determine a geometry-based auxiliary update for current values of the network parameters.
Public/Granted literature
- US20190266449A1 ENVIRONMENT NAVIGATION USING REINFORCEMENT LEARNING Public/Granted day:2019-08-29
Information query