Invention Grant
- Patent Title: Multi-agent reinforcement learning with matchmaking policies
-
Application No.: US18131567Application Date: 2023-04-06
-
Publication No.: US12067491B2Publication Date: 2024-08-20
- Inventor: David Silver , Oriol Vinyals , Maxwell Elliot Jaderberg
- Applicant: DeepMind Technologies Limited
- Applicant Address: GB London
- Assignee: DeepMind Technologies Limited
- Current Assignee: DeepMind Technologies Limited
- Current Assignee Address: GB London
- Agency: Fish & Richardson P.C.
- Main IPC: G06N20/00
- IPC: G06N20/00 ; G06F18/214 ; G06N3/08 ; H04L9/40

Abstract:
Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for training a policy neural network having a plurality of policy parameters and used to select actions to be performed by an agent to control the agent to perform a particular task while interacting with one or more other agents in an environment. In one aspect, the method includes: maintaining data specifying a pool of candidate action selection policies; maintaining data specifying respective matchmaking policy; and training the policy neural network using a reinforcement learning technique to update the policy parameters. The policy parameters define policies to be used in controlling the agent to perform the particular task.
Public/Granted literature
- US20230244936A1 MULTI-AGENT REINFORCEMENT LEARNING WITH MATCHMAKING POLICIES Public/Granted day:2023-08-03
Information query