Multi-agent reinforcement learning with matchmaking policies

Invention Grant

US12067491B2 Multi-agent reinforcement learning with matchmaking policies 有权

Please log in to see more content

Patent Title: Multi-agent reinforcement learning with matchmaking policies
Application No.: US18131567

Application Date: 2023-04-06
Publication No.: US12067491B2

Publication Date: 2024-08-20
Inventor: David Silver , Oriol Vinyals , Maxwell Elliot Jaderberg
Applicant: DeepMind Technologies Limited
Applicant Address: GB London
Assignee: DeepMind Technologies Limited
Current Assignee: DeepMind Technologies Limited
Current Assignee Address: GB London
Agency: Fish & Richardson P.C.
Main IPC: G06N20/00
IPC: G06N20/00 ; G06F18/214 ; G06N3/08 ; H04L9/40

Multi-agent reinforcement learning with matchmaking policies

Abstract:

Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for training a policy neural network having a plurality of policy parameters and used to select actions to be performed by an agent to control the agent to perform a particular task while interacting with one or more other agents in an environment. In one aspect, the method includes: maintaining data specifying a pool of candidate action selection policies; maintaining data specifying respective matchmaking policy; and training the policy neural network using a reinforcement learning technique to update the policy parameters. The policy parameters define policies to be used in controlling the agent to perform the particular task.

Public/Granted literature

US20230244936A1 MULTI-AGENT REINFORCEMENT LEARNING WITH MATCHMAKING POLICIES Public/Granted day:2023-08-03

Information query

Espacenet

IPC分类:

G	物理
G06	计算；推算或计数
G06N	基于特定计算模型的计算机系统
G06N20/00	机器学习