METHODS AND SYSTEMS THAT SAFELY IMPLEMENT CONTROL POLICIES WITHIN REINFORCEMENT-LEARNING-BASED MANAGEMENT-SYSTEM AGENTS

Invention Publication

US20240046069A1 METHODS AND SYSTEMS THAT SAFELY IMPLEMENT CONTROL POLICIES WITHIN REINFORCEMENT-LEARNING-BASED MANAGEMENT-SYSTEM AGENTS 审中-公开

Please log in to see more content

Patent Title: METHODS AND SYSTEMS THAT SAFELY IMPLEMENT CONTROL POLICIES WITHIN REINFORCEMENT-LEARNING-BASED MANAGEMENT-SYSTEM AGENTS
Application No.: US17970830

Application Date: 2022-10-21
Publication No.: US20240046069A1

Publication Date: 2024-02-08
Inventor: MARIUS VILCU , Peter Rudy , Asmitha Rathis , Aiswaryaa Venugopalan
Applicant: VMWARE, INC.
Applicant Address: US CA Palo Alto
Assignee: VMWARE, INC.
Current Assignee: VMWARE, INC.
Current Assignee Address: US CA Palo Alto
Priority: IN 2241042727 2022.07.26
Main IPC: G06N3/04
IPC: G06N3/04

METHODS AND SYSTEMS THAT SAFELY IMPLEMENT CONTROL POLICIES WITHIN REINFORCEMENT-LEARNING-BASED MANAGEMENT-SYSTEM AGENTS

Abstract:

The current document is directed to reinforcement-learning-based management-system agents that control distributed applications and the infrastructure environments in which they run. Management-system agents are initially trained in simulated environments and specialized training environments before being deployed to live, target distributed computer systems where they operate in a controller mode in which they do not explore the control-state space or attempt to learn better policies and value functions, but instead produce traces that are collected and stored for subsequent use. Each deployed management-system agent is associated with a twin training agent that uses the collected traces produced by the deployed management-system agent for optimizing its policy and value functions. To further ensure safe operational control of the environment, the management-system agents employ lookahead planning, action budgets, and action constraints to forestall issuance, by management-system controllers, of potentially deleterious actions.

Information query

Global Dossier Espacenet

IPC分类:

G	物理
G06	计算；推算或计数
G06N	基于特定计算模型的计算机系统
G06N3/00	基于生物学模型的计算机系统
G06N3/02	.采用神经网络模型
G06N3/04	..体系结构，例如，互连拓扑