Adversarial automated reinforcement-learning-based application-manager training

Invention Grant

US10977579B2 Adversarial automated reinforcement-learning-based application-manager training (Active)

Patent Title: Adversarial automated reinforcement-learning-based application-manager training
Application No.: US16518807

Application Date: 2019-07-22
Publication No.: US10977579B2

Publication Date: 2021-04-13
Inventor: Dev Nag, Yanislav Yankov, Dongni Wang, Gregory T. Burk, Nicholas Mark Grant Stephen
Applicant: VMware, Inc.
Applicant Address: US CA Palo Alto
Assignee: VMware, Inc.
Current Assignee: VMware, Inc.
Current Assignee Address: US CA Palo Alto
Main IPC: G06N20/00
IPC: G06N20/00; G06F9/54; G06N7/00

Abstract:

The current document is directed to automated reinforcement-learning-based application managers that are trained using adversarial training. During adversarial training, potentially disadvantageous next actions are selected for issuance by an automated reinforcement-learning-based application manager at a lower frequency than next actions selected according to a policy that is learned to provide optimal or near-optimal control over a computing environment that includes one or more applications controlled by the automated reinforcement-learning-based application manager. Because disadvantageous actions are occasionally selected, the automated reinforcement-learning-based application manager is forced to explore a much larger subset of the system-state space during training. Upon completion of training, the automated reinforcement-learning-based application manager has therefore learned a more robust and complete optimal or near-optimal control policy than it would have learned had it been trained by simulators or using management actions and computing-environment responses recorded during previous controlled operation of a computing environment.
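
The action-selection idea in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation. The function name `select_next_action`, the `q_values` table, the action names, and the `adversarial_prob` parameter are all hypothetical, and treating the action with the lowest estimated value as the "potentially disadvantageous" one is an assumption, since the abstract does not say how such actions are identified.

```python
import random
from typing import Dict, Hashable, Optional


def select_next_action(
    state: Hashable,
    q_values: Dict[Hashable, Dict[str, float]],
    adversarial_prob: float = 0.1,
    rng: Optional[random.Random] = None,
) -> str:
    """Pick the next action to issue during adversarial training.

    With probability `adversarial_prob` the action with the lowest
    estimated value is returned (assumed here to stand in for a
    "potentially disadvantageous" action); otherwise the action the
    current policy prefers (highest estimated value) is returned.
    """
    # Adversarial actions must be chosen less often than policy actions.
    if not 0.0 <= adversarial_prob < 0.5:
        raise ValueError("adversarial_prob must be in [0.0, 0.5)")

    rng = rng or random.Random()
    action_values = q_values[state]

    if rng.random() < adversarial_prob:
        # Adversarial branch: push the environment toward rarely seen states.
        return min(action_values, key=lambda a: action_values[a])
    # Policy branch: follow the learned policy's preferred action.
    return max(action_values, key=lambda a: action_values[a])


if __name__ == "__main__":
    # Hypothetical value estimates for a single state of a managed application.
    q = {"s0": {"scale_up": 1.0, "noop": 0.2, "kill_node": -3.0}}
    rng = random.Random(0)
    picks = [select_next_action("s0", q, 0.2, rng) for _ in range(1000)]
    print({action: picks.count(action) for action in q["s0"]})
```

Setting `adversarial_prob` to 0.0 reduces the sketch to purely greedy selection. Raising it trades short-term control quality during training for broader coverage of the system-state space.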

Public/Granted literature

US20200065703A1 ADVERSARIAL AUTOMATED REINFORCEMENT-LEARNING-BASED APPLICATION-MANAGER TRAINING Public/Granted day: 2020-02-27

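IPC classification:

G	Physics
G06	Computing; calculating or counting
G06N	Computing arrangements based on specific computational models
G06N20/00	Machine learning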