Lifelong learning with a changing action set

Invention Grant

US11501207B2 Lifelong learning with a changing action set 有权

Please log in to see more content

Patent Title: Lifelong learning with a changing action set
Application No.: US16578913

Application Date: 2019-09-23
Publication No.: US11501207B2

Publication Date: 2022-11-15
Inventor: Georgios Theocharous , Yash Chandak
Applicant: ADOBE INC.
Applicant Address: US CA San Jose
Assignee: ADOBE INC.
Current Assignee: ADOBE INC.
Current Assignee Address: US CA San Jose
Agency: F. Chau & Associates, LLC
Main IPC: G06N20/00
IPC: G06N20/00 ; G06N7/00 ; G06N5/04

Lifelong learning with a changing action set

Abstract:

Systems and methods are described for a decision-making process that includes an increasing set of actions, compute a policy function for a Markov decision process (MDP) for the decision-making process, wherein the policy function is computed based on a state conditional function mapping states into an embedding space, an inverse dynamics function mapping state transitions into the embedding space, and an action selection function mapping the elements of the embedding space to actions, identify an additional set of actions in the increasing set of actions, update the inverse dynamics function based at least in part on the additional set of actions, update the policy function based on the updated inverse dynamics function and parameters learned during the computing the policy function, and select an action based on the updated policy function.

Public/Granted literature

US20210089958A1 LIFELONG LEARNING WITH A CHANGING ACTION SET Public/Granted day:2021-03-25

Information query

Espacenet

IPC分类:

G	物理
G06	计算；推算或计数
G06N	基于特定计算模型的计算机系统
G06N20/00	机器学习