Invention Grant
- Patent Title: Lifelong learning with a changing action set
-
Application No.: US16578913Application Date: 2019-09-23
-
Publication No.: US11501207B2Publication Date: 2022-11-15
- Inventor: Georgios Theocharous , Yash Chandak
- Applicant: ADOBE INC.
- Applicant Address: US CA San Jose
- Assignee: ADOBE INC.
- Current Assignee: ADOBE INC.
- Current Assignee Address: US CA San Jose
- Agency: F. Chau & Associates, LLC
- Main IPC: G06N20/00
- IPC: G06N20/00 ; G06N7/00 ; G06N5/04

Abstract:
Systems and methods are described for a decision-making process that includes an increasing set of actions, compute a policy function for a Markov decision process (MDP) for the decision-making process, wherein the policy function is computed based on a state conditional function mapping states into an embedding space, an inverse dynamics function mapping state transitions into the embedding space, and an action selection function mapping the elements of the embedding space to actions, identify an additional set of actions in the increasing set of actions, update the inverse dynamics function based at least in part on the additional set of actions, update the policy function based on the updated inverse dynamics function and parameters learned during the computing the policy function, and select an action based on the updated policy function.
Public/Granted literature
- US20210089958A1 LIFELONG LEARNING WITH A CHANGING ACTION SET Public/Granted day:2021-03-25
Information query