Optimal sequential decision making with changing action space

Invention Grant

US12111884B2 Optimal sequential decision making with changing action space 有权

Please log in to see more content

Patent Title: Optimal sequential decision making with changing action space
Application No.: US17659983

Application Date: 2022-04-20
Publication No.: US12111884B2

Publication Date: 2024-10-08
Inventor: Tanay Anand , Pinkesh Badjatiya , Sriyash Poddar , Jayakumar Subramanian , Georgios Theocharous , Balaji Krishnamurthy
Applicant: ADOBE INC.
Applicant Address: US CA San Jose
Assignee: ADOBE INC.
Current Assignee: ADOBE INC.
Current Assignee Address: US CA San Jose
Agency: F. CHAU & ASSOCIATES, LLC
Main IPC: G06F18/2137
IPC: G06F18/2137 ; G06N3/088

Optimal sequential decision making with changing action space

Abstract:

Systems and methods for machine learning are described. Embodiments of the present disclosure receive state information that describes a state of a decision making agent in an environment; compute an action vector from an action embedding space based on the state information using a policy neural network of the decision making agent, wherein the policy neural network is trained using reinforcement learning based on a topology loss that constrains changes in a mapping between an action set and the action embedding space; and perform an action that modifies the state of the decision making agent in the environment based on the action vector, wherein the action is selected based on the mapping.

Public/Granted literature

US20230342425A1 OPTIMAL SEQUENTIAL DECISION MAKING WITH CHANGING ACTION SPACE Public/Granted day:2023-10-26

Information query

Espacenet