Invention Grant
- Patent Title: Efficient off-policy credit assignment
-
Application No.: US16653890Application Date: 2019-10-15
-
Publication No.: US11580445B2Publication Date: 2023-02-14
- Inventor: Hao Liu , Richard Socher , Caiming Xiong
- Applicant: salesforce.com, inc.
- Applicant Address: US CA San Francisco
- Assignee: salesforce.com, inc.
- Current Assignee: salesforce.com, inc.
- Current Assignee Address: US CA San Francisco
- Agency: Haynes and Boone LLP
- Main IPC: G06N20/00
- IPC: G06N20/00 ; G06N5/02

Abstract:
Systems and methods are provided for efficient off-policy credit assignment (ECA) in reinforcement learning. ECA allows principled credit assignment for off-policy samples, and therefore improves sample efficiency and asymptotic performance. One aspect of ECA is to formulate the optimization of expected return as approximate inference, where policy is approximating a learned prior distribution, which leads to a principled way of utilizing off-policy samples. Other features are also provided.
Public/Granted literature
- US20200285993A1 Efficient Off-Policy Credit Assignment Public/Granted day:2020-09-10
Information query