Invention Grant
- Patent Title: Online partially rewarded learning
- Application No.: US16554344
- Application Date: 2019-08-28
- Publication No.: US11508480B2
- Publication Date: 2022-11-22
- Inventor: Sohini Upadhyay, Mikhail Yurochkin, Mayank Agarwal, Djallel Bouneffouf, Yasaman Khazaeni
- Applicant: International Business Machines Corporation
- Applicant Address: US NY Armonk
- Assignee: International Business Machines Corporation
- Current Assignee: International Business Machines Corporation
- Current Assignee Address: US NY Armonk
- Agency: Otterstedt & Kammer PLLC
- Agent: Anthony Curro
- Main IPC: G16H50/20
- IPC: G16H50/20; G16H10/20; G06N3/08; G06F17/16; G06F16/901; G06F17/15; G06N20/00

Abstract:
A feature vector characterizing a system to be analyzed via online partially rewarded machine learning is obtained. Based on the feature vector, a decision is made, via the machine learning, using an online policy. The system is observed for environmental feedback. In at least a first instance, wherein the observing indicates that the environmental feedback is available, the environmental feedback is obtained. In at least a second instance, wherein the observing indicates that the environmental feedback is missing, the environmental feedback is imputed via an online imputation method. The online policy is updated based on results of the obtained environmental feedback and the online imputation method. A decision is output based on the updated online policy.
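The loop described in the abstract (decide, observe, impute when feedback is missing, update) can be illustrated with a minimal Python sketch. This is not the patented implementation: the class name `OnlinePartiallyRewardedLearner`, the epsilon-greedy linear policy, and the running-mean imputer are all hypothetical choices made only to show the control flow, and the abstract does not specify any of them.

```python
import random


class OnlinePartiallyRewardedLearner:
    """Illustrative sketch only: linear per-action reward models (assumed)
    with a running-mean reward imputer (assumed)."""

    def __init__(self, n_actions, n_features, lr=0.1, epsilon=0.1, seed=0):
        self.n_actions = n_actions
        self.lr = lr
        self.epsilon = epsilon
        self.rng = random.Random(seed)
        # Online policy state: one weight vector per action.
        self.weights = [[0.0] * n_features for _ in range(n_actions)]
        # Imputer state: running sum and count of observed rewards per action.
        self.reward_sum = [0.0] * n_actions
        self.reward_count = [0] * n_actions

    def _score(self, action, x):
        return sum(w * xi for w, xi in zip(self.weights[action], x))

    def decide(self, x):
        """Make a decision from the feature vector using the online policy."""
        if self.rng.random() < self.epsilon:
            return self.rng.randrange(self.n_actions)  # explore
        scores = [self._score(a, x) for a in range(self.n_actions)]
        return max(range(self.n_actions), key=scores.__getitem__)  # exploit

    def impute(self, action):
        """Impute missing feedback as the mean reward seen so far for the action."""
        if self.reward_count[action] == 0:
            return 0.0
        return self.reward_sum[action] / self.reward_count[action]

    def update(self, x, action, feedback=None):
        """Update the policy with observed feedback, or an imputed value if None.

        Returns True when the feedback had to be imputed.
        """
        imputed = feedback is None
        if imputed:
            reward = self.impute(action)
        else:
            reward = feedback
            self.reward_sum[action] += reward
            self.reward_count[action] += 1
        # One stochastic-gradient step on the squared prediction error.
        error = reward - self._score(action, x)
        for i, xi in enumerate(x):
            self.weights[action][i] += self.lr * error * xi
        return imputed


# Usage: one round with feedback available, one with feedback missing.
learner = OnlinePartiallyRewardedLearner(n_actions=2, n_features=2, epsilon=0.0)
features = [1.0, 0.0]
action = learner.decide(features)
learner.update(features, action, feedback=1.0)   # feedback observed
learner.update(features, action, feedback=None)  # feedback missing, imputed
decision = learner.decide(features)              # decision from updated policy
```

Only genuinely observed rewards feed the imputer's running mean in this sketch, so imputed values do not reinforce themselves; whether the patented method makes the same choice is not stated in the abstract.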
Public/Granted literature
- US20210065897A1 ONLINE PARTIALLY REWARDED LEARNING Public/Granted day: 2021-03-04