- 专利标题: Online partially rewarded learning
-
申请号: US16554344申请日: 2019-08-28
-
公开(公告)号: US11508480B2公开(公告)日: 2022-11-22
- 发明人: Sohini Upadhyay , Mikhail Yurochkin , Mayank Agarwal , Djallel Bouneffouf , Yasaman Khazaeni
- 申请人: International Business Machines Corporation
- 申请人地址: US NY Armonk
- 专利权人: International Business Machines Corporation
- 当前专利权人: International Business Machines Corporation
- 当前专利权人地址: US NY Armonk
- 代理机构: Otterstedt & Kammer PLLC
- 代理商 Anthony Curro
- 主分类号: G16H50/20
- IPC分类号: G16H50/20 ; G16H10/20 ; G06N3/08 ; G06F17/16 ; G06F16/901 ; G06F17/15 ; G06N20/00
摘要:
A feature vector characterizing a system to be analyzed via online partially rewarded machine learning is obtained. Based on the feature vector, a decision is made, via the machine learning, using an online policy. The system is observed for environmental feedback. In at least a first instance, wherein the observing indicates that the environmental feedback is available, the environmental feedback is obtained. In at least a second instance, wherein the observing indicates that the environmental feedback is missing, the environmental feedback is imputed via an online imputation method. the online policy is updated based on results of the obtained environmental feedback and the online imputation method. A decision is output based on the updated online policy.
公开/授权文献
- US20210065897A1 ONLINE PARTIALLY REWARDED LEARNING 公开/授权日:2021-03-04
信息查询