Online partially rewarded learning

Invention Grant

US11508480B2 Online partially rewarded learning 有权

Please log in to see more content

Patent Title: Online partially rewarded learning
Application No.: US16554344

Application Date: 2019-08-28
Publication No.: US11508480B2

Publication Date: 2022-11-22
Inventor: Sohini Upadhyay , Mikhail Yurochkin , Mayank Agarwal , Djallel Bouneffouf , Yasaman Khazaeni
Applicant: International Business Machines Corporation
Applicant Address: US NY Armonk
Assignee: International Business Machines Corporation
Current Assignee: International Business Machines Corporation
Current Assignee Address: US NY Armonk
Agency: Otterstedt & Kammer PLLC
Agent Anthony Curro
Main IPC: G16H50/20
IPC: G16H50/20 ; G16H10/20 ; G06N3/08 ; G06F17/16 ; G06F16/901 ; G06F17/15 ; G06N20/00

Abstract:

A feature vector characterizing a system to be analyzed via online partially rewarded machine learning is obtained. Based on the feature vector, a decision is made, via the machine learning, using an online policy. The system is observed for environmental feedback. In at least a first instance, wherein the observing indicates that the environmental feedback is available, the environmental feedback is obtained. In at least a second instance, wherein the observing indicates that the environmental feedback is missing, the environmental feedback is imputed via an online imputation method. the online policy is updated based on results of the obtained environmental feedback and the online imputation method. A decision is output based on the updated online policy.

Public/Granted literature

US20210065897A1 ONLINE PARTIALLY REWARDED LEARNING Public/Granted day:2021-03-04

Information query

Espacenet

IPC分类:

G	物理
G16	特别适用于特定应用领域的信息通信技术
G16H	医疗保健信息学，即专门用于处置或处理医疗或健康数据的信息和通信技术[ICT]
G16H50/00	专门适用于医疗诊断，医学模拟或医疗数据挖掘的ICT；专门适用于检测、监测或建模流行病或传染病
G16H50/20	.用于计算机辅助诊断，例如医疗专家系统