Invention Application
- Patent Title: OFF-POLICY CONTROL POLICY EVALUATION
-
Application No.: US16827596Application Date: 2020-03-23
-
Publication No.: US20200304545A1Publication Date: 2020-09-24
- Inventor: Kanury Kanishka Rao , Konstantinos Bousmalis , Christopher K. Harris , Alexander Irpan , Sergey Vladimir Levine , Julian Ibarz
- Applicant: Google LLC
- Priority: com.zzzhc.datahub.patent.etl.us.BibliographicData$PriorityClaim@afbfb9f
- Main IPC: H04L29/06
- IPC: H04L29/06 ; G06K9/62 ; G06N3/04 ; G06N3/08

Abstract:
Methods, systems, and apparatus, including computer programs encoded on computer storage media, for off-policy evaluation of a control policy. One of the methods includes obtaining policy data specifying a control policy for controlling a source agent interacting with a source environment to perform a particular task; obtaining a validation data set generated from interactions of a target agent in a target environment; determining a performance estimate that represents an estimate of a performance of the control policy in controlling the target agent to perform the particular task in the target environment; and determining, based on the performance estimate, whether to deploy the control policy for controlling the target agent to perform the particular task in the target environment.
Public/Granted literature
- US11477243B2 Off-policy control policy evaluation Public/Granted day:2022-10-18
Information query