1.
Publication No.: US20210341886A1
Publication Date: 2021-11-04
Application No.: US17319442
Filing Date: 2021-05-13
Applicant: Huawei Technologies Co., Ltd.
Inventor: Lifeng Liu, Yingxuan Zhu, Jun Zhang, Xiaofian Yin, Jian Li, Yongxiang Tao, Dayao Liang
IPC: G05B13/02, G06F16/2457, B60W50/06
Abstract: A computer implemented method for self-learning of a control system. The method includes creating an initial knowledge base. The method learns first principles using the knowledge base. The method creates initial control commands derived from the knowledge base. The method generates constraints for the control commands. The method performs constrained reinforcement learning by executing the control commands with the constraints and observing feedback to improve the control commands. The method enriches the knowledge base based on the feedback.
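The loop described in the abstract can be illustrated with a minimal sketch. This is not the method claimed in the application: the one-dimensional plant, the proportional controller, the command limit, and every name below (`plant`, `clamp`, `self_learning_loop`, the `knowledge_base` fields) are hypothetical, and a simple random search stands in for the constrained reinforcement learning step.

```python
import random


def clamp(value, low, high):
    """Project a command onto the allowed interval [low, high]."""
    return max(low, min(high, value))


def plant(state, command):
    """Toy system (an assumption): the state simply moves by the command."""
    return state + command


def self_learning_loop(target=10.0, episodes=200, seed=0):
    rng = random.Random(seed)

    # Initial knowledge base: a prior controller gain and a command limit.
    # Both values are invented for illustration.
    knowledge_base = {"gain": 0.1, "max_command": 2.0, "history": []}

    # Initial control policy derived from the knowledge base.
    gain = knowledge_base["gain"]

    # Constraints generated for the control commands.
    low = -knowledge_base["max_command"]
    high = knowledge_base["max_command"]

    def run_episode(g):
        """Execute constrained commands and return the accumulated tracking error."""
        state, cost = 0.0, 0.0
        for _ in range(20):
            command = clamp(g * (target - state), low, high)
            state = plant(state, command)
            cost += abs(target - state)
        return cost

    initial_cost = run_episode(gain)
    best_cost = initial_cost

    # Random-search stand-in for constrained reinforcement learning:
    # perturb the gain, observe feedback, keep only improvements.
    for _ in range(episodes):
        candidate = gain + rng.gauss(0.0, 0.05)
        cost = run_episode(candidate)
        # Enrich the knowledge base with the observed feedback.
        knowledge_base["history"].append((candidate, cost))
        if cost < best_cost:
            gain, best_cost = candidate, cost

    knowledge_base["gain"] = gain
    return knowledge_base, initial_cost, best_cost


if __name__ == "__main__":
    kb, before, after = self_learning_loop()
    print(f"gain={kb['gain']:.3f} cost before={before:.2f} after={after:.2f}")
```

Clamping every command before it reaches the plant keeps exploration inside the constraint set regardless of which gain is being tried, which is the property the abstract's "executing the control commands with the constraints" step appears to rely on.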