-
公开(公告)号:US20230037632A1
公开(公告)日:2023-02-09
申请号:US17966985
申请日:2022-10-17
Applicant: HUAWEI TECHNOLOGIES CO., LTD.
Inventor: Furui LIU , Wenjing CUN , Zhitang CHEN
IPC: G06N3/08
Abstract: A reinforcement learning method and recognition apparatus includes: obtaining a structure graph, where the structure graph includes structure information that is of an environment or the intelligent agent and that is obtained through learning; inputing a current state of the environment and the structure graph to a policy function of the intelligent agent, where the policy function is used to generate an action in response to the current state and the structure graph, and the policy function of the intelligent agent is a graph neural network; outputing the action to the environment by using the intelligent agent; obtaining, from the environment by using the intelligent agent, a next state and reward data in response to the action; training the intelligent agent through reinforcement learning based on the reward data.