Patent search ap:("HUAWEI TECHNOLOGIES CO., LTD.") AND inv:"Wenjing CUN"

1.

Invention application
REINFORCEMENT LEARNING METHOD AND APPARATUS (Status: in force)

Publication (announcement) number: US20230037632A1

Publication (announcement) date: 2023-02-09

Application number: US17966985

Application date: 2022-10-17

Applicant: HUAWEI TECHNOLOGIES CO., LTD.

Inventors: Furui LIU, Wenjing CUN, Zhitang CHEN

IPC: G06N3/08

Abstract: A reinforcement learning method and apparatus, where the method includes: obtaining a structure graph, where the structure graph includes structure information that is of an environment or an intelligent agent and that is obtained through learning; inputting a current state of the environment and the structure graph to a policy function of the intelligent agent, where the policy function is used to generate an action in response to the current state and the structure graph, and the policy function of the intelligent agent is a graph neural network; outputting the action to the environment by using the intelligent agent; obtaining, from the environment by using the intelligent agent, a next state and reward data in response to the action; and training the intelligent agent through reinforcement learning based on the reward data.
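The loop described in the abstract can be illustrated with a minimal sketch. This is not the patented implementation. It assumes a hand-written structure graph (the abstract says the graph is obtained through learning), a single round of message passing with a linear softmax head standing in for the graph neural network, and a plain REINFORCE update, since the abstract does not name a training algorithm. All names (`ADJACENCY`, `ToyEnv`, `policy_probs`, `train`) are hypothetical.

```python
import math
import random

# Hypothetical structure graph over 3 state variables (a chain 0-1-2).
# Assumption: fixed by hand here; the abstract says it is obtained through learning.
ADJACENCY = [
    [0, 1, 0],
    [1, 0, 1],
    [0, 1, 0],
]


def normalize_with_self_loops(adj):
    """Add self-loops and row-normalize so each node averages itself and its neighbors."""
    n = len(adj)
    rows = []
    for i in range(n):
        row = [adj[i][j] + (1 if i == j else 0) for j in range(n)]
        total = sum(row)
        rows.append([v / total for v in row])
    return rows


def message_pass(norm_adj, state):
    """One round of message passing over the structure graph: h = A_hat @ state."""
    return [sum(a * s for a, s in zip(row, state)) for row in norm_adj]


def policy_probs(weights, norm_adj, state):
    """Policy function: takes the current state and the structure graph, returns action probabilities."""
    hidden = message_pass(norm_adj, state)
    logits = [sum(w * h for w, h in zip(row, hidden)) for row in weights]
    peak = max(logits)  # subtract the max for a numerically stable softmax
    exps = [math.exp(z - peak) for z in logits]
    total = sum(exps)
    return [e / total for e in exps], hidden


class ToyEnv:
    """Hypothetical environment: one node is active; reward is 1 for selecting it."""

    def __init__(self, n_nodes, rng):
        self.n_nodes = n_nodes
        self.rng = rng
        self.target = 0

    def reset(self):
        self.target = self.rng.randrange(self.n_nodes)
        return [1.0 if i == self.target else 0.0 for i in range(self.n_nodes)]

    def step(self, action):
        reward = 1.0 if action == self.target else 0.0
        return self.reset(), reward  # next state and reward data


def train(steps=2000, lr=0.5, seed=0):
    """Run the act/observe/update loop and return the weights and per-step rewards."""
    rng = random.Random(seed)
    norm_adj = normalize_with_self_loops(ADJACENCY)
    n = len(ADJACENCY)
    weights = [[0.0] * n for _ in range(n)]  # one row of weights per action
    env = ToyEnv(n, rng)
    state = env.reset()
    rewards = []
    for _ in range(steps):
        probs, hidden = policy_probs(weights, norm_adj, state)
        action = rng.choices(range(n), weights=probs)[0]  # output the action
        next_state, reward = env.step(action)  # obtain next state and reward
        # REINFORCE (an assumption): d log pi(a|s) / d weights[k][j] = (1[k == a] - p_k) * h_j
        for k in range(n):
            coeff = (1.0 if k == action else 0.0) - probs[k]
            for j in range(n):
                weights[k][j] += lr * reward * coeff * hidden[j]
        rewards.append(reward)
        state = next_state
    return weights, rewards
```

Calling `train()` returns the learned weight matrix and the list of per-step rewards. Comparing the mean reward over the first and last few hundred steps is one simple way to check whether the policy improved on this toy task.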
