-
1.
公开(公告)号:US20240169211A1
公开(公告)日:2024-05-23
申请号:US18388180
申请日:2023-11-08
Applicant: DeepMind Technologies Limited
Inventor: Domenic Joseph Donato , Christopher James Dyer , Lei Yu , Wang Ling
IPC: G06N3/092 , G06N3/0985
CPC classification number: G06N3/092 , G06N3/0985
Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for training a neural network to perform a machine learning task through reinforcement learning. In one aspect, the training uses importance weights generated using standardized absolute deviations of quality scores generated by the neural network for candidate network outputs.