-
公开(公告)号:US20240127071A1
公开(公告)日:2024-04-18
申请号:US18475859
申请日:2023-09-27
Applicant: DeepMind Technologies Limited
Inventor: Robert Tjarko Lange , Tom Schaul , Yutian Chen , Tom Ben Zion Zahavy , Valentin Clement Dalibard , Christopher Yenchuan Lu , Satinder Singh Baveja , Johan Sebastian Flennerhag
IPC: G06N3/086
CPC classification number: G06N3/086
Abstract: There is provided a computer-implemented method for updating a search distribution of an evolutionary strategies optimizer using an optimizer neural network comprising one or more attention blocks. The method comprises receiving a plurality of candidate solutions, one or more parameters defining the search distribution that the plurality of candidate solutions are sampled from, and fitness score data indicating a fitness of each respective candidate solution of the plurality of candidate solutions. The method further comprises processing, by the one or more attention neural network blocks, the fitness score data using an attention mechanism to generate respective recombination weights corresponding to each respective candidate solution. The method further comprises updating the one or more parameters defining the search distribution based upon the recombination weights applied to the plurality of candidate solutions.