-
公开(公告)号:US20240127586A1
公开(公告)日:2024-04-18
申请号:US18275087
申请日:2022-02-02
Applicant: DeepMind Technologies Limited
Inventor: Andrew Brock , Soham De , Samuel Laurence Smith , Karen Simonyan
IPC: G06V10/82 , G06V10/776
CPC classification number: G06V10/82 , G06V10/776
Abstract: There is disclosed a computer-implemented method for training a neural network. The method comprises determining a gradient associated with a parameter of the neural network. The method further comprises determining a ratio of a gradient norm to parameter norm and comparing the ratio to a threshold. In response to determining that the ratio exceeds the threshold, the value of the gradient is reduced such that the ratio is equal to or below the threshold. The value of the parameter is updated based upon the reduced gradient value.
-
公开(公告)号:US20230351042A1
公开(公告)日:2023-11-02
申请号:US18141273
申请日:2023-04-28
Applicant: DeepMind Technologies Limited
Inventor: Soham De , Borja De Balle Pigem , Jamie Hayes , Samuel Laurence Smith , Leonard Alix Jean Eric Berrada Lancrey Javal
IPC: G06F21/62
CPC classification number: G06F21/6245
Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for privacy-sensitive training of a neural network. In one aspect, a method includes training a set of neural network parameters of the neural network on a set of training data over multiple training iterations to optimize an objective function. Each training iteration includes: sampling a batch of network inputs from the set of training data; determining a clipped gradient for each network input in the batch of network inputs; and updating the neural network parameters using the clipped gradients for the network inputs in the batch of network inputs.
-