-
公开(公告)号:US20220253694A1
公开(公告)日:2022-08-11
申请号:US17560118
申请日:2021-12-22
Applicant: Google LLC
Inventor: Ibrahim Alabdulmohsin , Hartmut Maennel , Daniel M. Keysers
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for training a neural network using re-initialization. One of the methods includes, at each time step in a sequence of time steps: identifying current values of the weights as of the training time step; selecting one of the layer blocks; generating new values for the weights of the plurality of neural network layers, comprising: re-initializing the values of the weights of at least the neural network layers in the layer blocks that are after the selected layer block without re-initializing the current values of the weights of the neural network layers in the layer block and the neural network layers in any layer block that is before the selected layer block; and raining the neural network starting from the new values for the weights of the plurality of neural network layers.