Deep Learning Training Method for Computing Device and Apparatus

    公开(公告)号:US20230206069A1

    公开(公告)日:2023-06-29

    申请号:US18175936

    申请日:2023-02-28

    CPC classification number: G06N3/08 G06N3/045

    Abstract: A deep learning training method includes obtaining a training set, a first neural network, and a second neural network, where shortcut connections included in the first neural network are less than shortcut connections included in the second neural network; performing at least one time of iterative training on the first neural network based on the training set, to obtain a trained first neural network, where any iterative training includes: using a first output of at least one first intermediate layer in the first neural network as an input of at least one network layer in the second neural network, to obtain an output result of the at least one network layer; and updating the first neural network according to a first loss function.

Patent Agency Ranking