TRAINING APPARATUS AND METHOD FOR NEURAL NETWORK MODEL, AND RELATED DEVICE

    公开(公告)号:US20230087642A1

    公开(公告)日:2023-03-23

    申请号:US17991683

    申请日:2022-11-21

    Abstract: A training apparatus used for model training on a neural network in the field of artificial intelligence (AI) is disclosed. The training apparatus includes a plurality of accelerators. In a parallel processing process in which the training apparatus trains a neural network model using the plurality of accelerators, a complete weight coefficient of the neural network model is stored in the plurality of accelerators in the training apparatus in a distributed manner. Weight coefficients of the plurality of accelerators are subsequently added to obtain the complete weight coefficient. The neural network model is further trained on each accelerator based on different input data and the complete weight coefficient. Thus, the complete weight coefficient is stored in the plurality of accelerators in the training apparatus in a distributed manner, to reduce video random access memory (RAM) consumption of the training apparatus in a training process of the neural network model.

Patent Agency Ranking