Invention Application
- Patent Title: Neural Network Training Method and Apparatus, Electronic Device, Medium and Program Product
-
Application No.: US17558355Application Date: 2021-12-21
-
Publication No.: US20220374704A1Publication Date: 2022-11-24
- Inventor: Danlei FENG , Long LIAN , Dianhai YU , Xuefeng YAO , Xinxuan WU , Zhihua WU , Yanjun MA
- Applicant: Beijing Baidu Netcom Science Technology Co., Ltd.
- Applicant Address: CN Beijing
- Assignee: Beijing Baidu Netcom Science Technology Co., Ltd.
- Current Assignee: Beijing Baidu Netcom Science Technology Co., Ltd.
- Current Assignee Address: CN Beijing
- Priority: CN202110548446.8 20210519
- Main IPC: G06N3/08
- IPC: G06N3/08 ; G06N3/04

Abstract:
The disclosure provides a neural network training method and apparatus, an electronic device, a medium and a program product, and relates to the field of artificial intelligence, in particular to the fields of deep learning and distributed learning. The method includes: acquiring a neural network for deep learning; constructing a deep reinforcement learning model for the neural network; and determining, through the deep reinforcement learning model, a processing unit selection for the plurality of the network layers based on a duration for training each of the network layers by each type of the plurality of types of the processing units, and a cost of each type of the plurality of types of the processing units, wherein the processing unit selection comprises the type of the processing unit to be used for each of the plurality of the network layers, and the processing unit selection is used for making a total cost of the processing units used by the neural network below a cost threshold, in response to a duration for pipelining parallel computing for training the neural network being shorter than a present duration.
Information query