Loss-scaling for deep neural network training with reduced precision

Invention Grant

US11842280B2 Loss-scaling for deep neural network training with reduced precision 有权

Please log in to see more content

Patent Title: Loss-scaling for deep neural network training with reduced precision
Application No.: US15971884

Application Date: 2018-05-04
Publication No.: US11842280B2

Publication Date: 2023-12-12
Inventor: Jonah Alben , Paulius Micikevicius , Hao Wu
Applicant: NVIDIA Corporation
Applicant Address: US CA Santa Clara
Assignee: NVIDIA Corporation
Current Assignee: NVIDIA Corporation
Current Assignee Address: US CA Santa Clara
Agency: Davis Wright Tremaine LLP
Main IPC: G06N3/084
IPC: G06N3/084 ; G06N3/04 ; G06N3/063

Loss-scaling for deep neural network training with reduced precision

Abstract:

In training a deep neural network using reduced precision, gradient computation operates on larger values without affecting the rest of the training procedure. One technique trains the deep neural network to develop loss, scales the loss, computes gradients at a reduced precision, and reduces the magnitude of the computed gradients to compensate for scaling of the loss. In one example non-limiting arrangement, the training forward pass scales a loss value by some factor S and the weight update reduces the weight gradient contribution by 1/S. Several techniques can be used for selecting scaling factor S and adjusting the weight update.

Public/Granted literature

US20180322391A1 LOSS-SCALING FOR DEEP NEURAL NETWORK TRAINING WITH REDUCED PRECISION Public/Granted day:2018-11-08

Information query

Espacenet