Invention Grant
- Patent Title: Loss-scaling for deep neural network training with reduced precision
-
Application No.: US15971884Application Date: 2018-05-04
-
Publication No.: US11842280B2Publication Date: 2023-12-12
- Inventor: Jonah Alben , Paulius Micikevicius , Hao Wu
- Applicant: NVIDIA Corporation
- Applicant Address: US CA Santa Clara
- Assignee: NVIDIA Corporation
- Current Assignee: NVIDIA Corporation
- Current Assignee Address: US CA Santa Clara
- Agency: Davis Wright Tremaine LLP
- Main IPC: G06N3/084
- IPC: G06N3/084 ; G06N3/04 ; G06N3/063

Abstract:
In training a deep neural network using reduced precision, gradient computation operates on larger values without affecting the rest of the training procedure. One technique trains the deep neural network to develop loss, scales the loss, computes gradients at a reduced precision, and reduces the magnitude of the computed gradients to compensate for scaling of the loss. In one example non-limiting arrangement, the training forward pass scales a loss value by some factor S and the weight update reduces the weight gradient contribution by 1/S. Several techniques can be used for selecting scaling factor S and adjusting the weight update.
Public/Granted literature
- US20180322391A1 LOSS-SCALING FOR DEEP NEURAL NETWORK TRAINING WITH REDUCED PRECISION Public/Granted day:2018-11-08
Information query