Accelerating deep neural network training with inconsistent stochastic gradient descent
Abstract:
Aspects of the present disclosure describe techniques for training a convolutional neural network using an inconsistent stochastic gradient descent (ISGD) algorithm. The training effort applied to the batches used by the ISGD algorithm is dynamically adjusted according to a determined loss for a given training batch, and each batch is classified into one of two sub-states: well-trained or under-trained. The ISGD algorithm applies more iterations to under-trained batches while reducing iterations for well-trained ones.
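The following is a minimal, illustrative sketch of a training loop in this spirit, not the patented method itself. It assumes a PyTorch-style setup; the function name isgd_epoch, the running-statistics threshold (mean plus a multiple of the standard deviation of recent batch losses), and the parameters extra_iters, window, and threshold_factor are all hypothetical choices used only to show how a batch might be classified as under-trained and given extra update steps.

```python
import torch
from collections import deque

def isgd_epoch(model, loader, optimizer, loss_fn,
               extra_iters=3, window=100, threshold_factor=1.0):
    """One epoch of an ISGD-style loop (illustrative sketch only).

    A batch whose loss exceeds a running threshold (mean + factor * std
    of recent batch losses) is treated as under-trained and receives
    extra_iters additional update steps; other batches get one step.
    """
    recent_losses = deque(maxlen=window)  # running statistics of batch losses

    for inputs, targets in loader:
        # Standard single update step for the batch.
        optimizer.zero_grad()
        loss = loss_fn(model(inputs), targets)
        loss.backward()
        optimizer.step()

        # Classify the batch against the running loss statistics.
        recent_losses.append(loss.item())
        losses = torch.tensor(list(recent_losses))
        threshold = losses.mean() + threshold_factor * losses.std(unbiased=False)

        if loss.item() > threshold:
            # Under-trained batch: spend extra iterations on it.
            for _ in range(extra_iters):
                optimizer.zero_grad()
                loss = loss_fn(model(inputs), targets)
                loss.backward()
                optimizer.step()
```

In practice, the thresholding rule and the number of extra iterations would follow the specific criteria defined in the disclosure; the sketch only illustrates the general mechanism of adjusting per-batch training effort based on observed loss.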