MEMORY REDUCTION FOR NEURAL NETWORKS WITH FIXED STRUCTURES

发明申请

US20190303025A1 MEMORY REDUCTION FOR NEURAL NETWORKS WITH FIXED STRUCTURES 审中-公开

请登陆查看更多内容

专利标题： MEMORY REDUCTION FOR NEURAL NETWORKS WITH FIXED STRUCTURES
申请号： US15943079

申请日： 2018-04-02
公开(公告)号： US20190303025A1

公开(公告)日： 2019-10-03
发明人: Taro Sekiyama , Haruki Imai , Jun Doi , Yasushi Negishi
申请人： INTERNATIONAL BUSINESS MACHINES CORPORATION
主分类号： G06F3/06
IPC分类号： G06F3/06 ; G06N3/08

MEMORY REDUCTION FOR NEURAL NETWORKS WITH FIXED STRUCTURES

摘要：

A method is provided for reducing consumption of a memory in a propagation process for a neural network (NN) having fixed structures for computation order and node data dependency. The memory includes memory segments for allocating to nodes. The method collects, in a NN training iteration, information for each node relating to an allocation, size, and lifetime thereof. The method chooses, responsive to the information, a first node having a maximum memory size relative to remaining nodes, and a second node non-overlapped with the first node lifetime. The method chooses another node non-overlapped with the first node lifetime, responsive to a sum of memory sizes of the second node and the other node not exceeding a first node memory size. The method reallocates a memory segment allocated to the first node to the second node and the other node to be reused by the second node and the other node.

公开/授权文献

US10782897B2 Memory reduction for neural networks with fixed structures 公开/授权日：2020-09-22

信息查询

Global Dossier Espacenet

IPC分类:

G	物理
G06	计算；推算或计数
G06F	电数字数据处理（基于特定计算模型的计算机系统入G06N）
G06F3/00	用于将所要处理的数据转变成为计算机能够处理的形式的输入装置；用于将数据从处理机传送到输出设备的输出装置，例如，接口装置
G06F3/06	.来自记录载体的数字输入，或者到记录载体上去的数字输出