Multicast network and memory transfer optimizations for neural network hardware acceleration
Abstract:
Neural-network-specific hardware acceleration optimizations are disclosed, including an optimized multicast network and an optimized DRAM transfer unit, each performing in constant or linear time. The multicast network is a set of switch nodes organized into layers and configured to operate as a Beneš network. Configuration data may be accessed by all switch nodes in the network. Each layer is configured to perform a Beneš network transformation of the previous layer within a computer instruction. Since the computer instructions are pipelined, the entire network of switch nodes may be configured in constant or linear time. Similarly, a DRAM transfer unit configured to access memory in strides organizes memory into banks indexed by a prime or relatively prime number of banks. The index value is selected so as not to cause memory address collisions. Upon receiving a memory specification, the DRAM transfer unit may calculate the strides, thereby accessing an entire tile of a tensor in constant or linear time.
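As a rough illustration of the bank-indexing idea described above (not the patented hardware itself), the Python sketch below models banks addressed modulo the bank count. The helper names `bank_index` and `strided_banks`, and the specific stride and bank counts, are hypothetical; the point is that a bank count coprime to the access stride keeps a strided tile read free of bank collisions.

```python
from math import gcd

def bank_index(addr: int, num_banks: int) -> int:
    # Hypothetical mapping: bank is the flat address modulo the bank count.
    # Choosing a prime (or stride-coprime) bank count is what avoids collisions.
    return addr % num_banks

def strided_banks(base: int, stride: int, count: int, num_banks: int) -> list[int]:
    # Bank indices touched when reading `count` elements of a tensor tile at a fixed stride.
    return [bank_index(base + i * stride, num_banks) for i in range(count)]

# Stride of 6 elements, 7 accesses.
print(strided_banks(0, 6, 7, 8))  # gcd(6, 8) = 2 -> [0, 6, 4, 2, 0, 6, 4]: bank conflicts
print(strided_banks(0, 6, 7, 7))  # gcd(6, 7) = 1 -> [0, 6, 5, 4, 3, 2, 1]: conflict-free
assert gcd(6, 7) == 1  # relatively prime stride and bank count -> distinct banks
```

When the stride and bank count share no common factor, the sequence base + k*stride cycles through every residue modulo the bank count before repeating, so all banks can service the tile access in parallel.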