-
公开(公告)号:US20190303743A1
公开(公告)日:2019-10-03
申请号:US16317497
申请日:2016-09-27
Applicant: Intel Corporation
Inventor: Swagath VENKATARAMANI , Dipankar DAS , Ashish RANJAN , Subarno BANERJEE , Sasikanth AVANCHA , Ashok JAGANNATHAN , Ajaya V. DURG , Dheemanth NAGARAJ , Bharat KAUL , Anand RAGHUNATHAN
Abstract: Methods and apparatuses relating to processing neural networks are described. In one embodiment, an apparatus to process a neural network includes a plurality of fully connected layer chips coupled by an interconnect; a plurality of convolutional layer chips each coupled by an interconnect to a respective fully connected layer chip of the plurality of fully connected layer chips and each of the plurality of fully connected layer chips and the plurality of convolutional layer chips including an interconnect to couple each of a forward propagation compute intensive tile, a back propagation compute intensive tile, and a weight gradient compute intensive tile of a column of compute intensive tiles between a first memory intensive tile and a second memory intensive tile.
-
公开(公告)号:US20190243651A1
公开(公告)日:2019-08-08
申请号:US16317501
申请日:2016-09-27
Applicant: INTEL CORPORATION
Inventor: Swagath VENKATARAMANI , Dipankar DAS , Sasikanth AVANCHA , Ashish RANJAN , Subarno BANERJEE , Bharat KAUL , Anand RAGHUNATHAN
IPC: G06F9/30
CPC classification number: G06F9/30145 , G06F9/3004 , G06F9/30043 , G06F9/30087 , G06F9/3834 , G06F9/52 , G06N3/063 , G06N3/084
Abstract: Systems, methods, and apparatuses relating to access synchronization in a shared memory are described. In one embodiment, a processor includes a decoder to decode an instruction into a decoded instruction, and an execution unit to execute the decoded instruction to: receive a first input operand of a memory address to be tracked and a second input operand of an allowed sequence of memory accesses to the memory address, and cause a block of a memory access that violates the allowed sequence of memory accesses to the memory address. In one embodiment, a circuit separate from the execution unit compares a memory address for a memory access request to one or more memory addresses in a tracking table, and blocks a memory access for the memory access request when a type of access violates a corresponding allowed sequence of memory accesses to the memory address for the memory access request.
-
公开(公告)号:US20240118892A1
公开(公告)日:2024-04-11
申请号:US18543357
申请日:2023-12-18
Applicant: Intel Corporation
Inventor: Swagath VENKATARAMANI , Dipankar DAS , Ashish RANJAN , Subarno BANERJEE , Sasikanth AVANCHA , Ashok JAGANNATHAN , Ajaya V. DURG , Dheemanth NAGARAJ , Bharat KAUL , Anand RAGHUNATHAN
CPC classification number: G06F9/30145 , G06F9/3004 , G06F9/30043 , G06F9/30087 , G06F9/3834 , G06F9/52 , G06N3/04 , G06N3/063 , G06N3/084
Abstract: Methods and apparatuses relating to processing neural networks are described. In one embodiment, an apparatus to process a neural network includes a plurality of fully connected layer chips coupled by an interconnect; a plurality of convolutional layer chips each coupled by an interconnect to a respective fully connected layer chip of the plurality of fully connected layer chips and each of the plurality of fully connected layer chips and the plurality of convolutional layer chips including an interconnect to couple each of a forward propagation compute intensive tile, a back propagation compute intensive tile, and a weight gradient compute intensive tile of a column of compute intensive tiles between a first memory intensive tile and a second memory intensive tile.
-
公开(公告)号:US20210382719A1
公开(公告)日:2021-12-09
申请号:US17410934
申请日:2021-08-24
Applicant: Intel Corporation
Inventor: Swagath VENKATARAMANI , Dipankar DAS , Sasikanth AVANCHA , Ashish RANJAN , Subarno BANERJEE , Bharat KAUL , Anand RAGHUNATHAN
Abstract: Systems, methods, and apparatuses relating to access synchronization in a shared memory are described. In one embodiment, a processor includes a decoder to decode an instruction into a decoded instruction, and an execution unit to execute the decoded instruction to: receive a first input operand of a memory address to be tracked and a second input operand of an allowed sequence of memory accesses to the memory address, and cause a block of a memory access that violates the allowed sequence of memory accesses to the memory address. In one embodiment, a circuit separate from the execution unit compares a memory address for a memory access request to one or more memory addresses in a tracking table, and blocks a memory access for the memory access request when a type of access violates a corresponding allowed sequence of memory accesses to the memory address for the memory access request.
-
-
-