-
公开(公告)号:US11699681B2
公开(公告)日:2023-07-11
申请号:US16727779
申请日:2019-12-26
Applicant: Intel Corporation
Inventor: Abhishek Sharma , Hui Jae Yoo , Van H. Le , Huseyin Ekin Sumbul , Phil Knag , Gregory K. Chen , Ram Krishnamurthy
IPC: H01L25/065 , G11C11/407
CPC classification number: H01L25/0657 , G11C11/407 , H01L2224/32145 , H01L2224/32225
Abstract: An apparatus is formed. The apparatus includes a stack of semiconductor chips. The stack of semiconductor chips includes a logic chip and a memory stack, wherein, the logic chip includes at least one of a GPU and CPU. The apparatus also includes a semiconductor chip substrate. The stack of semiconductor chips are mounted on the semiconductor chip substrate. At least one other logic chip is mounted on the semiconductor chip substrate. The semiconductor chip substrate includes wiring to interconnect the stack of semiconductor chips to the at least one other logic chip.
-
公开(公告)号:US11625584B2
公开(公告)日:2023-04-11
申请号:US16443548
申请日:2019-06-17
Applicant: Intel Corporation
Inventor: Raghavan Kumar , Gregory K. Chen , Huseyin Ekin Sumbul , Phil Knag , Ram Krishnamurthy
Abstract: Examples described herein relate to a neural network whose weights from a matrix are selected from a set of weights stored in a memory on-chip with a processing engine for generating multiply and carry operations. The number of weights in the set of weights stored in the memory can be less than a number of weights in the matrix thereby reducing an amount of memory used to store weights in a matrix. The weights in the memory can be generated in training using gradients from back propagation. Weights in the memory can be selected using a tabulation hash calculation on entries in a table.
-
公开(公告)号:US11347477B2
公开(公告)日:2022-05-31
申请号:US16586648
申请日:2019-09-27
Applicant: Intel Corporation
Inventor: Huseyin Ekin Sumbul , Gregory K. Chen , Phil Knag , Raghavan Kumar , Ram Krishnamurthy
Abstract: A memory circuit includes a number (X) of multiply-accumulate (MAC) circuits that are dynamically configurable. The MAC circuits can either compute an output based on computations of X elements of the input vector with the weight vector, or to compute the output based on computations of a single element of the input vector with the weight vector, with each element having a one bit or multibit length. A first memory can hold the input vector having a width of X elements and a second memory can store the weight vector. The MAC circuits include a MAC array on chip with the first memory.
-
公开(公告)号:US11017288B2
公开(公告)日:2021-05-25
申请号:US15845245
申请日:2017-12-18
Applicant: Intel Corporation
Inventor: Ram Kumar Krishnamurthy , Gregory Kengho Chen , Raghavan Kumar , Phil Christopher Knag , Huseyin Ekin Sumbul
Abstract: System and techniques for spike timing dependent plasticity (STDP) in neuromorphic hardware are described herein. A first spike may be received, at a first neuron at a first time, from a second neuron. The first neuron may produce a second spike at a second time after the first time. At a third time after the second time, the first neuron may receive a third spike from the second neuron. Here, the third spike is a replay of the first spike with a defined time offset. The first neuron may then perform long term potentiation (LTP) for the first spike using the third spike.
-
公开(公告)号:US10860682B2
公开(公告)日:2020-12-08
申请号:US16839013
申请日:2020-04-02
Applicant: Intel Corporation
Inventor: Phil Knag , Gregory K. Chen , Raghavan Kumar , Huseyin Ekin Sumbul , Abhishek Sharma , Sasikanth Manipatruni , Amrita Mathuriya , Ram Krishnamurthy , Ian A. Young
IPC: G06F17/16 , G11C11/419 , G11C11/418 , G11C7/12 , G11C8/08 , G06G7/16 , G06G7/22 , G11C11/56 , G06F9/30 , G11C7/10 , G06N3/063
Abstract: A binary CIM circuit enables all memory cells in a memory array to be effectively accessible simultaneously for computation using fixed pulse widths on the wordlines and equal capacitance on the bitlines. The fixed pulse widths and equal capacitance ensure that a minimum voltage drop in the bitline represents one least significant bit (LSB) so that the bitline voltage swing remains safely within the maximum allowable range. The binary CIM circuit maximizes the effective memory bandwidth of a memory array for a given maximum voltage range of bitline voltage.
-
公开(公告)号:US20190042913A1
公开(公告)日:2019-02-07
申请号:US15884001
申请日:2018-01-30
Applicant: Intel Corporation
Inventor: Phil Knag , Gregory Kengho Chen , Raghavan Kumar , Huseyin Ekin Sumbul , Ram Kumar Krishnamurthy
IPC: G06N3/04
Abstract: Various systems, devices, and methods for operating on a data sequence. A system includes a set of circuits that form an input layer to receive a data sequence; first hardware computing units to transform the data sequence, the first hardware computing units connected using a set of randomly selected weights, a first hardware computing unit to: receive an input from a second hardware computing unit, determine a weight of a connection between the first and second hardware computing units using an identifier of the second hardware computing unit and a fixed random weight generator, and operate on the input using the weight to determine a state of the first hardware computing unit; and second hardware computing units to operate on states of the first computing units to generate an output based on the data sequence.
-
公开(公告)号:US20180189648A1
公开(公告)日:2018-07-05
申请号:US15394976
申请日:2016-12-30
Applicant: Intel Corporation
Inventor: Abhronil Sengupta , Gregory K. Chen , Raghavan Kumar , Huseyin Ekin Sumbul , Phil Knag
Abstract: In one embodiment, a processor is to store a membrane potential of a neural unit of a neural network; and calculate, at a particular time-step of the neural network, a change to the membrane potential of the neural unit occurring over multiple time-steps that have elapsed since the last time-step at which the membrane potential was updated, wherein each of the multiple time-steps that have elapsed since the last time-step is associated with at least one input to the neural unit that affects the membrane potential of the neural unit.
-
公开(公告)号:US11783160B2
公开(公告)日:2023-10-10
申请号:US15884001
申请日:2018-01-30
Applicant: Intel Corporation
Inventor: Phil Knag , Gregory Kengho Chen , Raghavan Kumar , Huseyin Ekin Sumbul , Ram Kumar Krishnamurthy
Abstract: Various systems, devices, and methods for operating on a data sequence. A system includes a set of circuits that form an input layer to receive a data sequence; first hardware computing units to transform the data sequence, the first hardware computing units connected using a set of randomly selected weights, a first hardware computing unit to: receive an input from a second hardware computing unit, determine a weight of a connection between the first and second hardware computing units using an identifier of the second hardware computing unit and a fixed random weight generator, and operate on the input using the weight to determine a state of the first hardware computing unit; and second hardware computing units to operate on states of the first computing units to generate an output based on the data sequence.
-
公开(公告)号:US11726950B2
公开(公告)日:2023-08-15
申请号:US16586975
申请日:2019-09-28
Applicant: Intel Corporation
Inventor: Huseyin Ekin Sumbul , Gregory K. Chen , Phil Knag , Raghavan Kumar , Ram Krishnamurthy
CPC classification number: G06F15/8046 , G06F17/153 , G06N3/063
Abstract: A compute near memory (CNM) convolution accelerator enables a convolutional neural network (CNN) to use dedicated acceleration to achieve efficient in-place convolution operations with less impact on memory and energy consumption. A 2D convolution operation is reformulated as 1D row-wise convolution. The 1D row-wise convolution enables the CNM convolution accelerator to process input activations row-by-row, while using the weights one-by-one. Lightweight access circuits provide the ability to stream both weights and input rows as vectors to MAC units, which in turn enables modules of the CNM convolution accelerator to implement convolution for both [1×1] and chosen [n×n] sized filters.
-
公开(公告)号:US11251186B2
公开(公告)日:2022-02-15
申请号:US16827542
申请日:2020-03-23
Applicant: Intel Corporation
Inventor: Abhishek Sharma , Noriyuki Sato , Sarah Atanasov , Huseyin Ekin Sumbul , Gregory K. Chen , Phil Knag , Ram Krishnamurthy , Hui Jae Yoo , Van H. Le
IPC: G11C11/24 , H01L27/108 , H01L27/12 , G11C11/4096
Abstract: Examples herein relate to a memory device comprising an eDRAM memory cell, the eDRAM memory cell can include a write circuit formed at least partially over a storage cell and a read circuit formed at least partially under the storage cell; a compute near memory device bonded to the memory device; a processor; and an interface from the memory device to the processor. In some examples, circuitry is included to provide an output of the memory device to emulate output read rate of an SRAM memory device comprises one or more of: a controller, a multiplexer, or a register. Bonding of a surface of the memory device can be made to a compute near memory device or other circuitry. In some examples, a layer with read circuitry can be bonded to a layer with storage cells. Any layers can be bonded together using techniques described herein.
-
-
-
-
-
-
-
-
-