Multi-precision digital compute-in-memory deep neural network engine for flexible and energy efficient inferencing

Invention Grant

US12079733B2 Multi-precision digital compute-in-memory deep neural network engine for flexible and energy efficient inferencing 有权

Please log in to see more content

Patent Title: Multi-precision digital compute-in-memory deep neural network engine for flexible and energy efficient inferencing
Application No.: US16941178

Application Date: 2020-07-28
Publication No.: US12079733B2

Publication Date: 2024-09-03
Inventor: Tung Thanh Hoang , Won Ho Choi , Martin Lueker-Boden
Applicant: SanDisk Technologies LLC
Applicant Address: US TX Addison
Assignee: SanDisk Technologies LLC
Current Assignee: SanDisk Technologies LLC
Current Assignee Address: US TX Addison
Agency: Vierra Magen Marcus LLP
Main IPC: G06F7/544
IPC: G06F7/544 ; G06F12/02 ; G06N3/10

Multi-precision digital compute-in-memory deep neural network engine for flexible and energy efficient inferencing

Abstract:

Anon-volatile memory structure capable of storing weights for layers of a deep neural network (DNN) and perform an inferencing operation within the structure is presented. An in-array multiplication can be performed between multi-bit valued inputs, or activations, for a layer of the DNN and multi-bit valued weights of the layer. Each bit of a weight value is stored in a binary valued memory cell of the memory array and each bit of the input is applied as a binary input to a word line of the array for the multiplication of the input with the weight. To perform a multiply and accumulate operation, the results of the multiplications are accumulated by adders connected to sense amplifiers along the bit lines of the array. The adders can be configured to multiple levels of precision, so that the same structure can accommodate weights and activations of 8-bit, 4-bit, and 2-bit precision.

Public/Granted literature

US20210397974A1 MULTI-PRECISION DIGITAL COMPUTE-IN-MEMORY DEEP NEURAL NETWORK ENGINE FOR FLEXIBLE AND ENERGY EFFICIENT INFERENCING Public/Granted day:2021-12-23

Information query

Espacenet

IPC分类:

G	物理
G06	计算；推算或计数
G06F	电数字数据处理（基于特定计算模型的计算机系统入G06N）
G06F7/00	通过待处理的数据的指令或内容进行运算的数据处理的方法或装置（逻辑电路入H03K19/00）
G06F7/38	.只利用数制表示，例如利用二进制、三进制、十进制表示来完成计算的方法或装置
G06F7/48	..应用非形成接触器件的，例如，电子管、固体器件；应用非特定的器件的
G06F7/544	...用于通过计算求函数值的