-
公开(公告)号:US20240045589A1
公开(公告)日:2024-02-08
申请号:US18488581
申请日:2023-10-17
Inventor: Nitin CHAWLA , Giuseppe DESOLI , Anuj GROVER , Thomas BOESCH , Surinder Pal SINGH , Manuj AYODHYAWASI
CPC classification number: G06F3/0604 , G06F3/0619 , G06F3/0655 , G06F3/0679 , G06N3/08
Abstract: A memory array arranged as a plurality of memory cells. The memory cells are configured to operate at a determined voltage. A memory management circuitry coupled to the plurality of memory cells tags a first set of the plurality of memory cells as low-voltage cells and tags a second set of the plurality of memory cells as high-voltage cells. A power source provides a low voltage to the first set of memory cells and provides a high voltage to the second set of memory cells based on the tags.
-
12.
公开(公告)号:US20230206032A1
公开(公告)日:2023-06-29
申请号:US18172979
申请日:2023-02-22
Inventor: Giuseppe DESOLI , Carmine CAPPETTA , Thomas BOESCH , Surinder Pal SINGH , Saumya SUNEJA
CPC classification number: G06N3/045 , G06F16/2282 , G06N3/04 , G06N3/063 , G06N3/08 , G06F18/217
Abstract: Embodiments of an electronic device include an integrated circuit, a reconfigurable stream switch formed in the integrated circuit along with a plurality of convolution accelerators and a decompression unit coupled to the reconfigurable stream switch. The decompression unit decompresses encoded kernel data in real time during operation of convolutional neural network.
-
公开(公告)号:US20220269410A1
公开(公告)日:2022-08-25
申请号:US17742987
申请日:2022-05-12
Inventor: Nitin CHAWLA , Giuseppe DESOLI , Anuj GROVER , Thomas BOESCH , Surinder Pal SINGH , Manuj AYODHYAWASI
Abstract: A memory array arranged as a plurality of memory cells. The memory cells are configured to operate at a determined voltage. A memory management circuitry coupled to the plurality of memory cells tags a first set of the plurality of memory cells as low-voltage cells and tags a second set of the plurality of memory cells as high-voltage cells. A power source provides a low voltage to the first set of memory cells and provides a high voltage to the second set of memory cells based on the tags.
-
公开(公告)号:US20210241806A1
公开(公告)日:2021-08-05
申请号:US17158875
申请日:2021-01-26
Inventor: Nitin CHAWLA , Thomas BOESCH , Anuj Grover , Surinder Pal SINGH , Giuseppe DESOLI
Abstract: A system includes a random access memory organized into individually addressable words. Streaming access control circuitry is coupled to word lines of the random access memory. The streaming access control circuitry responds to a request to access a plurality of individually addressable words of a determined region of the random access memory by generating control signals to drive the word lines to streamingly access the plurality of individually addressable words of the determined region. The request indicates an offset associated with the determined region and a pattern associated with the streaming access.
-
公开(公告)号:US20130330013A1
公开(公告)日:2013-12-12
申请号:US13963860
申请日:2013-08-09
Applicant: STMicroelectronics International N.V.
Inventor: Surinder Pal SINGH , Aneesh BHASIN , Kaushik SAHA
IPC: G06T9/00
Abstract: Parallelization of decoding of a data stream encoded with a variable length code includes determining one or more markers, each of which indicates a position within the encoded data stream. The determined markers are included into the encoded data stream together with the encoded data. At the decoder side, the markers are parsed from the encoded data stream and based on the extracted markers. The encoded data is separated into partitions, which are decoded separately and in parallel.
Abstract translation: 用可变长度码编码的数据流的解码的并行化包括确定一个或多个标记,每个标记表示编码数据流内的位置。 确定的标记与编码数据一起被包括在编码数据流中。 在解码器侧,从编码数据流中解析出标记,并根据提取的标记进行解析。 编码数据被分成分开并且并行解码的分区。
-
公开(公告)号:US20230153621A1
公开(公告)日:2023-05-18
申请号:US18156704
申请日:2023-01-19
Inventor: Surinder Pal SINGH , Giuseppe DESOLI , Thomas BOESCH
CPC classification number: G06N3/08 , G06N20/00 , G06F17/11 , G06N3/063 , G06F9/3001 , G06F9/30032 , G06F9/30036 , G06N3/045
Abstract: An integrated circuit includes a reconfigurable stream switch and an arithmetic circuit. The stream switch, in operation, streams data. The arithmetic circuit has a plurality of inputs coupled to the reconfigurable stream switch. In operation, the arithmetic circuit generates an output according to AX+BY+C, where A, B and C are vector or scalar constants, and X and Y are data streams streamed to the arithmetic circuit through the reconfigurable stream switch.
-
公开(公告)号:US20230084985A1
公开(公告)日:2023-03-16
申请号:US18056937
申请日:2022-11-18
Inventor: Thomas BOESCH , Giuseppe DESOLI , Surinder Pal SINGH , Carmine CAPPETTA
Abstract: Techniques and systems are provided for implementing a convolutional neural network. One or more convolution accelerators are provided that each include a feature line buffer memory, a kernel buffer memory, and a plurality of multiply-accumulate (MAC) circuits arranged to multiply and accumulate data. In a first operational mode the convolutional accelerator stores feature data in the feature line buffer memory and stores kernel data in the kernel data buffer memory. In a second mode of operation, the convolutional accelerator stores kernel decompression tables in the feature line buffer memory.
-
公开(公告)号:US20230062910A1
公开(公告)日:2023-03-02
申请号:US17461626
申请日:2021-08-30
Inventor: Giuseppe DESOLI , Surinder Pal SINGH , Thomas BOESCH
Abstract: A convolutional neural network includes convolution circuitry. The convolution circuitry performs convolution operations on input tensor values. The convolutional neural network includes requantization circuitry that requantizes convolution values output from the convolution circuitry.
-
公开(公告)号:US20220414420A1
公开(公告)日:2022-12-29
申请号:US17360986
申请日:2021-06-28
Inventor: Loris LUISE , Surinder Pal SINGH , Fabio Giuseppe DE AMBROGGI
Abstract: Data structure and microcontroller architecture performing binary multiply-accumulate operations using multiple partial copies of weights. Destination-register location, source-register location, and weight-register location are received. Using the weight-register location, a sub-set of the weight bits is copied a select number of times based on a filter index value that is received. Each copy of the sub-set of weights is executed in parallel. Using the source-register location, a sub-set of the input bits is selected based on the size of the sub-set of weights, wherein the sub-set of input bits is shifted one bit from a previous sub-set of input bits. XOR operation is performed on each corresponding bit in the copy of the sub-set of weights with each corresponding bit in the selected sub-set of input bits. In a corresponding destination sub-location, output of each XOR operation is aggregated with each other and with current value of the corresponding destination sub-location.
-
公开(公告)号:US20210397933A1
公开(公告)日:2021-12-23
申请号:US16909673
申请日:2020-06-23
Inventor: Thomas BOESCH , Giuseppe DESOLI , Surinder Pal SINGH , Carmine CAPPETTA
Abstract: Techniques and systems are provided for implementing a convolutional neural network. One or more convolution accelerators are provided that each include a feature line buffer memory, a kernel buffer memory, and a plurality of multiply-accumulate (MAC) circuits arranged to multiply and accumulate data. In a first operational mode the convolutional accelerator stores feature data in the feature line buffer memory and stores kernel data in the kernel data buffer memory. In a second mode of operation, the convolutional accelerator stores kernel decompression tables in the feature line buffer memory.
-
-
-
-
-
-
-
-
-