Patent search ap:("STMICROELECTRONICS S.r.l.") AND inv:"Giuseppe DESOLI" Page 3

21.

发明公开
ITERATION ENGINE FOR THE COMPUTATION OF LARGE KERNELS IN CONVOLUTIONAL ACCELERATORS 审中-公开

公开(公告)号：US20240012871A1

公开(公告)日：2024-01-11

申请号：US17859769

申请日：2022-07-07

Applicant: STMICROELECTRONICS S.r.l. , STMicroelectronics International N.V.

Inventor： Antonio DE VITA , Thomas BOESCH , Giuseppe DESOLI

IPC: G06F17/15 , G06F7/544

CPC classification number: G06F17/15 , G06F7/5443

Abstract: A convolutional accelerator includes a feature line buffer, a kernel buffer, a multiply-accumulate cluster, and iteration control circuitry. The convolutional accelerator, in operation, convolves a kernel with a streaming feature data tensor. The convolving includes decomposing the kernel into a plurality of sub-kernels and iteratively convolving the sub-kernels with respective sub-tensors of the streamed feature data tensor. The iteration control circuitry, in operation, defines respective windows of the streamed feature data tensors, the windows corresponding to the sub-tensors.

22.

发明公开
ACCELERATION OF 1X1 CONVOLUTIONS IN CONVOLUTIONAL NEURAL NETWORKS 审中-公开

公开(公告)号：US20230418559A1

公开(公告)日：2023-12-28

申请号：US17847817

申请日：2022-06-23

Applicant: STMICROELECTRONICS S.r.l. , STMicroelectronics International N.V.

Inventor： Michele ROSSI , Thomas BOESCH , Giuseppe DESOLI

IPC: G06F7/523 , G06F7/50

CPC classification number: G06F7/523 , G06F7/50

Abstract: A convolutional accelerator includes a feature line buffer, a kernel buffer, a multiply-accumulate cluster, and mode control circuitry. In a first mode of operation, the mode control circuitry stores feature data in a feature line buffer and stores kernel data in a kernel buffer. The data stored in the buffers is transferred to the MAC cluster of the convolutional accelerator for processing. In a second mode of operation the mode control circuitry stores feature data in the kernel buffer and stores kernel data in the feature line buffer. The data stored in the buffers is transferred to the MAC cluster of the convolutional accelerator for processing. The second mode of operation may be employed to efficiently process 1×N kernels, where N is an integer greater than or equal to 1.

23.

发明申请
VARIABLE CLOCK ADAPTATION IN NEURAL NETWORK PROCESSORS 有权

公开(公告)号：US20210081773A1

公开(公告)日：2021-03-18

申请号：US17023144

申请日：2020-09-16

Applicant: STMICROELECTRONICS S.r.l. , STMicroelectronics International N.V.

Inventor： Nitin CHAWLA , Giuseppe DESOLI , Manuj AYODHYAWASI , Thomas BOESCH , Surinder Pal SINGH

IPC: G06N3/063 , G06F1/08 , G06F1/324 , G06F9/50 , G06N3/08 , G06F1/3228 , G06F1/3296

Abstract: Systems and devices are provided to increase computational and/or power efficiency for one or more neural networks via a computationally driven closed-loop dynamic clock control. A clock frequency control word is generated based on information indicative of a current frame execution rate of a processing task of the neural network and a reference clock signal. A clock generator generates the clock signal of neural network based on the clock frequency control word. A reference frequency may be used to generate the clock frequency control word, and the reference frequency may be based on information indicative of a sparsity of data of a training frame.

24.

发明申请
CONVOLUTIONAL NETWORK HARDWARE ACCELERATOR DEVICE, SYSTEM AND METHOD 审中-公开

公开(公告)号：US20200310758A1

公开(公告)日：2020-10-01

申请号：US16833353

申请日：2020-03-27

Applicant: STMICROELECTRONICS S.R.L. , STMicroelectronics International N.V.

Inventor： Giuseppe DESOLI , Thomas BOESCH , Carmine CAPPETTA , Ugo Maria IANNUZZI

IPC: G06F7/544 , G06N3/04

Abstract: A Multiple Accumulate (MAC) hardware accelerator includes a plurality of multipliers. The plurality of multipliers multiply a digit-serial input having a plurality of digits by a parallel input having a plurality of bits by sequentially multiplying individual digits of the digit-serial input by the plurality of bits of the parallel input. A result is generated based on the multiplication of the digit-serial input by the parallel input. An accelerator framework may include multiple MAC hardware accelerators, and may be used to implement a convolutional neural network. The MAC hardware accelerators may multiple an input weight by an input feature by sequentially multiplying individual digits of the input weight by the input feature.

25.

发明申请
ARITHMETIC UNIT FOR DEEP LEARNING ACCELERATION 审中-公开

公开(公告)号：US20190266485A1

公开(公告)日：2019-08-29

申请号：US16280960

申请日：2019-02-20

Applicant: STMICROELECTRONICS S.R.L. , STMICROELECTRONICS INTERNATIONAL N.V.

Inventor： Surinder Pal SINGH , Giuseppe DESOLI , Thomas BOESCH

IPC: G06N3/08 , G06N20/00 , G06F17/11

Abstract: Embodiments of a device include an integrated circuit, a reconfigurable stream switch formed in the integrated circuit, and an arithmetic unit coupled to the reconfigurable stream switch. The arithmetic unit has a plurality of inputs and at least one output, and the arithmetic unit is solely dedicated to performance of a plurality of parallel operations. Each one of the plurality of parallel operations carries out a portion of the formula: output=AX+BY+C.

26.

发明申请
CONFIGURABLE ACCELERATOR FRAMEWORK 审中-公开

公开(公告)号：US20180189642A1

公开(公告)日：2018-07-05

申请号：US15423284

申请日：2017-02-02

Applicant: STMICROELECTRONICS S.R.L. , STMICROELECTRONICS INTERNATIONAL N.V.

Inventor： Thomas BOESCH , Giuseppe DESOLI

IPC: G06N3/063 , G06N3/04 , G06N3/08

Abstract: Embodiments are directed towards a configurable accelerator framework device that includes a stream switch and a plurality of convolution accelerators. The stream switch has a plurality of input ports and a plurality of output ports. Each of the input ports is configurable at run time to unidirectionally pass data to any one or more of the output ports via a stream link. Each one of the plurality of convolution accelerators is configurable at run time to unidirectionally receive input data via at least two of the plurality of stream switch output ports, and each one of the plurality of convolution accelerators is further configurable at run time to unidirectionally communicate output data via an input port of the stream switch.

27.

发明公开
ARITHMETIC UNIT FOR DEEP LEARNING ACCELERATION 审中-公开

公开(公告)号：US20230153621A1

公开(公告)日：2023-05-18

申请号：US18156704

申请日：2023-01-19

Applicant: STMICROELECTRONICS S.r.l. , STMICROELECTRONICS INTERNATIONAL N.V.

Inventor： Surinder Pal SINGH , Giuseppe DESOLI , Thomas BOESCH

IPC: G06N3/08 , G06N20/00 , G06F17/11 , G06N3/063 , G06F9/30 , G06N3/045

CPC classification number: G06N3/08 , G06N20/00 , G06F17/11 , G06N3/063 , G06F9/3001 , G06F9/30032 , G06F9/30036 , G06N3/045

Abstract: An integrated circuit includes a reconfigurable stream switch and an arithmetic circuit. The stream switch, in operation, streams data. The arithmetic circuit has a plurality of inputs coupled to the reconfigurable stream switch. In operation, the arithmetic circuit generates an output according to AX+BY+C, where A, B and C are vector or scalar constants, and X and Y are data streams streamed to the arithmetic circuit through the reconfigurable stream switch.

28.

发明申请
CONVOLUTION ACCELERATION WITH EMBEDDED VECTOR DECOMPRESSION 有权

公开(公告)号：US20230084985A1

公开(公告)日：2023-03-16

申请号：US18056937

申请日：2022-11-18

Applicant: STMICROELECTRONICS S.r.l. , STMicroelectronics International N.V.

Inventor： Thomas BOESCH , Giuseppe DESOLI , Surinder Pal SINGH , Carmine CAPPETTA

IPC: G06N3/063 , G06F9/50 , H03M7/30

Abstract: Techniques and systems are provided for implementing a convolutional neural network. One or more convolution accelerators are provided that each include a feature line buffer memory, a kernel buffer memory, and a plurality of multiply-accumulate (MAC) circuits arranged to multiply and accumulate data. In a first operational mode the convolutional accelerator stores feature data in the feature line buffer memory and stores kernel data in the kernel data buffer memory. In a second mode of operation, the convolutional accelerator stores kernel decompression tables in the feature line buffer memory.

29.

发明申请
NEURAL NETWORK HARDWARE ACCELERATOR CIRCUIT WITH REQUANTIZATION CIRCUITS 有权

公开(公告)号：US20230062910A1

公开(公告)日：2023-03-02

申请号：US17461626

申请日：2021-08-30

Applicant: STMicroelectronics S.r.l. , STMicroelectronics International N.V.

Inventor： Giuseppe DESOLI , Surinder Pal SINGH , Thomas BOESCH

IPC: G06N3/063 , G06N3/04 , G06F9/50

Abstract: A convolutional neural network includes convolution circuitry. The convolution circuitry performs convolution operations on input tensor values. The convolutional neural network includes requantization circuitry that requantizes convolution values output from the convolution circuitry.

30.

发明申请
RECONFIGURABLE HARDWARE BUFFER IN A NEURAL NETWORKS ACCELERATOR FRAMEWORK 有权

公开(公告)号：US20220101086A1

公开(公告)日：2022-03-31

申请号：US17039653

申请日：2020-09-30

Applicant: STMICROELECTRONICS S.r.l. , STMicroelectronics International N.V.

Inventor： Carmine CAPPETTA , Thomas BOESCH , Giuseppe DESOLI

IPC: G06N3/04 , G06N3/063 , G06F9/38 , G06T7/11

Abstract: A convolutional accelerator framework (CAF) has a plurality of processing circuits including one or more convolution accelerators, a reconfigurable hardware buffer configurable to store data of a variable number of input data channels, and a stream switch coupled to the plurality of processing circuits. The reconfigurable hardware buffer has a memory and control circuitry. A number of the variable number of input data channels is associated with an execution epoch. The stream switch streams data of the variable number of input data channels between processing circuits of the plurality of processing circuits and the reconfigurable hardware buffer during processing of the execution epoch. The control circuitry of the reconfigurable hardware buffer configures the memory to store data of the variable number of input data channels, the configuring including allocating a portion of the memory to each of the variable number of input data channels.

Search Results

Country/Region

Patent validity

Application date

Publication (announcement) day

applicant

The country/region where the applicant is located

Inventor

IPC

IPC Department

IPC class

IPC subclass

IPC group

IPC team

Appearance classification