-
公开(公告)号:US11900240B2
公开(公告)日:2024-02-13
申请号:US17023144
申请日:2020-09-16
Inventor: Nitin Chawla , Giuseppe Desoli , Manuj Ayodhyawasi , Thomas Boesch , Surinder Pal Singh
IPC: G06N3/06 , G06F1/32 , G06F9/50 , G06F1/08 , G06N3/063 , G06N3/082 , G06F1/3228 , G06F1/324 , G06F1/3296
CPC classification number: G06N3/063 , G06F1/08 , G06F1/324 , G06F1/3228 , G06F1/3296 , G06F9/5027 , G06N3/082
Abstract: Systems and devices are provided to increase computational and/or power efficiency for one or more neural networks via a computationally driven closed-loop dynamic clock control. A clock frequency control word is generated based on information indicative of a current frame execution rate of a processing task of the neural network and a reference clock signal. A clock generator generates the clock signal of neural network based on the clock frequency control word. A reference frequency may be used to generate the clock frequency control word, and the reference frequency may be based on information indicative of a sparsity of data of a training frame.
-
公开(公告)号:US11687762B2
公开(公告)日:2023-06-27
申请号:US16280991
申请日:2019-02-20
Inventor: Surinder Pal Singh , Thomas Boesch , Giuseppe Desoli
CPC classification number: G06N3/063 , G06F1/26 , G06F17/16 , G06F17/175 , G06N3/045 , G06N3/08 , G06N20/00
Abstract: Embodiments of a device include an integrated circuit, a reconfigurable stream switch formed in the integrated circuit along with a plurality of convolution accelerators and an arithmetic unit coupled to the reconfigurable stream switch. The arithmetic unit has at least one input and at least one output. The at least one input is arranged to receive streaming data passed through the reconfigurable stream switch, and the at least one output is arranged to stream resultant data through the reconfigurable stream switch. The arithmetic unit also has a plurality of data paths. At least one of the plurality of data paths is solely dedicated to performance of operations that accelerate an activation function represented in the form of a piece-wise second order polynomial approximation.
-
公开(公告)号:US11593609B2
公开(公告)日:2023-02-28
申请号:US16794062
申请日:2020-02-18
Inventor: Giuseppe Desoli , Carmine Cappetta , Thomas Boesch , Surinder Pal Singh , Saumya Suneja
Abstract: Embodiments of an electronic device include an integrated circuit, a reconfigurable stream switch formed in the integrated circuit along with a plurality of convolution accelerators and a decompression unit coupled to the reconfigurable stream switch. The decompression unit decompresses encoded kernel data in real time during operation of convolutional neural network.
-
公开(公告)号:US11836346B2
公开(公告)日:2023-12-05
申请号:US17742987
申请日:2022-05-12
Inventor: Nitin Chawla , Giuseppe Desoli , Anuj Grover , Thomas Boesch , Surinder Pal Singh , Manuj Ayodhyawasi
CPC classification number: G06F3/0604 , G06F3/0619 , G06F3/0655 , G06F3/0679 , G06N3/08
Abstract: A memory array arranged as a plurality of memory cells. The memory cells are configured to operate at a determined voltage. A memory management circuitry coupled to the plurality of memory cells tags a first set of the plurality of memory cells as low-voltage cells and tags a second set of the plurality of memory cells as high-voltage cells. A power source provides a low voltage to the first set of memory cells and provides a high voltage to the second set of memory cells based on the tags.
-
公开(公告)号:US11823771B2
公开(公告)日:2023-11-21
申请号:US17158875
申请日:2021-01-26
Inventor: Nitin Chawla , Thomas Boesch , Anuj Grover , Surinder Pal Singh , Giuseppe Desoli
Abstract: A system includes a random access memory organized into individually addressable words. Streaming access control circuitry is coupled to word lines of the random access memory. The streaming access control circuitry responds to a request to access a plurality of individually addressable words of a determined region of the random access memory by generating control signals to drive the word lines to streamingly access the plurality of individually addressable words of the determined region. The request indicates an offset associated with the determined region and a pattern associated with the streaming access.
-
公开(公告)号:US11710032B2
公开(公告)日:2023-07-25
申请号:US18055245
申请日:2022-11-14
Inventor: Surinder Pal Singh , Thomas Boesch , Giuseppe Desoli
CPC classification number: G06N3/063 , G06F17/18 , G06F18/217 , G06N3/04 , G06N3/08
Abstract: A convolutional neural network includes a pooling unit. The pooling unit performs pooling operations between convolution layers of the convolutional neural network. The pooling unit includes hardware blocks that promote computational and area efficiency in the convolutional neural network.
-
公开(公告)号:US11531873B2
公开(公告)日:2022-12-20
申请号:US16909673
申请日:2020-06-23
Inventor: Thomas Boesch , Giuseppe Desoli , Surinder Pal Singh , Carmine Cappetta
Abstract: Techniques and systems are provided for implementing a convolutional neural network. One or more convolution accelerators are provided that each include a feature line buffer memory, a kernel buffer memory, and a plurality of multiply-accumulate (MAC) circuits arranged to multiply and accumulate data. In a first operational mode the convolutional accelerator stores feature data in the feature line buffer memory and stores kernel data in the kernel data buffer memory. In a second mode of operation, the convolutional accelerator stores kernel decompression tables in the feature line buffer memory.
-
公开(公告)号:US11507831B2
公开(公告)日:2022-11-22
申请号:US16799671
申请日:2020-02-24
Inventor: Surinder Pal Singh , Thomas Boesch , Giuseppe Desoli
Abstract: A convolutional neural network includes a pooling unit. The pooling unit performs pooling operations between convolution layers of the convolutional neural network. The pooling unit includes hardware blocks that promote computational and area efficiency in the convolutional neural network.
-
公开(公告)号:US10977854B2
公开(公告)日:2021-04-13
申请号:US16280963
申请日:2019-02-20
Inventor: Surinder Pal Singh , Thomas Boesch , Giuseppe Desoli
IPC: G06T7/11 , G06K9/00 , G06T15/08 , G06T7/62 , G06F16/901 , G06F9/38 , G06K9/62 , G06N3/08 , G06N3/04 , G06N3/063
Abstract: Embodiments of a device include on-board memory, an applications processor, a digital signal processor (DSP) cluster, a configurable accelerator framework (CAF), and at least one communication bus architecture. The communication bus communicatively couples the applications processor, the DSP cluster, and the CAF to the on-board memory. The CAF includes a reconfigurable stream switch and a data volume sculpting unit, which has an input and an output coupled to the reconfigurable stream switch. The data volume sculpting unit has a counter, a comparator, and a controller. The data volume sculpting unit is arranged to receive a stream of feature map data that forms a three-dimensional (3D) feature map. The 3D feature map is formed as a plurality of two-dimensional (2D) data planes. The data volume sculpting unit is also arranged to identify a 3D volume within the 3D feature map that is dimensionally smaller than the 3D feature map and isolate data from the 3D feature map that is within the 3D volume for processing in a deep learning algorithm.
-
10.
公开(公告)号:US20190158858A1
公开(公告)日:2019-05-23
申请号:US16251798
申请日:2019-01-18
Applicant: STMICROELECTRONICS INTERNATIONAL N.V.
Inventor: Sumit Johar , Surinder Pal Singh
IPC: H04N19/433
Abstract: Video encoders and decoders and video encoding and decoding methods are provided. A video encoder includes an input buffer configured to receive a video data stream and to supply current frame data, a frame buffer configured to store reconstructed frame data, and an encoder circuit configured to read reference frame data from the frame buffer, to encode the current frame data received from the input buffer using the reference frame data and to write the reconstructed frame data to the frame buffer. The encoder circuit may be configured to write the reconstructed frame data by overwriting the reference frame data in the frame buffer.
-
-
-
-
-
-
-
-
-