Patent search ap:("Arm Limited") AND inv:"Matthew Mattina" Page 2

11.

发明授权
Hardware accelerator for IM2COL operation 有权

公开(公告)号：US11783163B2

公开(公告)日：2023-10-10

申请号：US16901542

申请日：2020-06-15

Applicant: Arm Limited

Inventor： Zhi-Gang Liu , Paul Nicholas Whatmough , Matthew Mattina

IPC: G06N3/04 , G06N3/08 , G06F17/16 , G06F9/30

CPC classification number: G06N3/04 , G06F9/30105 , G06F17/16 , G06N3/08

Abstract: The present disclosure advantageously provides a matrix expansion unit that includes an input data selector, a first register set, a second register set, and an output data selector. The input data selector is configured to receive first matrix data in a columnwise format. The first register set is coupled to the input data selector, and includes a plurality of data selectors and a plurality of registers arranged in a first shift loop. The second register set is coupled to the data selector, and includes a plurality of data selectors and a plurality of registers arranged in a second shift loop. The output data selector is coupled to the first register set and the second register set, and is configured to output second matrix data in a rowwise format.

12.

发明申请
SYSTEM, CIRCUIT, DEVICE AND/OR PROCESSES FOR ACCUMULATING NEURAL NETWORK SIGNALS 有权

公开(公告)号：US20230026113A1

公开(公告)日：2023-01-26

申请号：US17382108

申请日：2021-07-21

Applicant: Arm Limited

Inventor： Paul Nicholas Whatmough , Zhi-Gang Liu , Matthew Mattina

IPC: G06N3/04 , G06N3/063

Abstract: Example methods, devices and/or circuits to be implemented in a processing device to perform neural network-based computing operations. According to an embodiment, an accumulation of weighted activation input values may be computed on accumulation cycles at least in part by multiplying and/or scaling accumulated activation input values by an associated neural network weight.

13.

发明申请
Time Domain Unrolling Sparse Matrix Multiplication System and Method 有权

公开(公告)号：US20220035890A1

公开(公告)日：2022-02-03

申请号：US17103676

申请日：2020-11-24

Applicant: Arm Limited

Inventor： Zhi-Gang Liu , Paul Nicholas Whatmough , Matthew Mattina

IPC: G06F17/16 , G06F7/544 , G06F7/523 , G06F7/50 , G06F9/50 , G06F15/80

Abstract: A system and method for multiplying matrices are provided. The system includes a processor coupled to a memory and a matrix multiply accelerator (MMA) coupled to the processor. The MMA is configured to multiply, based on a bitmap, a compressed first matrix and a second matrix to generate an output matrix including, for each element i,j of the output matrix, calculate a dot product of an ith row of the compressed first matrix and a jth column of the second matrix based on the bitmap. Or, the MMA is configured to multiply, based on the bitmap, the second matrix and the compressed first matrix and to generate the output matrix including, for each element i,j of the output matrix, calculate a dot product of an ith row of the second matrix and a jth column of the compressed first matrix based on the bitmap.

14.

发明申请
Modulo Operation Unit 有权

公开(公告)号：US20210374509A1

公开(公告)日：2021-12-02

申请号：US16889031

申请日：2020-06-01

Applicant: Arm Limited

Inventor： Zhi-Gang Liu , Matthew Mattina

IPC: G06N3/063 , G06N3/08 , G06F9/50 , G06F9/38 , G06F7/72 , G06F7/50 , G06F7/02

Abstract: The present disclosure advantageously provides a modulo operation unit that includes a first input configured to receive operand data, a second input configured to receive modulus data, an initial modulo stage, a sequence of intermediate modulo stages, and a final modulo stage.

15.

发明申请
MIXED-PRECISION COMPUTATION UNIT 有权

公开(公告)号：US20210089889A1

公开(公告)日：2021-03-25

申请号：US16836117

申请日：2020-03-31

Applicant: Arm Limited

Inventor： Dibakar Gope , Jesse Garrett Beu , Paul Nicholas Whatmough , Matthew Mattina

IPC: G06N3/08

Abstract: The present disclosure advantageously provides a mixed precision computation (MPC) unit for executing one or more mixed-precision layers of an artificial neural network (ANN). The MPC unit includes a multiplier circuit configured to input a pair of operands and output a product, a first adder circuit coupled to the multiplier circuit, a second adder circuit, coupled to the first adder circuit, configured to input a pair of operands, an accumulator circuit, coupled to the multiplier circuit and the first adder circuit, configured to output an accumulated value, and a controller, coupled to the multiplier circuit, the first adder circuit, the second adder circuit and the accumulator circuit, configured to input a mode control signal. The controller has a plurality of operating modes including a high precision mode, a low precision add mode and a low precision multiply mode.

16.

发明申请
Refactoring MAC Computations for Reduced Programming Steps 有权

公开(公告)号：US20210064379A1

公开(公告)日：2021-03-04

申请号：US16556101

申请日：2019-08-29

Applicant: Arm Limited

Inventor： Matthew Mattina , Shidhartha Das , Glen Arnold Rosendale , Fernando Garcia Redondo

IPC: G06F9/38 , G06F9/30 , G06F7/487 , G06F7/544 , G06F17/16 , G06N3/06

Abstract: A method and architecture for performing multiply-accumulate operations in a neural network is disclosed. The architecture includes a crossbar having a plurality of non-volatile memory elements. A plurality of input activations is applied to the crossbar, which are then summed by binary weight encoding a plurality of the non-volatile memory elements to connect the input activations to weight values. At least one of the plurality of non-volatile memory elements is then precision programmed.

17.

发明授权
System, method and apparatus for computationally efficient data manipulation 有权

公开(公告)号：US10747845B2

公开(公告)日：2020-08-18

申请号：US16118818

申请日：2018-08-31

Applicant: Arm Limited

Inventor： Paul Nicholas Whatmough , Matthew Mattina , Zhigang Liu

IPC: G06F17/16 , G06F7/523 , G06N3/04 , G06N3/08

Abstract: A system, apparatus and method for exposing input data operands and input weight operands to elements of a two-dimensional array so that two pairs of operands are exposed to each element of the array.

18.

发明申请
SYSTOLIC CONVOLUTIONAL NEURAL NETWORK 审中-公开

公开(公告)号：US20190311243A1

公开(公告)日：2019-10-10

申请号：US15945952

申请日：2018-04-05

Applicant: Arm Limited

Inventor： Paul Nicholas Whatmough , Ian Rudolf Bratt , Matthew Mattina

IPC: G06N3/04 , G06N3/063

Abstract: A circuit and method are provided for performing convolutional neural network computations for a neural network. The circuit includes a transposing buffer configured to receive actuation feature vectors along a first dimension and to output feature component vectors along a second dimension, a weight buffer configured to store kernel weight vectors along a first dimension and further configured to output kernel component vectors along a second dimension, and a systolic array configured to receive the kernel weight vectors along a first dimension and to receive the feature component vectors along a second dimension. The systolic array includes an array of multiply and accumulate (MAC) processing cells. Each processing cell is associated with an output value. The actuation feature vectors may be shifted into the transposing buffer along the first dimension and output feature component vectors may shifted out of the transposing buffer along the second dimension, providing efficient dataflow.

19.

发明授权
Time domain unrolling sparse matrix multiplication system and method 有权

公开(公告)号：US11928176B2

公开(公告)日：2024-03-12

申请号：US17103676

申请日：2020-11-24

Applicant: Arm Limited

Inventor： Zhi-Gang Liu , Paul Nicholas Whatmough , Matthew Mattina

IPC: G06F17/16 , G06F7/544 , G06F9/38 , G06F15/80

CPC classification number: G06F17/16 , G06F7/5443 , G06F15/80 , G06F9/3893

Abstract: A system and method for multiplying matrices are provided. The system includes a processor coupled to a memory and a matrix multiply accelerator (MMA) coupled to the processor. The MMA is configured to multiply, based on a bitmap, a compressed first matrix and a second matrix to generate an output matrix including, for each element i,j of the output matrix, a calculation of a dot product of an ith row of the compressed first matrix and a jth column of the second matrix based on the bitmap. Or, the MMA is configured to multiply, based on the bitmap, the second matrix and the compressed first matrix and to generate the output matrix including, for each element i,j of the output matrix, a calculation of a dot product of an ith row of the second matrix and a jth column of the compressed first matrix based on the bitmap.

20.

发明授权
Refactoring mac operations 有权

公开(公告)号：US11922169B2

公开(公告)日：2024-03-05

申请号：US17674503

申请日：2022-02-17

Applicant: Arm Limited

Inventor： Matthew Mattina , Shidhartha Das , Glen Arnold Rosendale , Fernando Garcia Redondo

IPC: G06F9/38 , G06F7/487 , G06F7/544 , G06F9/30 , G06F17/16 , G06N3/06

CPC classification number: G06F9/3893 , G06F7/4876 , G06F7/5443 , G06F9/30014 , G06F17/16 , G06N3/06

Abstract: A method and apparatus for performing refactored multiply-and-accumulate operations is provided. A summing array includes a plurality of non-volatile memory elements arranged in columns. Each non-volatile memory element in the summing array is programmed to a high resistance state or a low resistance state based on weights of a neural network. The summing array is configured to generate a summed signal for each column based, at least in part, on a plurality of input signals. A multiplying array is coupled to the summing array, and includes a plurality of non-volatile memory elements. Each non-volatile memory element in the multiplying array is programmed to a different conductance level based on the weights of the neural network. The multiplying array is configured to generate an output signal based, at least in part, on the summed signals from the summing array.

Search Results

Country/Region

Patent validity

Application date

Publication (announcement) day

applicant

The country/region where the applicant is located

Inventor

IPC

IPC Department

IPC class

IPC subclass

IPC group

IPC team

Appearance classification