Patent search ap:("ARM Limited") AND inv:"Paul Nicholas Whatmough" Page 3

21.

发明授权
Apparatus and method for matrix operations 有权

公开(公告)号：US11379556B2

公开(公告)日：2022-07-05

申请号：US16417937

申请日：2019-05-21

Applicant: Arm Limited

Inventor： Matthew Mattina , Zhigang Liu , Paul Nicholas Whatmough , David Hennah Mansell

IPC: G06F17/16

Abstract: There is provided a data processing apparatus to perform an operation on a first matrix and a second matrix. The data processing apparatus includes receiver circuitry to receive elements of the first matrix, elements of the second matrix, and correspondence data to indicate where the elements of the first matrix are located in the first matrix. Determination circuitry performs, using the correspondence data, a determination of whether, for a given element of the first matrix in column i of the first matrix, a given element of the second matrix occurs in row i of the second matrix. Aggregation circuitry calculates an aggregation between a given row in the first matrix and a given column in the second matrix and includes: functional circuitry to perform, in dependence on the determination, a function on the given element of the first matrix and the given element of the second matrix to produce a partial result.

22.

发明申请
Memory for an Artificial Neural Network Accelerator 有权

公开(公告)号：US20220164127A1

公开(公告)日：2022-05-26

申请号：US17103632

申请日：2020-11-24

Applicant: Arm Limited

Inventor： Mudit Bhargava , Paul Nicholas Whatmough , Supreet Jeloka , Zhi-Gang Liu

IPC: G06F3/06 , G06N3/063

Abstract: A memory for an artificial neural network (ANN) accelerator is provided. The memory includes a first bank, a second bank and a bank selector. Each bank includes at least two word lines and a plurality of write word selectors. Each word line stores a plurality of words, and each word has a plurality of bytes. Each write word selector has an input port and a plurality of output ports, is coupled to a corresponding word in each word line, and is configured to select a byte of the corresponding word of a selected word line based on a byte select signal. The bank selector is coupled to the write word selectors of the first bank and the second bank, and configured to select a combination of write word selectors from at least one of the first bank and the second bank based on a bank select signal.

23.

发明申请
Hardware Accelerator For IM2COL Operation 有权

公开(公告)号：US20210390367A1

公开(公告)日：2021-12-16

申请号：US16901542

申请日：2020-06-15

Applicant: Arm Limited

Inventor： Zhi-Gang Liu , Paul Nicholas Whatmough , Matthew Mattina

IPC: G06N3/04 , G06N3/08 , G06F17/16 , G06F9/30

Abstract: The present disclosure advantageously provides a matrix expansion unit that includes an input data selector, a first register set, a second register set, and an output data selector. The input data selector is configured to receive first matrix data in a columnwise format. The first register set is coupled to the input data selector, and includes a plurality of data selectors and a plurality of registers arranged in a first shift loop. The second register set is coupled to the data selector, and includes a plurality of data selectors and a plurality of registers arranged in a second shift loop. The output data selector is coupled to the first register set and the second register set, and is configured to output second matrix data in a rowwise format.

24.

发明申请
Pipelined Accumulator 有权

公开(公告)号：US20210374508A1

公开(公告)日：2021-12-02

申请号：US16885704

申请日：2020-05-28

Applicant: Arm Limited

Inventor： Paul Nicholas Whatmough , Zhi-Gang Liu , Matthew Mattina

IPC: G06N3/063 , G06N3/08

Abstract: The present disclosure advantageously provides a pipelined accumulator that includes a data selector configured to receive a sequence of operands to be summed, an input register coupled to the data selector, an output register, coupled to the data selector, configured to store a sequence of partial sums and output a final sum, and a multi-stage add module coupled to the input register and the output register. The multi-stage add module is configured to store a sequence of partial sums and a final sum in a redundant format, and perform back-to-back accumulation into the output register.

25.

发明授权
Matrix multiplication system and method 有权

公开(公告)号：US11120101B2

公开(公告)日：2021-09-14

申请号：US16585265

申请日：2019-09-27

Applicant: Arm Limited

Inventor： Zhi-Gang Liu , Matthew Mattina , Paul Nicholas Whatmough

IPC: G06F17/16 , H03M7/30

Abstract: The present disclosure advantageously provides a system method for efficiently multiplying matrices with elements that have a value of 0. A bitmap is generated for each matrix. Each bitmap includes a bit position for each matrix element. The value of each bit is set to 0 when the value of the corresponding matrix element is 0, and to 1 when the value of the corresponding matrix element is not 0. Each matrix is compressed into a compressed matrix, which will have fewer elements with a value of 0 than the original matrix. Each bitmap is then adjusted based on the corresponding compressed matrix. The compressed matrices are then multiplied to generate an output matrix. For each element i,j in the output matrix, a dot product of the ith row of the first compressed matrix and the jth column of the second compressed matrix is calculated based on the bitmaps.

26.

发明授权
Apparatus and method for processing a received input signal containing a sequence of data blocks 有权

公开(公告)号：US10797915B2

公开(公告)日：2020-10-06

申请号：US15761212

申请日：2016-09-12

Applicant: ARM LIMITED

Inventor： Paul Nicholas Whatmough , Shidhartha Das

IPC: H04L25/06 , H04L25/49 , H04L25/03 , H04W4/80

Abstract: An apparatus and method are provided for processing a received input signal comprising a sequence of data blocks. Counter circuitry within the apparatus is arranged to receive a digital representation of the input signal, and for each data block generates a count value indicative of occurrences of a property of the digital representation (for example a rising edge or a falling edge) during an associated data block transmission period. Quantization circuitry then maps each count value to a soft decision value from amongst a predetermined set of soft decision values, where the number of soft decision values in the predetermined set exceeds a number of possible data values of the data block. The output circuitry then generates a digital output signal in dependence on the soft decision values. Such an apparatus has been found to provide a low power technique for a receiver, whilst still enabling the improved sensitivity benefits of using soft decisions to be achieved, and allows the apparatus to be constructed using all digital components.

27.

发明申请
SYSTEM, METHOD AND APPARATUS FOR COMPUTATIONALLY EFFICIENT DATA MANIPULATION 审中-公开

公开(公告)号：US20200073911A1

公开(公告)日：2020-03-05

申请号：US16118818

申请日：2018-08-31

Applicant: Arm Limited

Inventor： Paul Nicholas Whatmough , Matthew Mattina , Zhigang Liu

IPC: G06F17/16 , G06N3/08 , G06N3/04 , G06F7/523

Abstract: A system, apparatus and method for exposing input data operands and input weight operands to elements of a two-dimensional array so that two pairs of operands are exposed to each element of the array.

28.

发明授权
Hybrid filter banks for artificial neural networks 有权

公开(公告)号：US12067373B2

公开(公告)日：2024-08-20

申请号：US16836110

申请日：2020-03-31

Applicant: Arm Limited

Inventor： Dibakar Gope , Jesse Garrett Beu , Paul Nicholas Whatmough , Matthew Mattina

IPC: G06F7/483 , G06F7/544 , G06N3/04 , G06N3/063 , G06N3/08

CPC classification number: G06F7/483 , G06F7/5443 , G06N3/04 , G06N3/063 , G06N3/08

Abstract: The present disclosure advantageously provides a system including a memory, a processor, and a circuitry to execute one or more mixed precision layers of an artificial neural network (ANN), each mixed precision layer including high-precision weight filters and low precision weight filters. The circuitry is configured to perform one or more calculations on an input feature map having a plurality of input channels (cin) using the high precision weight filters to create a high precision output feature map having a first number of output channels (k), perform one or more calculations on the input feature map using the low precision weight filters to create a low precision output feature map having a second number of output channels (cout−k), and concatenate the high precision output feature map and the low precision output feature map to create a unified output feature map having a plurality of output channels (cout).

29.

发明公开
Bit Sparse Neural Network Optimization 审中-公开

公开(公告)号：US20240013052A1

公开(公告)日：2024-01-11

申请号：US17861824

申请日：2022-07-11

Applicant: Arm Limited

Inventor： Zhi-Gang Liu , Paul Nicholas Whatmough , John Fremont Brown, III

IPC: G06N3/08

CPC classification number: G06N3/082

Abstract: A method, system and apparatus provide bit-sparse neural network optimization. Rather than quantizing and pruning weight and activation elements at the word level, weight and activation elements are pruned at the bit level, which reduces the density of effective “set” bits in weight and activation data, which, advantageously, reduces the power consumption of the neural network inference process by reducing the degree of bit-level switching during inference.

30.

发明授权
System, method and apparatus for training neural networks using multiple datasets 有权

公开(公告)号：US11640533B2

公开(公告)日：2023-05-02

申请号：US16054358

申请日：2018-08-03

Applicant: Arm Limited

Inventor： Paul Nicholas Whatmough , Matthew Mattina , Jesse Garrett Beu

IPC: G06N3/08 , G06N20/00 , G06N3/04 , G06N3/082

Abstract: A system, an apparatus and methods for utilizing software and hardware portions of a neural network to fix, or hardwire, certain portions, while modifying other portions are provided. A first set of weights for layers of the first neural network are established, and selected weights are modified to generate a second set of weights, based on a second dataset. The second set of weights is then used to train a second neural network.

Search Results

Country/Region

Patent validity

Application date

Publication (announcement) day

applicant

The country/region where the applicant is located

Inventor

IPC

IPC Department

IPC class

IPC subclass

IPC group

IPC team

Appearance classification