Current Spike Mitigation Technique for Neural Networks

    公开(公告)号:US20230297432A1

    公开(公告)日:2023-09-21

    申请号:US17697706

    申请日:2022-03-17

    申请人: Arm Limited

    IPC分类号: G06F9/50 G06N3/02

    CPC分类号: G06F9/505 G06N3/02

    摘要: Various implementations described herein are related to a method that monitors workloads of a neural network for current spikes. The method may determine current transitions of the workloads that result in rapid changes in load current consumption of the neural network. The method may modify load scheduling of the neural network so as to smooth and/or stabilize the current transitions of the workloads.

    Nibble Block Format
    5.
    发明申请

    公开(公告)号:US20230076138A1

    公开(公告)日:2023-03-09

    申请号:US17470470

    申请日:2021-09-09

    申请人: Arm Limited

    摘要: A matrix multiplication system and method are provided. The system includes a memory that stores one or more weight tensors, a processor and a matrix multiply accelerator (MMA). The processor converts each weight tensor into an encoded block set that is stored in the memory. Each encoded block set includes a number of encoded blocks, and each encoded block includes a data field and an index field. The MMA converts each encoded block set into a reconstructed weight tensor, and convolves each reconstructed weight tensor and an input data tensor to generate an output data matrix.

    Sparse Finetuning for Artificial Neural Networks

    公开(公告)号:US20210192323A1

    公开(公告)日:2021-06-24

    申请号:US16720380

    申请日:2019-12-19

    申请人: Arm Limited

    IPC分类号: G06N3/063 G06N3/04 G06N3/08

    摘要: The present disclosure advantageously provides a hardware accelerator for an artificial neural network (ANN), including a communication bus interface, a memory, a controller, and at least one processing engine (PE). The communication bus interface is configured to receive a plurality of finetuned weights associated with the ANN, receive input data, and transmit output data. The memory is configured to store the plurality of finetuned weights, the input data and the output data. The PE is configured to receive the input data, execute an ANN model using a plurality of fixed weights associated with the ANN and the plurality of finetuned weights, and generate the output data. Each finetuned weight corresponds to a fixed weight.

    Pipelined accumulator
    9.
    发明授权

    公开(公告)号:US11501151B2

    公开(公告)日:2022-11-15

    申请号:US16885704

    申请日:2020-05-28

    申请人: Arm Limited

    IPC分类号: G06N7/00 G06N3/063

    摘要: The present disclosure advantageously provides a pipelined accumulator that includes a data selector configured to receive a sequence of operands to be summed, an input register coupled to the data selector, an output register, coupled to the data selector, configured to store a sequence of partial sums and output a final sum, and a multi-stage add module coupled to the input register and the output register. The multi-stage add module is configured to store a sequence of partial sums and a final sum in a redundant format, and perform back-to-back accumulation into the output register.

    Mixed-Signal Artificial Neural Network Accelerator

    公开(公告)号:US20220180158A1

    公开(公告)日:2022-06-09

    申请号:US17116623

    申请日:2020-12-09

    申请人: Arm Limited

    IPC分类号: G06N3/063 G06N3/04

    摘要: An artificial neural network (ANN) accelerator is provided. The ANN accelerator includes digital controlled oscillators (DCOs), digital-to-time converters (DTCs) and a mixed-signal multiply-and-accumulate (MAC) array. Each DCO generates a first analog operand signal based on a first digital data value, and transmits the first analog operand signal along a respective column signal line. Each DTC generates a second analog operand signal based on a second digital data value, and transmits the second analog operand signal along a respective row signal line. The mixed-signal MAC array is coupled to the row and column signal lines, and includes mixed-signal MAC units. Each mixed-signal MAC unit includes an integrated clock gate (ICG) that generates a digital product signal based on the first and second analog operand signals, and a counter circuit that increments or decrements a count value stored in a register based on the digital product signal.