Array-integrated upstream/downstream router for circuit switched parallel connectivity

    公开(公告)号:US12067481B2

    公开(公告)日:2024-08-20

    申请号:US16159684

    申请日:2018-10-14

    发明人: Geoffrey Burr

    摘要: Array-integrated upstream/downstream routers for circuit-switched parallel connectivity are provided. A system comprises an array of neural cores having at least one dimension, a plurality of signal wires, and a plurality of routers. Each neural core comprises a plurality of ordered input wires, a plurality of ordered output wires, and a plurality of synapses, each synapse operatively coupled to one of the plurality of input wires and one of the plurality of output wires. The plurality of signal wires are disposed along each dimension of the array of neural cores. Each router is operatively coupled to one of the plurality of neural cores and to at least one signal wire along each dimension of the array of neural cores. Each of the plurality of routers is adapted to selectively route a signal from the at least one signal wire to its coupled neural core. Each of the plurality of routers is adapted to selectively route a signal from its coupled neural core to the at least one signal wire.

    FLEXIBLE AND ENERGY-EFFICIENT 2-D ROUTER MESH

    公开(公告)号:US20230096894A1

    公开(公告)日:2023-03-30

    申请号:US17487372

    申请日:2021-09-28

    IPC分类号: G06N3/04 G06N3/08

    摘要: An array of neural cores has at least two dimensions. Each of the neural cores comprises ordered input wires, ordered output wires, and synapses, each of the synapses operatively coupled to one of the input wires and one of the output wires. Signal wires are provided. At least one of the signal wires is disposed along each dimension of the array of neural cores. Each of the signal wires is disposed along at least one dimension of the array. Routers are provided, each of which is operatively coupled to (i) one of the neural cores and (ii) at least two of the signal wires, one along each of the dimensions of the array of neural cores. Each of the routers is configured to selectively route a signal from one of its at least two coupled signal wires to its coupled neural core. Each of the routers is configured to selectively route a signal from its coupled neural core to one of its at least two coupled signal wires.

    CALIBRATING PERIPHERAL VARIABILITY

    公开(公告)号:US20220405554A1

    公开(公告)日:2022-12-22

    申请号:US17350162

    申请日:2021-06-17

    IPC分类号: G06N3/063

    摘要: Embodiments herein disclose computer-implemented methods, computer program products and computer systems for balancing neural network weight asymmetries. The computer-implemented method may include providing a neural network with weights comprising one or more major conductance pairs and one or more minor conductance pairs. The method may further include programming the one or more major conductance pairs to force an inference output to an expected duration value, determining a positive weight coefficient based on the one or more major conductance pairs and a negative weight coefficient based on the one or more minor conductance pairs, determining one or more target weights based on one or more of the positive weight coefficient and the negative weight coefficient, programming the one or more minor conductance pairs to force the inference output to the expected duration value, and programming the one or more major conductance pairs with the one or more target weights.

    Special-purpose digital-compute hardware for efficient element-wise aggregation, scaling and offset

    公开(公告)号:US12045612B2

    公开(公告)日:2024-07-23

    申请号:US17931537

    申请日:2022-09-12

    IPC分类号: G06F9/30 G06F9/355 G06N20/00

    摘要: An efficient pipelined implementation of digital scaling, offset and aggregation operation supports element-by-element programmable scale and offset factors. The method includes time-multiplexed parallel pipelining of a plurality of digital data words, each of the plurality of digital data words encoding an N-bit signed integer, from one of a plurality of receive-registers through a datapath that can either (1) store the plurality of digital data words directly in a dedicated first memory, (2) store the plurality of digital data words directly in a dedicated second memory, or (3) direct the plurality of digital data words into a parallel set of fused-multiply-add units. The method further includes multiplying each digital data word by a corresponding data-word retrieved from the dedicated first memory to form product data words and adding the product data words to a corresponding data-word retrieved from the dedicated second memory to form an output sum-and-product data words.

    HARDWARE FOR PARALLEL LAYER-NORM COMPUTE
    9.
    发明公开

    公开(公告)号:US20240211532A1

    公开(公告)日:2024-06-27

    申请号:US18083011

    申请日:2022-12-16

    IPC分类号: G06F17/16 G06N3/063

    CPC分类号: G06F17/16 G06N3/063

    摘要: Systems and methods for performing layer normalization are described. A circuit can receive a sequence of input data across a plurality of clock cycles, where the sequence of input data represents a portion of an input vector. The circuit can determine a plurality of sums and a plurality of sums of squares corresponding to the sequence of input data. The circuit can determine, based on the plurality of sums of squares, a first scalar representing an inverse square-root of a variance of vector elements in the input vector. The circuit can determine a second scalar representing a negation of a product of the first scalar and a mean of the vector elements in the input vector. The circuit can determine, based on the first scalar, the second scalar and the received sequence of input data, an output vector that is a normalization of the input vector.