ARBITRATION OF DATA TRANSFER REQUESTS
    2.
    发明申请

    公开(公告)号:US20190278727A1

    公开(公告)日:2019-09-12

    申请号:US16425396

    申请日:2019-05-29

    IPC分类号: G06F13/364 H04L29/08

    摘要: A method for arbitrating data transfer requests from a plurality of nodes includes specifying one or more nodes among the plurality of nodes, the one or more nodes satisfying a predetermined condition, and selecting, if two or more nodes are specified among the plurality of nodes, one node from the two or more nodes using priority information, the priority information indicating correspondence between the plurality of nodes and a plurality of priorities each assigned to one of the plurality of nodes, the correspondence changing so that the plurality of priorities are assigned equally to each of the plurality of nodes and high and low relations appear equally between pairs of priorities each assigned to a pair of nodes of the plurality of nodes.

    Special-purpose digital-compute hardware for efficient element-wise aggregation, scaling and offset

    公开(公告)号:US12045612B2

    公开(公告)日:2024-07-23

    申请号:US17931537

    申请日:2022-09-12

    IPC分类号: G06F9/30 G06F9/355 G06N20/00

    摘要: An efficient pipelined implementation of digital scaling, offset and aggregation operation supports element-by-element programmable scale and offset factors. The method includes time-multiplexed parallel pipelining of a plurality of digital data words, each of the plurality of digital data words encoding an N-bit signed integer, from one of a plurality of receive-registers through a datapath that can either (1) store the plurality of digital data words directly in a dedicated first memory, (2) store the plurality of digital data words directly in a dedicated second memory, or (3) direct the plurality of digital data words into a parallel set of fused-multiply-add units. The method further includes multiplying each digital data word by a corresponding data-word retrieved from the dedicated first memory to form product data words and adding the product data words to a corresponding data-word retrieved from the dedicated second memory to form an output sum-and-product data words.

    HARDWARE FOR PARALLEL LAYER-NORM COMPUTE
    8.
    发明公开

    公开(公告)号:US20240211532A1

    公开(公告)日:2024-06-27

    申请号:US18083011

    申请日:2022-12-16

    IPC分类号: G06F17/16 G06N3/063

    CPC分类号: G06F17/16 G06N3/063

    摘要: Systems and methods for performing layer normalization are described. A circuit can receive a sequence of input data across a plurality of clock cycles, where the sequence of input data represents a portion of an input vector. The circuit can determine a plurality of sums and a plurality of sums of squares corresponding to the sequence of input data. The circuit can determine, based on the plurality of sums of squares, a first scalar representing an inverse square-root of a variance of vector elements in the input vector. The circuit can determine a second scalar representing a negation of a product of the first scalar and a mean of the vector elements in the input vector. The circuit can determine, based on the first scalar, the second scalar and the received sequence of input data, an output vector that is a normalization of the input vector.

    Arbitration of data transfer requests

    公开(公告)号:US11144489B2

    公开(公告)日:2021-10-12

    申请号:US16425396

    申请日:2019-05-29

    摘要: A method for arbitrating data transfer requests from a plurality of nodes includes specifying one or more nodes among the plurality of nodes, the one or more nodes satisfying a predetermined condition, and selecting, if two or more nodes are specified among the plurality of nodes, one node from the two or more nodes using priority information, the priority information indicating correspondence between the plurality of nodes and a plurality of priorities each assigned to one of the plurality of nodes, the correspondence changing so that the plurality of priorities are assigned equally to each of the plurality of nodes and high and low relations appear equally between pairs of priorities each assigned to a pair of nodes of the plurality of nodes.

    Packet broadcasting mechanism for mesh interconnected multi-computers

    公开(公告)号:US10999191B2

    公开(公告)日:2021-05-04

    申请号:US16739929

    申请日:2020-01-10

    摘要: A method is provided for packet broadcasting in a mesh-interconnected multi-computer network having a plurality of routers interconnected to a plurality of arbiters. The method includes live-lock free arbitering, by each of the plurality of arbiters, between two or more packet broadcast requests using a shared priority matrix, implemented by a binary matrix, that selects one of the two or more packet broadcast requests and includes a column for each of the plurality of routers, the shared priority matrix being shared amongst the plurality of arbiters and storing priority information determined from summing the matrix column values and relating to a correspondence between a plurality of packet broadcast requests, including the two or more packet broadcast requests, with respect to priorities assigned to each of the plurality of packet broadcast requests. Each of the columns of the shared priority matrix corresponds to a respective one of the routers.