HARDWARE ACCELERATION WITH PRECONDITIONERS
    5.
    发明公开

    公开(公告)号:US20230195457A1

    公开(公告)日:2023-06-22

    申请号:US17556252

    申请日:2021-12-20

    IPC分类号: G06F9/30 G06F17/16

    摘要: Techniques are provided to implement hardware accelerated application of preconditioners to solve linear equations. For example, a system includes a processor, and a resistive processing unit coupled to the processor. The resistive processing unit includes an array of cells which include respective resistive devices, wherein at least a portion of the resistive devices are tunable to encode entries of a preconditioning matrix which is storable in the array of cells. When the preconditioning matrix is stored in the array of cells, the processor is configured to apply the preconditioning matrix to a plurality of residual vectors by executing a process which includes performing analog matrix-vector multiplication operations on the preconditioning matrix and respective ones of the plurality of residual vectors to generate a plurality of output vectors used in one or more subsequent operations.

    DETERMINING TRIANGLES IN GRAPH DATA STRUCTURES USING CROSSPOINT ARRAY

    公开(公告)号:US20220300575A1

    公开(公告)日:2022-09-22

    申请号:US17207912

    申请日:2021-03-22

    摘要: Techniques for determining a count of triangles (tr) in a graph data structure using a crosspoint array is described. An adjacency matrix (a) representing the graph is mapped to the crosspoint array by configuring resistance values of crosspoint devices in the array. The count of triangles is initialized to zero (tr=0), and iteratively updated. The updating includes generating a first vector (x1) stochastically to include digital values in a predetermined range, which are converted into the voltage values. A multiplication of the adjacency matrix and the first vector (ax1) is computed using the crosspoint array. A second voltage vector (z1=ax1) is generated that includes voltage values representing the multiplication result. The adjacency matrix and the second voltage vector (z2=az1) are multiplied using the crosspoint array. The computer updates the number of triangles in the graph data structure as tr=tr+Z1T.

    MATRIX SKETCHING USING ANALOG CROSSBAR ARCHITECTURES

    公开(公告)号:US20210357540A1

    公开(公告)日:2021-11-18

    申请号:US16874819

    申请日:2020-05-15

    IPC分类号: G06F30/10

    摘要: A computer-implemented method is presented for performing matrix sketching by employing an analog crossbar architecture. The method includes low rank updating a first matrix for a first period of time, copying the first matrix into a dynamic correction computing device, switching to a second matrix to low rank update the second matrix for a second period of time, as the second matrix is low rank updated, feeding the first matrix with first stochastic pulses to reset the first matrix back to a first matrix symmetry point, copying the second matrix into the dynamic correction computing device, switching back to the first matrix to low rank update the first matrix for a third period of time, and as the first matrix is low rank updated, feeding the second matrix with second stochastic pulses to reset the second matrix back to a second matrix symmetry point.

    DIRECTED GRAPH AUTOENCODER DEVICES AND METHODS

    公开(公告)号:US20240296324A1

    公开(公告)日:2024-09-05

    申请号:US18356958

    申请日:2023-07-21

    IPC分类号: G06N3/08

    CPC分类号: G06N3/08

    摘要: A directed graph autoencoder device includes one or more memories and a processor coupled to the one or more memories and configured to implement a graph convolutional layer. The graph convolutional layer comprises a plurality of nodes and is configured to generate transformed dual vector representations by applying a source weight matrix and a target weight matrix to input dual vector representations of the plurality of nodes. The input dual vector representations comprise, for each node of the plurality of nodes, a source vector representation that corresponds to the node in its role as a source and a target vector representation that corresponds to the node in its role as a target. The graph convolutional layer is further configured to scale the transformed dual vector representations to generate scaled transformed dual vector representations. The graph convolutional layer is further configured to perform message passing using the scaled transformed dual vector representations.

    Hardware acceleration with preconditioners

    公开(公告)号:US11907715B2

    公开(公告)日:2024-02-20

    申请号:US17556252

    申请日:2021-12-20

    IPC分类号: G06F9/30 G06F17/16

    摘要: Techniques are provided to implement hardware accelerated application of preconditioners to solve linear equations. For example, a system includes a processor, and a resistive processing unit coupled to the processor. The resistive processing unit includes an array of cells which include respective resistive devices, wherein at least a portion of the resistive devices are tunable to encode entries of a preconditioning matrix which is storable in the array of cells. When the preconditioning matrix is stored in the array of cells, the processor is configured to apply the preconditioning matrix to a plurality of residual vectors by executing a process which includes performing analog matrix-vector multiplication operations on the preconditioning matrix and respective ones of the plurality of residual vectors to generate a plurality of output vectors used in one or more subsequent operations.