COMPUTE NEAR MEMORY CONVOLUTION ACCELERATOR
    2.
    发明公开

    公开(公告)号:US20230334006A1

    公开(公告)日:2023-10-19

    申请号:US18212079

    申请日:2023-06-20

    CPC classification number: G06F15/8046 G06F17/153 G06N3/063

    Abstract: A compute near memory (CNM) convolution accelerator enables a convolutional neural network (CNN) to use dedicated acceleration to achieve efficient in-place convolution operations with less impact on memory and energy consumption. A 2D convolution operation is reformulated as 1D row-wise convolution. The 1D row-wise convolution enables the CNM convolution accelerator to process input activations row-by-row, while using the weights one-by-one. Lightweight access circuits provide the ability to stream both weights and input rows as vectors to MAC units, which in turn enables modules of the CNM convolution accelerator to implement convolution for both [1×1] and chosen [n×n] sized filters.

    COMPUTE NEAR MEMORY CONVOLUTION ACCELERATOR
    4.
    发明申请

    公开(公告)号:US20200034148A1

    公开(公告)日:2020-01-30

    申请号:US16586975

    申请日:2019-09-28

    Abstract: A compute near memory (CNM) convolution accelerator enables a convolutional neural network (CNN) to use dedicated acceleration to achieve efficient in-place convolution operations with less impact on memory and energy consumption. A 2D convolution operation is reformulated as 1D row-wise convolution. The 1D row-wise convolution enables the CNM convolution accelerator to process input activations row-by-row, while using the weights one-by-one. Lightweight access circuits provide the ability to stream both weights and input row as vectors to MAC units, which in turn enables modules of the CNM convolution accelerator to implement convolution for both [1×1] and chosen [n×n] sized filters.

    END-TO-END DATA PROTECTION FOR COMPUTE IN MEMORY (CIM)/COMPUTE NEAR MEMORY (CNM)

    公开(公告)号:US20220107867A1

    公开(公告)日:2022-04-07

    申请号:US17553623

    申请日:2021-12-16

    Abstract: A near memory compute system includes multiple computation nodes, such as nodes for parallel distributed processing. The nodes include a memory device to store data and compute hardware to perform a computation on the data. Error correction code (ECC) logic performs ECC on the data prior to computation on the data by the compute hardware. The node also includes residue check logic to perform a residue check on a result of the computation.

Patent Agency Ranking