FREQUENCY DOMAIN NEURAL NETWORK ACCELERATOR

    公开(公告)号:US20210319289A1

    公开(公告)日:2021-10-14

    申请号:US16846966

    申请日:2020-04-13

    摘要: The present disclosure relates to systems and methods concerning a system including a host device and a convolutional neural network hardware accelerator. The hardware accelerator can be configured, at least in part by the host device, to generate activation data from spatial-domain input data and spatial-domain weight data using frequency-domain operations. The hardware accelerator can include one or more discrete Fourier transform units configured to generate a frequency-domain representation of the input data. The hardware accelerator can include a multiplication unit configured to generate a frequency-domain representation of the activation data by element-wise complex multiplication of the frequency-domain representation of the input data and a frequency-domain representation of the weight data. The hardware accelerator can also include an inverse discrete Fourier transform unit configured to generate a spatial-domain representation of the activation data from the frequency-domain representation of the activation data.

    APPARATUS AND METHOD FOR REPRESENTATION OF A SPARSE MATRIX IN A NEURAL NETWORK

    公开(公告)号:US20210240684A1

    公开(公告)日:2021-08-05

    申请号:US16783069

    申请日:2020-02-05

    摘要: The present disclosure relates to a method and an apparatus for representation of a sparse matrix in a neural network. In some embodiments, an exemplary operation unit includes a buffer for storing a representation of a sparse matrix in a neural network, a sparse engine communicatively coupled with the buffer, and a processing array communicatively coupled with the sparse engine. The sparse engine includes circuitry to: read the representation of the sparse matrix from the buffer, the representation comprising a first level bitmap, a second level bitmap, and an element array; decompress the first level bitmap to determine whether a block of the sparse matrix comprises a non-zero element; and in response to the block comprising a non-zero element, decompress the second level bitmap using the element array to obtain the block of the sparse matrix. The processing array includes circuitry to execute the neural network with the sparse matrix.

    MEMORY INTERCONNECTION ARCHITECTURE SYSTEMS AND METHODS

    公开(公告)号:US20220101887A1

    公开(公告)日:2022-03-31

    申请号:US17037134

    申请日:2020-09-29

    摘要: The systems and methods are configured to efficiently and effectively include processing capabilities in memory. In one embodiment, a processing in memory (PIM) chip a memory array, logic components, and an interconnection network. The memory array is configured to store information. In one exemplary implementation the memory array includes storage cells and array periphery components. The logic components can be configured to process information stored in the memory array. The interconnection network is configured to communicatively couple the logic components. The interconnection network can include interconnect wires, and a portion of the interconnect wires are located in a metal layer area that is located above the memory array.