APPARATUS WITH ACCELERATED MACHINE LEARNING PROCESSING

    公开(公告)号:US20220036243A1

    公开(公告)日:2022-02-03

    申请号:US17147858

    申请日:2021-01-13

    Abstract: An apparatus includes a global memory and a systolic array. The global memory is configured to store and provide an input feature map (IFM) vector stream from an IFM tensor and a kernel vector stream from a kernel tensor. The systolic array is configured to receive the IFM vector stream and the kernel vector stream from the global memory. The systolic array is on-chip together with the global memory. The systolic array includes a plurality of processing elements (PEs) each having a plurality of vector units, each of the plurality of vector units being configured to perform a dot-product operation on at least one IFM vector of the IFM vector stream and at least one kernel vector of the kernel vector stream per unit clock cycle to generate a plurality of output feature maps (OFMs).

    METHOD AND DEVICE OF ACCESSING MEMORY WITH NEAR MEMORY ACCELERATOR

    公开(公告)号:US20240311009A1

    公开(公告)日:2024-09-19

    申请号:US18439092

    申请日:2024-02-12

    CPC classification number: G06F3/061 G06F3/0629 G06F3/0673

    Abstract: Disclosed are a method of accessing a memory and an electronic device for performing the method. The electronic device includes a processor, and a memory electrically connected to the processor, wherein the processor may be configured to select a rank including bank groups of the memory, select a bank corresponding to a memory address to be accessed from among banks included in the selected rank, select a row and one or more columns from rows and columns of the selected bank corresponding to the memory address, and generate the memory address to access the memory based on an address mapping scheme according to the selected rank, the selected bank, the selected row, and the selected one or more columns.

Patent Agency Ranking