APPARATUS AND METHOD FOR A MASKED MULTIPLY INSTRUCTION TO SUPPORT NEURAL NETWORK PRUNING OPERATIONS

    公开(公告)号:US20190121837A1

    公开(公告)日:2019-04-25

    申请号:US16230814

    申请日:2018-12-21

    IPC分类号: G06F17/16 G06N3/02

    摘要: An apparatus and method for a masked multiply instruction to support neural network pruning operations. For example, one embodiment of a processor comprises: a decoder to decode a matrix multiplication with masking (GEMM) instruction identifying a destination matrix register to store a result, and source registers storing an A-matrix, a B-matrix, and a matrix mask; execution circuitry to execute the GEMM instruction, the execution circuitry to multiply a plurality of B-matrix elements with a plurality of A-matrix elements, each of the B-matrix elements associated with a mask value in the matrix mask, wherein if the mask value is set to a first value, then the execution circuitry is to multiply the B-matrix element with one or more of the A-matrix elements to generate a first partial result, and if the mask value is set to a second value, then the execution circuitry is to multiply an alternate B-matrix element with a one or more of the A-matrix elements to generate a second partial result.

    PHYSICAL PAGE TRACKING FOR HANDLING OVERCOMMITTED MEMORY

    公开(公告)号:US20190354487A1

    公开(公告)日:2019-11-21

    申请号:US15980523

    申请日:2018-05-15

    IPC分类号: G06F12/1009

    摘要: A system for computer memory management that implements a memory pool table, the memory pool table including entries that describe a plurality of memory pools, each memory pool representing a group of memory pages related by common attributes; a per-page tracking table, each entry in the per-page tracking table used to related a memory page with a memory pool of the memory pool table; and processing circuitry to: scan each entry in the per-page tracking table and, for each entry: determine an amount of memory released if the memory page related with the entry is swapped; aggregate the amount of memory for the respective memory pool related with the memory page related with the entry in the per-page tracking table, to produce a per-pool memory aggregate; and output the per-pool memory aggregate for the memory pools related with the memory pages in the per-page tracking table.