SPARSITY CONTROL BASED ON HARDWARE FOR DEEP-NEURAL NETWORKS

    Publication No.: US20190340511A1

    Publication Date: 2019-11-07

    Application No.: US16447216

    Filing Date: 2019-06-20

    Abstract: Systems, methods, computer program products, and apparatuses transform a weight space of an inference model to increase the compute efficiency of a target inference platform. A density of the weight space is determined, and a transformation parameter is derived from the determined density. The weight space is then re-ordered based on the transformation parameter to balance the compute load across the processing elements (PEs) of the target platform, thereby reducing PE idle time and/or stalls.
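
    The flow summarized in the abstract (measure density, derive a parameter, re-order to balance load) can be illustrated with a minimal sketch. It is not the claimed method: the abstract does not say how the transformation parameter is computed. The sketch assumes that each row of a 2-D weight matrix is handled by one PE, that a PE's work is proportional to the number of non-zero weights it receives, and that the "transformation parameter" is simply a grouping of row indices per PE, chosen by a greedy longest-processing-time heuristic. All function names here are hypothetical.

```python
import heapq


def density(weights):
    """Fraction of non-zero entries in a 2-D weight matrix (list of rows)."""
    total = sum(len(row) for row in weights)
    nonzero = sum(1 for row in weights for w in row if w != 0)
    return nonzero / total if total else 0.0


def contiguous_groups(num_rows, num_pes):
    """Baseline (assumed): give each PE a contiguous block of rows."""
    chunk = -(-num_rows // num_pes)  # ceiling division
    return [list(range(p * chunk, min((p + 1) * chunk, num_rows)))
            for p in range(num_pes)]


def balanced_groups(weights, num_pes):
    """Hypothetical 'transformation parameter': a per-PE row grouping.

    Rows are visited from densest to sparsest, and each row is given to
    the PE that currently has the fewest non-zero weights (greedy LPT).
    """
    nnz = [sum(1 for w in row if w != 0) for row in weights]
    order = sorted(range(len(weights)), key=lambda i: -nnz[i])
    heap = [(0, pe) for pe in range(num_pes)]  # (current load, PE id)
    groups = [[] for _ in range(num_pes)]
    for i in order:
        load, pe = heapq.heappop(heap)
        groups[pe].append(i)
        heapq.heappush(heap, (load + nnz[i], pe))
    return groups


def pe_loads(weights, groups):
    """Non-zero weights each PE must process under a given grouping."""
    return [sum(1 for i in rows for w in weights[i] if w != 0)
            for rows in groups]


# Small example: dense rows first, sparse rows last, two PEs.
W = [
    [1, 2, 3, 4],
    [5, 0, 6, 7],
    [0, 0, 0, 1],
    [0, 0, 0, 0],
]
baseline = contiguous_groups(len(W), 2)
balanced = balanced_groups(W, 2)
baseline_loads = pe_loads(W, baseline)
balanced_loads = pe_loads(W, balanced)
```

    In this example the contiguous split leaves one PE with most of the non-zero weights while the other is nearly idle; the re-ordered grouping gives both PEs the same amount of work. A real platform would also need to undo or account for the permutation when producing outputs, which this sketch omits.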
