DYNAMIC SPARSITY-BASED ACCELERATION OF NEURAL NETWORKS

    公开(公告)号:US20240119269A1

    公开(公告)日:2024-04-11

    申请号:US18543356

    申请日:2023-12-18

    IPC分类号: G06N3/048

    CPC分类号: G06N3/048

    摘要: A deep neural network (DNN) accelerator may facilitate dynamic sparsity-based acceleration and operate in various sparsity modes including a combined sparsity mode, a weight sparsity mode, an activation sparsity mode, and a dense mode. The DNN accelerator may receive a configuration parameter indicating whether to accelerate the layer based on sparsity in a weight tensor of the layer. The configuration parameter may be generated offline, e.g., before the execution of the DNN is started. The DNN accelerator computes one or more activations of the layer in a previous layer in the DNN. The one or more activations are one or more elements of an activation tensor of the layer. The DNN accelerator may determine a sparsity mode for the layer based on the configuration parameter and sparsity in the activation tensor. One or more sparse cells in the DNN accelerator may execute the layer in the sparsity mode.