APPARATUS AND A METHOD FOR NEURAL NETWORK COMPRESSION
Abstract:
There is provided an apparatus comprising means for training a neural network, wherein the training comprises applying a loss function configured to increase sparsity of a weight tensor of the neural network and to cause a plurality of non-zero elements of the weight tensor to be substantially equal to each other; and means for entropy coding the weight tensor to obtain a compressed neural network.
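The two properties named in the abstract, sparsity and substantially equal non-zero elements, can be illustrated with a minimal sketch. The abstract does not disclose the form of the loss function, so the penalty below (an L1 term plus the variance of the surviving magnitudes) is an assumption chosen only for illustration. The names `sparsity_equality_penalty`, `quantize`, `entropy_bits_per_symbol`, and the parameters `threshold`, `lam_sparse`, `lam_equal`, and `step` are likewise hypothetical. The empirical Shannon entropy stands in for the entropy coder, since it is the lower bound on the average number of bits per symbol such a coder can reach.

```python
import math
from collections import Counter


def sparsity_equality_penalty(weights, threshold=1e-3,
                              lam_sparse=1.0, lam_equal=1.0):
    """Hypothetical regulariser, not the loss disclosed in the patent.

    The L1 term pushes weights towards zero (sparsity); the variance
    term pushes the magnitudes of the remaining weights towards a
    common value (substantially equal non-zero elements).
    """
    l1 = sum(abs(w) for w in weights)
    mags = [abs(w) for w in weights if abs(w) > threshold]
    if mags:
        mean = sum(mags) / len(mags)
        spread = sum((m - mean) ** 2 for m in mags) / len(mags)
    else:
        spread = 0.0
    return lam_sparse * l1 + lam_equal * spread


def quantize(weights, step=0.1):
    """Map each weight to an integer symbol by uniform quantisation."""
    return [round(w / step) for w in weights]


def entropy_bits_per_symbol(symbols):
    """Empirical Shannon entropy of a symbol sequence, in bits per symbol."""
    counts = Counter(symbols)
    n = len(symbols)
    return -sum(c / n * math.log2(c / n) for c in counts.values())


# A flattened weight tensor before and after (assumed) regularised training.
dense = [0.31, -0.72, 0.18, 0.52, -0.43, 0.09, 0.64, -0.27]
compressible = [0.0, -0.5, 0.0, 0.5, 0.0, 0.0, 0.5, 0.0]

for name, w in (("dense", dense), ("compressible", compressible)):
    penalty = sparsity_equality_penalty(w)
    bits = entropy_bits_per_symbol(quantize(w))
    print(f"{name}: penalty={penalty:.3f}, entropy={bits:.3f} bits/symbol")
```

In this toy comparison the sparse tensor with equal-magnitude non-zero elements scores a lower penalty and a lower entropy than the dense one, which is why such a tensor is cheaper to entropy code. The threshold mask is not differentiable, so the function above only evaluates the penalty; a trainable version would need a smooth surrogate.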