CLUSTER COMPRESSION FOR COMPRESSING WEIGHTS IN NEURAL NETWORKS

    公开(公告)号:US20220343169A1

    公开(公告)日:2022-10-27

    申请号:US17811186

    申请日:2022-07-07

    申请人: Recogni Inc.

    IPC分类号: G06N3/08 G06N3/04

    摘要: A method for instantiating a convolutional neural network on a computing system. The convolutional neural network includes a plurality of layers, and instantiating the convolutional neural network includes training the convolutional neural network using a first loss function until a first classification accuracy is reached, clustering a set of F×K kernels of the first layer into a set of C clusters, training the convolutional neural network using a second loss function until a second classification accuracy is reached, creating a dictionary which maps each of a number of centroids to a corresponding centroid identifier, quantizing and compressing F filters of the first layer, storing F quantized and compressed filters of the first layer in a memory of the computing system, storing F biases of the first layer in the memory, and classifying data received by the convolutional neural network.

    Cluster compression for compressing weights in neural networks

    公开(公告)号:US11468316B2

    公开(公告)日:2022-10-11

    申请号:US16273592

    申请日:2019-02-12

    申请人: Recogni Inc.

    IPC分类号: G06N3/08 G06N3/04

    摘要: A method for instantiating a convolutional neural network on a computing system. The convolutional neural network includes a plurality of layers, and instantiating the convolutional neural network includes training the convolutional neural network using a first loss function until a first classification accuracy is reached, clustering a set of F×K kernels of the first layer into a set of C clusters, training the convolutional neural network using a second loss function until a second classification accuracy is reached, creating a dictionary which maps each of a number of centroids to a corresponding centroid identifier, quantizing and compressing F filters of the first layer, storing F quantized and compressed filters of the first layer in a memory of the computing system, storing F biases of the first layer in the memory, and classifying data received by the convolutional neural network.

    CLUSTER COMPRESSION FOR COMPRESSING WEIGHTS IN NEURAL NETWORKS

    公开(公告)号:US20190286980A1

    公开(公告)日:2019-09-19

    申请号:US16273592

    申请日:2019-02-12

    申请人: Recogni Inc.

    IPC分类号: G06N3/08 G06N3/04

    摘要: A method for instantiating a convolutional neural network on a computing system. The convolutional neural network includes a plurality of layers, and instantiating the convolutional neural network includes training the convolutional neural network using a first loss function until a first classification accuracy is reached, clustering a set of F×K kernels of the first layer into a set of C clusters, training the convolutional neural network using a second loss function until a second classification accuracy is reached, creating a dictionary which maps each of a number of centroids to a corresponding centroid identifier, quantizing and compressing F filters of the first layer, storing F quantized and compressed filters of the first layer in a memory of the computing system, storing F biases of the first layer in the memory, and classifying data received by the convolutional neural network.