PER KERNEL KMEANS COMPRESSION FOR NEURAL NETWORKS

    Publication No.: US20220027704A1

    Publication Date: 2022-01-27

    Application No.: US17366919

    Filing Date: 2021-07-02

    Abstract: Methods and apparatus relating to techniques for incremental network quantization. In an example, an apparatus comprises logic, at least partially comprising hardware logic, to determine a plurality of weights for a layer of a convolutional neural network (CNN) comprising a plurality of kernels; organize the plurality of weights into a plurality of clusters for the plurality of kernels; and apply a K-means compression algorithm to each of the plurality of clusters. Other embodiments are also disclosed and claimed.
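
    The three steps in the abstract (determine the weights, group them by kernel, apply K-means to each group) can be illustrated with a minimal sketch. It is not the claimed implementation. It assumes each kernel (output filter) forms its own group, that weights are clustered as scalars, and that a plain Lloyd's K-means with evenly spaced initial centroids is acceptable. The function names, the default k=4, and the iteration count are all hypothetical choices made for illustration.

```python
import numpy as np


def kmeans_1d(values, k, iters=20):
    """Plain Lloyd's k-means on a 1-D array; returns (centroids, labels)."""
    # Assumption: centroids start evenly spaced over the value range.
    centroids = np.linspace(values.min(), values.max(), k)
    for _ in range(iters):
        # Assign every weight to its nearest centroid.
        labels = np.abs(values[:, None] - centroids[None, :]).argmin(axis=1)
        # Move each centroid to the mean of its members.
        for j in range(k):
            members = values[labels == j]
            if members.size:  # an empty cluster keeps its old centroid
                centroids[j] = members.mean()
    # Final assignment against the updated centroids.
    labels = np.abs(values[:, None] - centroids[None, :]).argmin(axis=1)
    return centroids, labels


def compress_per_kernel(weights, k=4):
    """Build one k-entry codebook per kernel.

    weights: array of shape (num_kernels, ...), e.g. (out_ch, in_ch, h, w).
    Returns (codebooks, indices) with shapes (num_kernels, k) and
    (num_kernels, weights_per_kernel). Requires k <= 256 (uint8 indices).
    """
    num_kernels = weights.shape[0]
    flat = weights.reshape(num_kernels, -1)
    codebooks = np.empty((num_kernels, k), dtype=weights.dtype)
    indices = np.empty(flat.shape, dtype=np.uint8)
    for n in range(num_kernels):
        codebooks[n], indices[n] = kmeans_1d(flat[n], k)
    return codebooks, indices


def decompress(codebooks, indices, shape):
    """Look each index up in its own kernel's codebook and restore the shape."""
    restored = np.take_along_axis(codebooks, indices.astype(np.intp), axis=1)
    return restored.reshape(shape)
```

    In this sketch a layer with 8 kernels of shape 3x3x3 and k=4 is stored as 8 codebooks of 4 values each plus a 2-bit index per weight (held in uint8 here for simplicity). Because each kernel gets its own codebook, the centroids can follow that kernel's weight range rather than the range of the whole layer.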
