Per kernel Kmeans compression for neural networks

Invention Grant

US11055604B2 Per kernel Kmeans compression for neural networks (Granted, in force)

Patent Title: Per kernel Kmeans compression for neural networks
Application No.: US15702193

Application Date: 2017-09-12
Publication No.: US11055604B2

Publication Date: 2021-07-06
Inventor: Yonatan Glesner, Gal Novik, Dmitri Vainbrand, Gal Leibovich
Applicant: Intel Corporation
Applicant Address: US CA Santa Clara
Assignee: Intel Corporation
Current Assignee: Intel Corporation
Current Assignee Address: US CA Santa Clara
Agency: Jaffery Watson Mendonsa & Hamilton LLP
Main IPC: G06N3/04
IPC: G06N3/04; G06F3/06

Abstract:

Methods and apparatus relating to techniques for incremental network quantization. In an example, an apparatus comprises logic, at least partially comprising hardware logic, to determine a plurality of weights for a layer of a convolutional neural network (CNN) comprising a plurality of kernels; organize the plurality of weights into a plurality of clusters for the plurality of kernels; and apply a K-means compression algorithm to each of the plurality of clusters. Other embodiments are also disclosed and claimed.
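
The abstract describes grouping a layer's weights by kernel and running K-means on each group separately. The following is a minimal sketch of one way that idea could look in software, not the claimed apparatus or the inventors' implementation. It assumes each kernel is already flattened to a list of floats, uses plain Lloyd's K-means on scalars with an evenly spaced initialization, and stores each kernel as a small codebook plus one index per weight. The names kmeans_1d, compress_layer and decompress_layer are hypothetical and chosen only for illustration.

```python
from typing import List, Tuple

Kernel = List[float]                          # one kernel's weights, flattened (assumption)
Compressed = Tuple[List[float], List[int]]    # (codebook, codebook index per weight)


def _nearest(values: List[float], centroids: List[float]) -> List[int]:
    """Index of the closest centroid for every value."""
    return [min(range(len(centroids)), key=lambda c: abs(v - centroids[c]))
            for v in values]


def kmeans_1d(values: List[float], k: int, iters: int = 20) -> Compressed:
    """Plain Lloyd's K-means on scalars with a deterministic initialization."""
    lo, hi = min(values), max(values)
    # Assumed init: k centroids spread evenly across the kernel's value range.
    centroids = [lo + (hi - lo) * (i + 0.5) / k for i in range(k)]
    for _ in range(iters):
        assign = _nearest(values, centroids)
        updated = []
        for c in range(k):
            members = [v for v, a in zip(values, assign) if a == c]
            # An empty cluster keeps its previous centroid.
            updated.append(sum(members) / len(members) if members else centroids[c])
        if updated == centroids:
            break
        centroids = updated
    # Recompute assignments so they match the final centroids.
    return centroids, _nearest(values, centroids)


def compress_layer(kernels: List[Kernel], k: int) -> List[Compressed]:
    """Run K-means independently on each kernel, giving one codebook per kernel."""
    return [kmeans_1d(kernel, k) for kernel in kernels]


def decompress_layer(packed: List[Compressed]) -> List[Kernel]:
    """Rebuild approximate weights by looking each index up in its kernel's codebook."""
    return [[codebook[i] for i in indices] for codebook, indices in packed]
```

In this sketch, a layer such as [[0.1, 0.11, 0.9, 0.92], [-1.0, -0.98, 2.0, 2.02]] compressed with k=2 yields two independent codebooks, so the second kernel's wider value range does not affect how finely the first kernel is quantized. That independence is the motivation one would expect for clustering per kernel rather than over the whole layer; the record itself does not state the rationale.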

Public/Granted literature

US20190080222A1 PER KERNEL KMEANS COMPRESSION FOR NEURAL NETWORKS Publication/grant date: 2019-03-14

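IPC Classification:

G	Physics
G06	Computing; calculating or counting
G06N	Computer systems based on specific computational models
G06N3/00	Computer systems based on biological models
G06N3/02	.Using neural network models
G06N3/04	..Architecture, e.g. interconnection topology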