Invention Grant
- Patent Title: Neural network compression
- Application No.: US15892890
- Application Date: 2018-02-09
- Publication No.: US11928601B2
- Publication Date: 2024-03-12
- Inventor: Yair Alon, Elad Eban
- Applicant: Google LLC
- Applicant Address: US CA Mountain View
- Assignee: Google LLC
- Current Assignee: Google LLC
- Current Assignee Address: US CA Mountain View
- Agency: Fish & Richardson P.C.
- Main IPC: G06N3/084
- IPC: G06N3/084; G06N3/044; G06N3/08; G06N7/01; G06N20/00; G06N3/063

Abstract:
Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for neural network compression. In one aspect, a method comprises receiving a neural network and identifying a particular set of multiple weights of the neural network. Multiple anchor points are determined based on current values of the particular set of weights of the neural network. The neural network is trained by, at each of multiple training iterations, performing operations comprising adjusting the values of the particular set of weights by backpropagating gradients of a loss function. The loss function comprises a first loss function term based on a prediction accuracy of the neural network and a second loss function term based on a similarity of the current values of the particular set of weights to the anchor points. After training, the values of the particular set of weights are quantized based on the anchor points.
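The procedure in the abstract can be illustrated with a minimal Python sketch. This is not the patented implementation. Several details are assumptions made only for illustration: the "neural network" is a single linear layer, anchor points are spaced evenly between the smallest and largest current weight, the second loss term is the mean squared distance from each weight to its nearest anchor, and the names `choose_anchor_points`, `anchor_penalty`, `train`, `quantize`, and the hyperparameters `lam`, `lr`, `steps` are hypothetical.

```python
import random


def choose_anchor_points(weights, k):
    """Pick k anchors evenly spaced over the current weight range (assumed rule)."""
    lo, hi = min(weights), max(weights)
    if k == 1 or lo == hi:
        return [(lo + hi) / 2.0]
    step = (hi - lo) / (k - 1)
    return [lo + i * step for i in range(k)]


def nearest_anchor(w, anchors):
    """Return the anchor closest to weight w."""
    return min(anchors, key=lambda a: abs(w - a))


def anchor_penalty(weights, anchors):
    """Second loss term (assumed form): mean squared distance to the nearest anchor."""
    return sum((w - nearest_anchor(w, anchors)) ** 2 for w in weights) / len(weights)


def predict(weights, x):
    """Single linear layer standing in for the network."""
    return sum(w * xi for w, xi in zip(weights, x))


def train(weights, data, anchors, lam=0.5, lr=0.05, steps=200):
    """Gradient descent on: mean squared prediction error + lam * anchor_penalty."""
    weights = list(weights)
    n = len(weights)
    for _ in range(steps):
        grad = [0.0] * n
        # Gradient of the first term (prediction accuracy).
        for x, y in data:
            err = predict(weights, x) - y
            for j in range(n):
                grad[j] += 2.0 * err * x[j] / len(data)
        # Gradient of the second term, holding each weight's nearest anchor fixed.
        for j in range(n):
            grad[j] += lam * 2.0 * (weights[j] - nearest_anchor(weights[j], anchors)) / n
        weights = [w - lr * g for w, g in zip(weights, grad)]
    return weights


def quantize(weights, anchors):
    """After training, snap every weight to its nearest anchor."""
    return [nearest_anchor(w, anchors) for w in weights]


# Toy usage: synthetic regression data generated from made-up target weights.
rng = random.Random(0)
target = [0.9, -0.4, 0.1, 0.5]
data = []
for _ in range(64):
    x = [rng.uniform(-1, 1) for _ in target]
    data.append((x, predict(target, x)))

initial = [rng.uniform(-1, 1) for _ in target]
anchors = choose_anchor_points(initial, k=4)  # anchors come from current values
trained = train(initial, data, anchors)
compressed = quantize(trained, anchors)
```

In this sketch each compressed weight is one of only `k` anchor values, so it could be stored as a small index into the anchor list rather than as a full floating-point number. The weight `lam` trades prediction accuracy against closeness to the anchors.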
Public/Granted literature
- US20190251445A1 NEURAL NETWORK COMPRESSION (Public/Granted day: 2019-08-15)