Invention Grant
- Patent Title: Compressed convolutional neural network models
-
Application No.: US16788261Application Date: 2020-02-11
-
Publication No.: US11651192B2Publication Date: 2023-05-16
- Inventor: James C. Gabriel , Mohammad Rastegari , Hessam Bagherinezhad , Saman Naderiparizi , Anish Prabhu , Sophie Lebrecht , Jonathan Gelsey , Sayyed Karen Khatamifard , Andrew L. Chronister , David Bakin , Andrew Z. Luo
- Applicant: Apple Inc.
- Applicant Address: US CA Cupertino
- Assignee: Apple Inc.
- Current Assignee: Apple Inc.
- Current Assignee Address: US CA Cupertino
- Agency: BakerHostetler
- Main IPC: G06N3/04
- IPC: G06N3/04 ; G06N3/08

Abstract:
Systems and processes for training and compressing a convolutional neural network model include the use of quantization and layer fusion. Quantized training data is passed through a convolutional layer of a neural network model to generate convolutional results during a first iteration of training the neural network model. The convolutional results are passed through a batch normalization layer of the neural network model to update normalization parameters of the batch normalization layer. The convolutional layer is fused with the batch normalization layer to generate a first fused layer and the fused parameters of the fused layer are quantized. The quantized training data is passed through the fused layer using the quantized fused parameters to generate output data, which may be quantized for a subsequent layer in the training iteration.
Public/Granted literature
- US20200257960A1 COMPRESSED CONVOLUTIONAL NEURAL NETWORK MODELS Public/Granted day:2020-08-13
Information query