Invention Grant
- Patent Title: Error tolerant neural network model compression
-
Application No.: US14581969Application Date: 2014-12-23
-
Publication No.: US10229356B1Publication Date: 2019-03-12
- Inventor: Baiyang Liu , Michael Reese Bastian , Bjorn Hoffmeister , Sankaran Panchapagesan , Ariya Rastrow
- Applicant: Amazon Technologies, Inc.
- Applicant Address: US WA Seattle
- Assignee: Amazon Technologies, Inc.
- Current Assignee: Amazon Technologies, Inc.
- Current Assignee Address: US WA Seattle
- Agency: Knobbe, Martens, Olson & Bear, LLP
- Main IPC: G06N3/08
- IPC: G06N3/08

Abstract:
Features are disclosed for error tolerant model compression. Such features could be used to reduce the size of a deep neural network model including several hidden node layers. The size reduction in an error tolerant fashion ensures predictive applications relying on the model do not experience performance degradation due to model compression. Such predictive applications include automatic recognition of speech, image recognition, and recommendation engines. Partially quantized models are re-trained such that any degradation of accuracy is “trained out” of the model providing improved error tolerance with compression.
Information query