- Patent title: Compression of deep neural networks
- Application number: US16351712
- Filing date: 2019-03-13
- Publication number: US11966837B2
- Publication date: 2024-04-23
- Inventors: Dzung Phan, Lam Nguyen, Nam H. Nguyen, Jayant R. Kalagnanam
- Applicant: International Business Machines Corporation
- Applicant address: US NY Armonk
- Assignee: International Business Machines Corporation
- Current assignee: International Business Machines Corporation
- Current assignee address: US NY Armonk
- Agent: Stephanie L. Carusillo
- Main classification: G06N3/08
- IPC classification: G06N3/08; G06N3/047; H03M7/30
Abstract:
In an approach for compressing a neural network, a processor receives a neural network, wherein the neural network has been trained on a set of training data. A processor receives a compression ratio. A processor compresses the neural network based on the compression ratio using an optimization model to solve for sparse weights. A processor re-trains the compressed neural network with the sparse weights. A processor outputs the re-trained neural network.
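The steps in the abstract (receive a trained network and a compression ratio, solve for sparse weights, re-train, output) can be sketched roughly as follows. This is only an illustration under stated assumptions, not the patented method: the abstract does not specify the optimization model, so the sketch substitutes the simplest one, keeping the largest-magnitude weights, which is the closed-form minimizer of the squared distance to the original weights under a limit on the number of nonzeros. The "network" is a single linear layer, the compression ratio is assumed to mean original weight count divided by retained weight count, and all function names are hypothetical.

```python
# Hypothetical sketch: none of these names or choices come from the patent.

def target_nonzeros(num_weights, compression_ratio):
    """Weights to keep, ASSUMING ratio = original count / retained count."""
    return max(1, int(num_weights / compression_ratio))


def sparsify(weights, compression_ratio):
    """Keep the k largest-magnitude weights and zero the rest.

    Stand-in optimization model (an assumption): this is the solution of
        minimize ||w - weights||^2  subject to  w has at most k nonzeros.
    """
    k = target_nonzeros(len(weights), compression_ratio)
    order = sorted(range(len(weights)), key=lambda i: abs(weights[i]), reverse=True)
    keep = set(order[:k])
    mask = [i in keep for i in range(len(weights))]
    sparse = [w if m else 0.0 for w, m in zip(weights, mask)]
    return sparse, mask


def retrain(weights, mask, xs, ys, lr=0.01, epochs=200):
    """Gradient descent on squared error for y ~ w . x; pruned weights stay zero."""
    w = list(weights)
    n = len(xs)
    for _ in range(epochs):
        grad = [0.0] * len(w)
        for x, y in zip(xs, ys):
            err = sum(wi * xi for wi, xi in zip(w, x)) - y
            for j, xj in enumerate(x):
                grad[j] += 2.0 * err * xj / n
        # Update only the retained weights; masked positions are held at zero.
        w = [wi - lr * g if m else 0.0 for wi, g, m in zip(w, grad, mask)]
    return w


def compress(weights, compression_ratio, xs, ys):
    """Sparsify to the requested ratio, then re-train the surviving weights."""
    sparse, mask = sparsify(weights, compression_ratio)
    return retrain(sparse, mask, xs, ys)
```

For example, `sparsify([0.9, -0.05, 0.02, -1.1], 2.0)` retains the two largest-magnitude weights and zeroes the other two; `compress` then fine-tunes only the retained weights on the supplied data.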
Published/granted documents
- US20200293876A1 COMPRESSION OF DEEP NEURAL NETWORKS, publication/grant date: 2020-09-17