- Patent title: ASYMMETRIC QUANTIZATION FOR COMPRESSION AND FOR ACCELERATION OF INFERENCE FOR NEURAL NETWORKS
- Application number: US16877582
- Filing date: 2020-05-19
- Publication number: US20210004679A1
- Publication date: 2021-01-07
- Inventors: Yingzhen YANG, Zhibiao ZHAO, Baoxin ZHAO, Jun HUAN, Jian OUYANG, Yong WANG, Jiaxin SHI
- Applicant: Baidu USA, LLC
- Applicant address: US CA Sunnyvale
- Assignee: Baidu USA, LLC
- Current assignee: Baidu USA, LLC
- Current assignee address: US CA Sunnyvale
- Main classification: G06N3/08
- IPC classification: G06N3/08; G06N3/04
Abstract:
Presented herein are embodiments of improved asymmetric quantization (IAQ). IAQ embodiments combine the benefits of conventional asymmetric quantization and symmetric quantization while providing additional computational efficiency. Because IAQ embodiments adopt an asymmetric range for the weights of a neural network layer, they circumvent the symmetric-range limitation of symmetric quantization. Moreover, by quantizing an offset value of each layer, an IAQ embodiment makes inference of the quantized neural network much faster than that of a neural network quantized by conventional asymmetric quantization.
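
The abstract does not spell out the quantization formulas, so the following is only a rough sketch of the three ideas it contrasts, not the method claimed in the patent. It assumes 8-bit codes and per-layer min/max ranges. For the "quantized offset" it substitutes the integer zero point used in common affine quantization schemes. All function names are hypothetical.

```python
def symmetric_quantize(weights, bits=8):
    """Symmetric range [-peak, peak]: no offset, but wasteful for skewed weights."""
    qmax = 2 ** (bits - 1) - 1
    peak = max(abs(w) for w in weights)
    scale = peak / qmax if peak > 0 else 1.0
    return [round(w / scale) for w in weights], scale


def asymmetric_quantize(weights, bits=8):
    """Asymmetric range [lo, hi] with a floating-point offset: w ~ scale * code + lo."""
    levels = 2 ** bits - 1
    lo, hi = min(weights), max(weights)
    scale = (hi - lo) / levels if hi > lo else 1.0
    codes = [round((w - lo) / scale) for w in weights]
    return codes, scale, lo


def asymmetric_quantize_int_offset(weights, bits=8):
    """Asymmetric range with the offset itself quantized to an integer zero point.

    Assumption (not from the patent text): w ~ scale * (code - zero_point).
    """
    levels = 2 ** bits - 1
    lo, hi = min(weights), max(weights)
    scale = (hi - lo) / levels if hi > lo else 1.0
    zero_point = round(-lo / scale)
    # Clamp because rounding the offset can push an extreme code one step out of range.
    codes = [min(levels, max(0, round(w / scale) + zero_point)) for w in weights]
    return codes, scale, zero_point


def int_dot(codes, zero_point, scale, activations):
    """Dot product with integer accumulation; the scale is applied once at the end."""
    acc = sum(c * x for c, x in zip(codes, activations))
    correction = zero_point * sum(activations)
    return scale * (acc - correction)


if __name__ == "__main__":
    layer_weights = [0.1, 0.5, 1.2, 2.0]  # skewed: every weight is positive
    codes, scale, zero_point = asymmetric_quantize_int_offset(layer_weights)
    _, sym_scale = symmetric_quantize(layer_weights)
    print("asymmetric step:", scale, "symmetric step:", sym_scale)
    print("integer dot product:", int_dot(codes, zero_point, scale, [3, -1, 4, 2]))
```

In this sketch the asymmetric range gives a finer step than the symmetric one when the weights are skewed. With an integer offset, the offset correction stays in integer arithmetic until the final multiplication by the scale.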