- 专利标题: Quantizing autoencoders in a neural network
-
申请号: US16282210申请日: 2019-02-21
-
公开(公告)号: US11977388B2公开(公告)日: 2024-05-07
- 发明人: Jon Hasselgren , Jacob Munkberg
- 申请人: NVIDIA Corporation
- 申请人地址: US CA Santa Clara
- 专利权人: NVIDIA Corporation
- 当前专利权人: NVIDIA Corporation
- 当前专利权人地址: US CA Santa Clara
- 代理机构: Davis Wright Tremaine LLP
- 主分类号: G06N3/088
- IPC分类号: G06N3/088 ; G05B13/02 ; G05D1/00 ; G06N3/02 ; G06N3/04 ; G06N3/043 ; G06N3/045
摘要:
The performance of a neural network is improved by applying quantization to data at various points in the network. In an embodiment, a neural network includes two paths. A quantization is applied to each path, such that when an output from each path is combined, further quantization is not required. In an embodiment, the neural network is an autoencoder that includes at least one skip connection. In an embodiment, the system determines a set of quantization parameters based on the characteristics of the data in the primary path and in the skip connection, such that both network paths produce output data in the same fixed point format. As a result, the data from both network paths can be combined without requiring an additional quantization.
公开/授权文献
- US20200272162A1 QUANTIZING AUTOENCODERS IN A NEURAL NETWORK 公开/授权日:2020-08-27
信息查询