Publication No.: US20220292337A1
Publication Date: 2022-09-15
Application No.: US17832303
Filing Date: 2022-06-03
Inventors: Chao TIAN, Lei JIA, Xiaoping YAN, Junhui WEN, Guanglai DENG, Qiang LI
Abstract: A neural network processing method, a neural network processing unit (NPU), and a processing device are provided. The method includes: obtaining, by a quantizing unit in the NPU, float-type input data, quantizing the float-type input data to obtain quantized input data, and providing the quantized input data to an operation unit; performing, by the operation unit of the NPU, a matrix-vector operation and/or a convolution operation on the quantized input data to obtain an operation result; and performing, by the quantizing unit, inverse quantization on the operation result output by the operation unit to obtain an inverse quantization result.
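
The three steps in the abstract (quantize, integer operation, inverse quantization) can be illustrated with a minimal software sketch. This is not the patented hardware design: the abstract does not specify the bit width, the quantization scheme, or how weights are stored, so symmetric 8-bit quantization with a single per-tensor scale and pre-quantized weights are assumptions made here for illustration. All function names (quantize, int_matvec, dequantize, npu_matvec) are hypothetical.

```python
from typing import List, Tuple


def quantize(values: List[float], bits: int = 8) -> Tuple[List[int], float]:
    """Assumed scheme: symmetric per-tensor quantization to signed integers."""
    qmax = 2 ** (bits - 1) - 1  # 127 for 8 bits
    max_abs = max((abs(v) for v in values), default=0.0)
    # Fall back to a scale of 1.0 so that all-zero input does not divide by zero.
    scale = max_abs / qmax if max_abs > 0 else 1.0
    q = [max(-qmax, min(qmax, round(v / scale))) for v in values]
    return q, scale


def int_matvec(weights_q: List[List[int]], x_q: List[int]) -> List[int]:
    """Integer-only matrix-vector product, standing in for the operation unit."""
    return [sum(w * x for w, x in zip(row, x_q)) for row in weights_q]


def dequantize(acc: List[int], scale: float) -> List[float]:
    """Inverse quantization: map integer accumulators back to floats."""
    return [a * scale for a in acc]


def npu_matvec(weights_q: List[List[int]], weight_scale: float,
               x_float: List[float]) -> List[float]:
    """Quantize the float input, multiply in integers, then dequantize."""
    x_q, x_scale = quantize(x_float)
    acc = int_matvec(weights_q, x_q)
    # The accumulator carries both scales, so they are multiplied together.
    return dequantize(acc, x_scale * weight_scale)
```

In this sketch the result approximates the float matrix-vector product, with an error bounded by the rounding done in the quantization step. A convolution could be handled the same way by replacing int_matvec with an integer convolution.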