Quantization parameter optimization method and quantization parameter optimization device

    公开(公告)号:US11748600B2

    公开(公告)日:2023-09-05

    申请号:US17014699

    申请日:2020-09-08

    申请人: SOCIONEXT INC.

    发明人: Yukihiro Sasagawa

    摘要: A quantization parameter optimization method includes: determining a cost function in which a regularization term is added to an error function, the regularization term being a function of a quantization error that is an error between a weight parameter of a neural network and a quantization parameter that is a quantized weight parameter; updating the quantization parameter by use of the cost function; and determining, as an optimized quantization parameter of a quantization neural network, the quantization parameter with which a function value derived from the cost function satisfies a predetermined condition, the optimized quantization parameter being obtained as a result of repeating the updating, the quantization neural network being the neural network, the weight parameter of which has been quantized, wherein the function value derived from the regularization term and an inference accuracy of the quantization neural network are negatively correlated.