METHOD AND APPARATUS WITH OPTIMIZATION FOR DEEP LEARNING MODEL

    公开(公告)号:US20220237513A1

    公开(公告)日:2022-07-28

    申请号:US17587291

    申请日:2022-01-28

    Abstract: A method with quantization for a deep learning model includes: determining a second model by quantizing a first model based on a quantization parameter; determining a real value of multi optimization target parameter by testing the second model; calculating a loss function based on the real value of the multi optimization target parameter, an expected value of the multi optimization target parameter, and a constraint value of the multi optimization target parameter; updating the quantization parameter based on the loss function and using the second model as the first model; iteratively executing the foregoing operations until a preset condition is satisfied; and in response to the preset condition being satisfied, determining an optimal quantization parameter and using, as a final quantization model, the first model that executes quantization based on the optimal quantization parameter.

Patent Agency Ranking