Publication No.: US20220237513A1
Publication Date: 2022-07-28
Application No.: US17587291
Filing Date: 2022-01-28
Applicant: Samsung Electronics Co., Ltd.
Inventor: Wenlong HE, Ihor VASYLTSOV, Gang SUN, Duanhui LIU
Abstract: A method with quantization for a deep learning model includes: determining a second model by quantizing a first model based on a quantization parameter; determining a real value of a multi-optimization target parameter by testing the second model; calculating a loss function based on the real value of the multi-optimization target parameter, an expected value of the multi-optimization target parameter, and a constraint value of the multi-optimization target parameter; updating the quantization parameter based on the loss function and using the second model as the first model; iteratively executing the foregoing operations until a preset condition is satisfied; and in response to the preset condition being satisfied, determining an optimal quantization parameter and using, as a final quantization model, the first model that executes quantization based on the optimal quantization parameter.
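
The loop described in the abstract can be sketched as follows. This is a minimal illustration under stated assumptions, not the claimed implementation: the model is a plain list of weights, the quantization parameter is a single bit width, the optimization targets are mean absolute error and size in bits, and the loss formula, the update rule (lower the bit width by one), and the stopping condition (loss stops improving) are all hypothetical choices. The names `Target`, `quantize`, `measure`, `loss_fn`, and `search` do not come from the source.

```python
from dataclasses import dataclass


@dataclass
class Target:
    expected: float    # value we would like to reach (lower is better here)
    constraint: float  # worst value we are willing to accept


def quantize(weights, bits):
    """Uniform symmetric quantization of a weight list to `bits` bits (bits >= 2)."""
    levels = 2 ** (bits - 1) - 1
    scale = (max(abs(w) for w in weights) / levels) or 1.0  # avoid a zero scale
    return [round(w / scale) * scale for w in weights]


def measure(reference, quantized, bits):
    """Stand-in for testing the second model: real value of each target."""
    error = sum(abs(r - q) for r, q in zip(reference, quantized)) / len(reference)
    return {"error": error, "size_bits": float(bits * len(quantized))}


def loss_fn(real, targets, penalty=100.0):
    """Assumed loss: gap to the expected value plus a penalty past the constraint."""
    total = 0.0
    for name, t in targets.items():
        span = t.constraint - t.expected  # normalizes targets with different units
        total += max(0.0, real[name] - t.expected) / span
        total += penalty * max(0.0, real[name] - t.constraint) / span
    return total


def search(reference, targets, start_bits=8, min_bits=2):
    """Iterate quantize -> test -> loss -> update until the loss stops improving."""
    model, bits = list(reference), start_bits
    best_bits, best_loss, best_model = None, float("inf"), None
    while bits >= min_bits:
        second = quantize(model, bits)            # second model from first model
        real = measure(reference, second, bits)   # real values of the targets
        loss = loss_fn(real, targets)
        if loss >= best_loss:                     # assumed preset condition
            break
        best_bits, best_loss, best_model = bits, loss, second
        model = second                            # second model becomes first model
        bits -= 1                                 # assumed parameter update rule
    return best_bits, best_loss, best_model


weights = [0.5, -1.0, 0.25, 0.75]
targets = {
    "error": Target(expected=0.0, constraint=0.05),
    "size_bits": Target(expected=8.0, constraint=32.0),
}
bits, loss, model = search(weights, targets)
print(bits, round(loss, 4), model)
```

Normalizing each term by the gap between the constraint and the expected value keeps targets with different units comparable in one scalar loss; a real system would replace `measure` with an actual evaluation of the quantized network.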