Publication number: US20240104346A1
Publication date: 2024-03-28
Application number: US17945978
Filing date: 2022-09-15
Applicant: Huawei Technologies Co., Ltd.
Inventor: Lu HOU, Chaofan TAO, Wei ZHANG, Lifeng SHANG, Xin JIANG, Qun LIU, Li QIAN
IPC: G06N3/04
CPC classification number: G06N3/0454
Abstract: A method is provided for quantizing a neural network model performed by a processing system. The method comprises determining a scaling factor based on a distribution of weights associated with the neural network model, determining quantized weights based on the scaling factor and the weights associated with the distribution, determining a training loss of the neural network model based on the quantized weights during training of the neural network model, and determining an updated scaling factor for the neural network model based on a gradient of the training loss.
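Illustrative sketch (not taken from the patent text): the abstract names four steps but gives no formulas, so the code below is only one possible reading of them. Everything specific in it is an assumption: the mean-absolute-value initialization of the scaling factor, symmetric 4-bit rounding, a toy linear model with mean-squared-error loss, and a straight-through estimator for the gradient with respect to the scaling factor. The function names init_scale, quantize, scale_grad and train_step are hypothetical.

```python
import numpy as np


def _levels(n_bits):
    # Signed integer range for symmetric quantization (assumed scheme).
    return -(2 ** (n_bits - 1)), 2 ** (n_bits - 1) - 1


def init_scale(w, n_bits=4):
    # Step 1: scaling factor from the weight distribution.
    # Assumption: use the mean absolute weight as the distribution statistic.
    _, q_max = _levels(n_bits)
    return 2.0 * float(np.mean(np.abs(w))) / np.sqrt(q_max)


def quantize(w, scale, n_bits=4):
    # Step 2: quantized weights from the scaling factor and the weights.
    q_min, q_max = _levels(n_bits)
    return np.clip(np.round(w / scale), q_min, q_max) * scale


def scale_grad(w, scale, grad_wq, n_bits=4):
    # Step 4 (part): gradient of the loss w.r.t. the scaling factor.
    # Assumption: straight-through estimator, treating round() as identity
    # when differentiating, so d(w_q)/d(scale) = round(v) - v inside the
    # clipping range and the clip bound outside it.
    q_min, q_max = _levels(n_bits)
    v = w / scale
    d = np.where(v <= q_min, q_min, np.where(v >= q_max, q_max, np.round(v) - v))
    return float(np.sum(grad_wq * d))


def train_step(w, scale, x, y, lr=1e-3, n_bits=4):
    # Step 3: training loss computed with the quantized weights
    # (toy linear model with mean-squared error, chosen for illustration).
    w_q = quantize(w, scale, n_bits)
    err = x @ w_q - y
    loss = float(np.mean(err ** 2))
    grad_wq = 2.0 * (x.T @ err) / len(y)
    # Step 4: gradient-descent update of the scaling factor, kept positive.
    new_scale = scale - lr * scale_grad(w, scale, grad_wq, n_bits)
    return loss, max(new_scale, 1e-8)


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    w = rng.normal(size=8)
    x = rng.normal(size=(32, 8))
    y = x @ w
    s = init_scale(w)
    for _ in range(5):
        loss, s = train_step(w, s, x, y)
        print(f"loss={loss:.4f} scale={s:.4f}")
```

In this sketch only the scaling factor is updated; a full training loop would typically update the underlying weights in the same step, which the abstract does not describe either way.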