Publication (Announcement) No.: US11861498B2
Publication (Announcement) Date: 2024-01-02
Application No.: US17968688
Application Date: 2022-10-18
Inventor: Guibin Wang, Shijun Cong, Hao Dong, Lei Jia
Abstract: A method for compressing a neural network model includes acquiring a to-be-compressed neural network model. A first bit width, a second bit width and a target thinning rate corresponding to the to-be-compressed neural network model are determined. A target value is obtained according to the first bit width, the second bit width and the target thinning rate. Then the to-be-compressed neural network model is compressed using the target value, the first bit width and the second bit width to obtain a compression result of the to-be-compressed neural network model.
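The flow described in the abstract can be illustrated with a minimal sketch. The abstract does not state how the target value is computed or how it is applied, so everything specific here is an assumption made for illustration only: the first bit width is read as the storage width of one parameter, the second bit width as the width of a group ("unit") of parameters, and the target value as the number of smallest-magnitude parameters zeroed in each unit. The function names `target_value`, `quantize` and `compress` are hypothetical and do not come from the patent.

```python
def target_value(first_bit_width, second_bit_width, thinning_rate):
    """Hypothetical formula: how many parameters to zero in each unit.

    Assumes a unit holds second_bit_width // first_bit_width parameters.
    The patent abstract does not give this formula.
    """
    unit_size = second_bit_width // first_bit_width
    return int(unit_size * thinning_rate)


def quantize(value, bit_width, scale):
    """Symmetric linear quantization to a signed integer of bit_width bits."""
    qmax = 2 ** (bit_width - 1) - 1
    q = round(value / scale)
    return max(-qmax, min(qmax, q))


def compress(params, first_bit_width, second_bit_width, thinning_rate):
    """Prune and quantize a flat list of float parameters (illustrative only)."""
    unit_size = second_bit_width // first_bit_width
    n_zero = target_value(first_bit_width, second_bit_width, thinning_rate)

    # One shared scale so the largest magnitude maps to the largest integer.
    max_abs = max((abs(p) for p in params), default=0.0)
    qmax = 2 ** (first_bit_width - 1) - 1
    scale = max_abs / qmax if max_abs else 1.0

    compressed = []
    for start in range(0, len(params), unit_size):
        unit = params[start:start + unit_size]
        # Positions of the n_zero smallest-magnitude parameters in this unit.
        order = sorted(range(len(unit)), key=lambda i: abs(unit[i]))
        dropped = set(order[:n_zero])
        for i, value in enumerate(unit):
            if i in dropped:
                compressed.append(0)
            else:
                compressed.append(quantize(value, first_bit_width, scale))
    return compressed, scale


# Example: 8-bit parameters, 32-bit units (4 parameters each), 50% thinning.
weights = [0.1, -0.9, 0.5, 0.05, 1.0, -0.2, 0.3, 0.7]
quantized, scale = compress(weights, 8, 32, 0.5)
```

Under these assumptions, each unit of four parameters keeps its two largest-magnitude entries as 8-bit integers and zeroes the other two. A real implementation would follow the claims of the patent rather than this reading of the abstract.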