Invention Grant
- Patent Title: Method and apparatus for compressing neural network model
-
Application No.: US17968688Application Date: 2022-10-18
-
Publication No.: US11861498B2Publication Date: 2024-01-02
- Inventor: Guibin Wang , Shijun Cong , Hao Dong , Lei Jia
- Applicant: BEIJING BAIDU NETCOM SCIENCE TECHNOLOGY CO., LTD.
- Applicant Address: CN Beijing
- Assignee: BEIJING BAIDU NETCOM SCIENCE TECHNOLOGY CO., LTD.
- Current Assignee: BEIJING BAIDU NETCOM SCIENCE TECHNOLOGY CO., LTD.
- Current Assignee Address: CN Beijing
- Agency: Brooks Kushman P.C.
- Priority: CN 2111457675.5 2021.12.02
- Main IPC: G06N3/08
- IPC: G06N3/08 ; G06N3/04 ; G06N3/045

Abstract:
A method for compressing a neural network model includes acquiring a to-be-compressed neural network model. A first bit width, a second bit width and a target thinning rate corresponding to the to-be-compressed neural network model are determined. A target value is obtained according to the first bit width, the second bit width and the target thinning rate. Then the to-be-compressed neural network model is compressed using the target value, the first bit width and the second bit width to obtain a compression result of the to-be-compressed neural network model.
Public/Granted literature
- US20230177326A1 METHOD AND APPARATUS FOR COMPRESSING NEURAL NETWORK MODEL Public/Granted day:2023-06-08
Information query