Patent search ap:("BEIJING BAIDU NETCOM SCIENCE TECHNOLOGY CO. Page LTD.") AND inv:"Hao Dong"

1.

发明授权
Method and apparatus for compressing neural network model 有权

公开(公告)号：US11861498B2

公开(公告)日：2024-01-02

申请号：US17968688

申请日：2022-10-18

Applicant: BEIJING BAIDU NETCOM SCIENCE TECHNOLOGY CO., LTD.

Inventor： Guibin Wang , Shijun Cong , Hao Dong , Lei Jia

IPC: G06N3/08 , G06N3/04 , G06N3/045

CPC classification number: G06N3/08 , G06N3/045

Abstract: A method for compressing a neural network model includes acquiring a to-be-compressed neural network model. A first bit width, a second bit width and a target thinning rate corresponding to the to-be-compressed neural network model are determined. A target value is obtained according to the first bit width, the second bit width and the target thinning rate. Then the to-be-compressed neural network model is compressed using the target value, the first bit width and the second bit width to obtain a compression result of the to-be-compressed neural network model.

Patent Agency Ranking