METHOD AND APPARATUS FOR COMPRESSING NEURAL NETWORK MODEL

Invention Publication

US20230177326A1 METHOD AND APPARATUS FOR COMPRESSING NEURAL NETWORK MODEL 审中-公开

Please log in to see more content

Patent Title: METHOD AND APPARATUS FOR COMPRESSING NEURAL NETWORK MODEL
Application No.: US17968688

Application Date: 2022-10-18
Publication No.: US20230177326A1

Publication Date: 2023-06-08
Inventor: Guibin WANG , Shijun CONG , Hao DONG , Lei JIA
Applicant: BEIJING BAIDU NETCOM SCIENCE TECHNOLOGY CO., LTD.
Applicant Address: CN Beijing
Assignee: BEIJING BAIDU NETCOM SCIENCE TECHNOLOGY CO., LTD.
Current Assignee: BEIJING BAIDU NETCOM SCIENCE TECHNOLOGY CO., LTD.
Current Assignee Address: CN Beijing
Priority: CN 2111457675.5 2021.12.02
Main IPC: G06N3/08
IPC: G06N3/08 ; G06N3/04

METHOD AND APPARATUS FOR COMPRESSING NEURAL NETWORK MODEL

Abstract:

A technical solution for compressing a neural network model which relates to the field of artificial intelligence technologies, such as deep learning technologies, cloud service technologies, is disclosed. The method for compressing a neural network model includes: acquiring a to-be-compressed neural network model; determining a first bit width, a second bit width and a target thinning rate corresponding to the to-be-compressed neural network model; obtaining a target value according to the first bit width, the second bit width and the target thinning rate; and compressing the to-be-compressed neural network model using the target value, the first bit width and the second bit width to obtain a compression result of the to-be-compressed neural network model.

Public/Granted literature

US11861498B2 Method and apparatus for compressing neural network model Public/Granted day:2024-01-02

Information query

Global Dossier Espacenet

IPC分类:

G	物理
G06	计算；推算或计数
G06N	基于特定计算模型的计算机系统
G06N3/00	基于生物学模型的计算机系统
G06N3/02	.采用神经网络模型
G06N3/08	..学习方法