METHOD AND APPARATUS FOR COMPRESSING NEURAL NETWORK MODEL

    公开(公告)号:US20230177326A1

    公开(公告)日:2023-06-08

    申请号:US17968688

    申请日:2022-10-18

    CPC classification number: G06N3/08 G06N3/0454

    Abstract: A technical solution for compressing a neural network model which relates to the field of artificial intelligence technologies, such as deep learning technologies, cloud service technologies, is disclosed. The method for compressing a neural network model includes: acquiring a to-be-compressed neural network model; determining a first bit width, a second bit width and a target thinning rate corresponding to the to-be-compressed neural network model; obtaining a target value according to the first bit width, the second bit width and the target thinning rate; and compressing the to-be-compressed neural network model using the target value, the first bit width and the second bit width to obtain a compression result of the to-be-compressed neural network model.

Patent Agency Ranking