METHOD AND APPARATUS FOR COMPRESSING WEIGHTS OF NEURAL NETWORK
Abstract:
A method of compressing weights of a neural network includes compressing a weight set including the weights of a the neural network, determining modified weight sets by changing at least one of the weights, calculating compression efficiency values for the determined modified weight sets based on a result of compressing the weight set and results of compressing the determined modified weight sets, determining a target weight of the weights satisfying a compression efficiency condition among the weights based on the calculated compression efficiency values, and determining a final compression result by compressing the weights based on a result of replacing the determined target weight.
Public/Granted literature
Information query
Patent Agency Ranking
0/0