Deep convolutional neural network acceleration and compression method based on parameter quantification
Abstract:
An acceleration and compression method for a deep convolutional neural network based on quantization of a parameter provided by the present application comprises: quantizing the parameter of the deep convolutional neural network to obtain a plurality of subcode books and respective corresponding index values of the plurality of subcode books; acquiring an output feature map of the deep convolutional neural network according to the plurality of subcode books and respective corresponding index values of the plurality of subcode books. The present application may implement the acceleration and compression for a deep convolutional neural network.
Information query
Patent Agency Ranking
0/0