Neural Network Reordering, Weight Compression, and Processing
Abstract:
A neural network is trained to generate feature maps and associated weights. Reordering is performed to generate a functionally equivalent network. The reordering may be performed to improve at least one of compression of the weights, load balancing, and execution. In one implementation, zero value weights are grouped, permitting them to be skipped during execution.
Information query
Patent Agency Ranking
0/0