METHOD AND APPARATUS WITH NEURAL NETWORK DATA PROCESSING
Abstract:
A processor-implemented neural network data processing method includes: determining a total number of either one of a first feature value and values less than or equal to the first feature value, in feature data output from a layer of a neural network; determining a quantization parameter based on the determined number; quantizing the feature data based on the determined quantization parameter; and inputting the quantized feature data to another layer of the neural network connected to the layer.
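The four steps recited above (count, derive a parameter, quantize, pass on) can be illustrated with a minimal sketch. The abstract does not state what the quantization parameter is or how it is derived from the count, so everything specific here is an assumption made for illustration: the parameter is taken to be a bit width, the rule in `choose_bit_width` (fewer bits when at least half of the values are at or below the first feature value) is hypothetical, the first feature value defaults to 0 as for a ReLU output, and the function names are invented.

```python
def count_at_or_below(features, first_value=0.0):
    """Count entries less than or equal to first_value (assumed 0.0 here)."""
    return sum(1 for x in features if x <= first_value)


def choose_bit_width(count, total, low_bits=4, high_bits=8, ratio_cutoff=0.5):
    """Hypothetical rule, not taken from the abstract: pick the smaller bit
    width when at least ratio_cutoff of the values are at or below the
    first feature value."""
    if total == 0:
        return high_bits
    return low_bits if count / total >= ratio_cutoff else high_bits


def quantize(features, bits):
    """Uniform unsigned quantization; returns (integer codes, scale).
    Values below zero are clamped to code 0."""
    levels = (1 << bits) - 1
    peak = max(features, default=0.0)
    if peak <= 0.0:
        return [0] * len(features), 1.0
    scale = peak / levels
    codes = [min(levels, max(0, round(x / scale))) for x in features]
    return codes, scale


def quantize_layer_output(features, first_value=0.0):
    """Count, derive the parameter, quantize; the result is what would be
    handed to the next connected layer."""
    count = count_at_or_below(features, first_value)
    bits = choose_bit_width(count, len(features))
    codes, scale = quantize(features, bits)
    return codes, scale, bits


# Example feature data as it might come out of a ReLU layer (made-up values).
layer_output = [0.0, 0.0, 0.0, 5.0, 15.0, 0.0]
codes, scale, bits = quantize_layer_output(layer_output)
```

In this sketch the next layer would receive `codes` together with `scale`, so that it can either operate on the integer codes directly or recover approximate values as `code * scale`.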