Method and apparatus with neural network data quantizing

    公开(公告)号:US12106219B2

    公开(公告)日:2024-10-01

    申请号:US15931362

    申请日:2020-05-13

    CPC classification number: G06N3/084 G06N3/04 G06N3/0495

    Abstract: A neural network data quantizing method includes: obtaining local quantization data by firstly quantizing, based on a local maximum value for each output channel of a current layer of a neural network, global recovery data obtained by recovering output data of an operation of the current layer based on a global maximum value corresponding to a previous layer of the neural network; storing the local quantization data in a memory to perform an operation of a next layer of the neural network; obtaining global quantization data by secondarily quantizing, based on a global maximum value corresponding to the current layer, local recovery data obtained by recovering the local quantization data based on the local maximum value for each output channel of the current layer; and providing the global quantization data as input data for the operation of the next layer.

Patent Agency Ranking