NEURAL NETWORK METHOD AND APPARTUS WITH PARAMETER QUANTIZATION

    公开(公告)号:US20200026986A1

    公开(公告)日:2020-01-23

    申请号:US16282748

    申请日:2019-02-22

    Abstract: A neural network method of parameter quantization includes obtaining channel profile information for first parameter values of a floating-point type in each channel included in each of feature maps based on an input in a first dataset to a floating-point parameters pre-trained neural network; determining a probability density function (PDF) type, for each channel, appropriate for the channel profile information based on a classification network receiving the channel profile information as a dataset; determining a fixed-point representation, based on the determined PDF type, for each channel, statistically covering a distribution range of the first parameter values; and generating a fixed-point quantized neural network based on the fixed-point representation determined for each channel.

Patent Agency Ranking