Neural network method and apparatus with parameter quantization

    公开(公告)号:US11836603B2

    公开(公告)日:2023-12-05

    申请号:US16282748

    申请日:2019-02-22

    CPC classification number: G06N3/047 G06N3/04 G06N3/08

    Abstract: A neural network method of parameter quantization obtains channel profile information for first parameter values of a floating-point type in each channel included in each of feature maps based on an input in a first dataset to a floating-point parameters pre-trained neural network, and determines a probability density function (PDF) type, for each channel, appropriate for the channel profile information based on a classification network receiving the channel profile information as a dataset. The neural network method of parameter quantization determines a fixed-point representation, based on the determined PDF type, for each channel, statistically covering a distribution range of the first parameter values, and generates a fixed-point quantized neural network based on the fixed-point representation determined for each channel.

Patent Agency Ranking