Non-uniform quantization of pre-trained deep neural network

    公开(公告)号:US11710043B2

    公开(公告)日:2023-07-25

    申请号:US17744601

    申请日:2022-05-13

    CPC classification number: G06N3/082 G06N3/04

    Abstract: A system and a method of quantizing a pre-trained neural network, includes determining by a layer/channel bit-width determiner for each layer or channel of the pre-trained neural network a minimum quantization noise for the layer or the channel for each master bit-width value in a predetermined set of master bit-width values; and selecting by a bit-width selector for the layer or the channel the master bit-width value having the minimum quantization noise for the layer or the channel. In one embodiment, the minimum quantization noise for the layer or the channel is based on a square of a range of weights for the layer or the channel that is multiplied by a constant to a negative power of a current master bit-width value.

    Non-uniform quantization of pre-trained deep neural network

    公开(公告)号:US11348009B2

    公开(公告)日:2022-05-31

    申请号:US16181326

    申请日:2018-11-05

    Abstract: A system and a method of quantizing a pre-trained neural network, includes determining by a layer/channel bit-width determiner for each layer or channel of the pre-trained neural network a minimum quantization noise for the layer or the channel for each master bit-width value in a predetermined set of master bit-width values; and selecting by a bit-width selector for the layer or the channel the master bit-width value having the minimum quantization noise for the layer or the channel. In one embodiment, the minimum quantization noise for the layer or the channel is based on a square of a range of weights for the layer or the channel that is multiplied by a constant to a negative power of a current master bit-width value.

Patent Agency Ranking