METHOD AND MACHINE LEARNING SYSTEM TO PERFORM QUANTIZATION OF NEURAL NETWORK

    公开(公告)号:US20230058500A1

    公开(公告)日:2023-02-23

    申请号:US17699255

    申请日:2022-03-21

    申请人: Blaize, Inc.

    IPC分类号: G06V10/776 G06V10/44

    摘要: The present disclosure relates to a system and method of performing quantization of a neural network having multiple layers. The method comprises receiving a floating-point dataset as input dataset and determining a first shift constant for first layer of the neural network based on the input dataset. The method also comprises performing quantization for the first layer using the determined shift constant of the first layer. The method further comprises determining a next shift constant for next layer of the neural network based on output of a layer previous to the next layer, and performing quantization for the next layer using the determined next shift constant. The method further comprises iterating the steps of determining shift constant and performing quantization for all layers of the neural network to generate fixed point dataset as output.

    Method and machine learning system to perform quantization of neural network

    公开(公告)号:US12100196B2

    公开(公告)日:2024-09-24

    申请号:US17699255

    申请日:2022-03-21

    申请人: Blaize, Inc.

    CPC分类号: G06V10/776 G06V10/454

    摘要: The present disclosure relates to a system and method of performing quantization of a neural network having multiple layers. The method comprises receiving a floating-point dataset as input dataset and determining a first shift constant for first layer of the neural network based on the input dataset. The method also comprises performing quantization for the first layer using the determined shift constant of the first layer. The method further comprises determining a next shift constant for next layer of the neural network based on output of a layer previous to the next layer, and performing quantization for the next layer using the determined next shift constant. The method further comprises iterating the steps of determining shift constant and performing quantization for all layers of the neural network to generate fixed point dataset as output.

    METHOD AND SYSTEM FOR GENERATING A MIXED PRECISION MODEL

    公开(公告)号:US20230281423A1

    公开(公告)日:2023-09-07

    申请号:US18072785

    申请日:2022-12-01

    申请人: Blaize, Inc.

    IPC分类号: G06N3/04 G06V10/82 G06V10/28

    CPC分类号: G06N3/04 G06V10/82 G06V10/28

    摘要: Disclosed herein is a method and a system for generating a mixed precision quantization model for performing image processing. The method comprises receiving a validation dataset of images to train a neural network model. The method comprises for each image of the validation dataset, generating a union sensitivity list, selecting a group of layers, generating a mixed precision quantization model by quantizing the selected group of layers into a high precision format; computing accuracy of the mixed precision quantization model for comparison with a target accuracy; in response to determining the accuracy is less than the target accuracy, generating another mixed precision model by selecting a next group of layers and computing the accuracy. In response to determining the accuracy is greater than or equal to the target accuracy, storing the mixed precision quantization model as a final mixed precision quantization model for image processing.