COMPUTING DEVICE, COMPUTER SYSTEM, AND COMPUTING METHOD

    公开(公告)号:US20220147821A1

    公开(公告)日:2022-05-12

    申请号:US17344192

    申请日:2021-06-10

    Abstract: According to one embodiment, a processor is configured to calculate a calculation amount in inference time of a neural network, using a result of summing, with respect to a group to which quantization is applied, products of the number of product-sum operations and bit widths of weight for the product-sum operations in the neural network. Then, the processor is configured to optimize a value of the weight and a quantization step size to minimize the recognition error by the neural network based on the calculated calculation amount, and execute computing about the neural network based on the optimized weight and the quantization step size.

Patent Agency Ranking