Neural network accelerator and operating method thereof

    Publication number: US11960986B2

    Publication date: 2024-04-16

    Application number: US17944454

    Filing date: 2022-09-14

    CPC classification number: G06N3/063 G06F9/30145 G06F9/5027

    Abstract: A neural network accelerator includes an operator that calculates a first operation result based on a first tiled input feature map and first tiled filter data, a quantizer that generates a quantization result by quantizing the first operation result based on a second bit width that is extended compared with a first bit width of the first tiled input feature map, a compressor that generates a partial sum by compressing the quantization result, and a decompressor that generates a second operation result by decompressing the partial sum. The operator calculates a third operation result based on a second tiled input feature map, second tiled filter data, and the second operation result, and an output feature map is generated based on the third operation result.

Neural network accelerator and operating method thereof

    Publication number: US11475285B2

    Publication date: 2022-10-18

    Application number: US16751503

    Filing date: 2020-01-24

    Abstract: A neural network accelerator includes an operator that calculates a first operation result based on a first tiled input feature map and first tiled filter data, a quantizer that generates a quantization result by quantizing the first operation result based on a second bit width that is extended compared with a first bit width of the first tiled input feature map, a compressor that generates a partial sum by compressing the quantization result, and a decompressor that generates a second operation result by decompressing the partial sum. The operator calculates a third operation result based on a second tiled input feature map, second tiled filter data, and the second operation result, and an output feature map is generated based on the third operation result.
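The data flow in the abstracts above (operate on a tile, quantize at an extended bit width, compress the partial sum, decompress it, and accumulate it into the next tile) can be illustrated with a minimal Python sketch. This is not the patented implementation. The abstract does not specify the bit widths, the quantization rule, or the compression scheme, so the 8-bit and 16-bit widths, the saturating quantizer, the zero run-length coding, and all function names below are assumptions chosen only for illustration.

```python
# Illustrative sketch only. Bit widths, saturation, and zero run-length
# coding are assumptions; the abstract does not specify any of them.
IN_BITS = 8    # assumed first bit width (tiled input feature map)
EXT_BITS = 16  # assumed second, extended bit width for partial sums


def operate(ifm_tile, filt_tile, prev=None):
    """Operator: one multiply-accumulate per filter row, plus a prior partial sum if given."""
    out = [sum(x * w for x, w in zip(ifm_tile, row)) for row in filt_tile]
    if prev is not None:
        out = [o + p for o, p in zip(out, prev)]
    return out


def quantize(values, bits=EXT_BITS):
    """Quantizer (assumed rule): saturate to the signed range of the extended bit width."""
    lo, hi = -(1 << (bits - 1)), (1 << (bits - 1)) - 1
    return [max(lo, min(hi, v)) for v in values]


def compress(values):
    """Compressor (assumed scheme): encode as (zero_run, value) pairs."""
    pairs, run = [], 0
    for v in values:
        if v == 0:
            run += 1
        else:
            pairs.append((run, v))
            run = 0
    if run:
        pairs.append((run, None))  # trailing zeros with no following value
    return pairs


def decompress(pairs):
    """Decompressor: exact inverse of compress()."""
    out = []
    for run, v in pairs:
        out.extend([0] * run)
        if v is not None:
            out.append(v)
    return out


def run_tiles(ifm_tiles, filt_tiles):
    """Process tiles in order, carrying a compressed partial sum between them."""
    stored = None  # compressed partial sum held between tiles
    result = []
    last = len(ifm_tiles) - 1
    for i, (ifm_tile, filt_tile) in enumerate(zip(ifm_tiles, filt_tiles)):
        prev = decompress(stored) if stored is not None else None
        result = operate(ifm_tile, filt_tile, prev)
        if i < last:
            stored = compress(quantize(result))
    return result  # the output feature map is derived from this final result


# Two tiles split one 4-element input and two 4-tap filters in half.
output = run_tiles(
    [[1, 2], [3, 4]],
    [[[1, 0], [0, 0]], [[1, 1], [0, 0]]],
)
print(output)
```

In this sketch the first tile yields the partial sum `[1, 0]`, which is stored in compressed form. The second tile adds its own products to the decompressed partial sum, so the result equals the untiled dot products of `[1, 2, 3, 4]` with the full filters `[1, 0, 1, 1]` and `[0, 0, 0, 0]`.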
