-
公开(公告)号:US20230162010A1
公开(公告)日:2023-05-25
申请号:US17532572
申请日:2021-11-22
Applicant: Google LLC
Inventor: Azalia Mirhoseini , Safeen Huda , Martin Christoph Maas , Paras Jagdish Jain , Jeffrey Adgate Dean
CPC classification number: G06N3/063 , G06F15/8046 , G06F11/3409 , G06F11/3062 , G06F11/3024
Abstract: Systems and methods are provided for designing approximate, low-power deep learning accelerator chips that have little to no accuracy loss when executing a deep learning model. A set of approximate systolic arrays may be generated. The performance of each approximate systolic array in the set of approximate systolic arrays processing a deep neural network (DNN) may be determined. Each layer in the DNN may be mapped to an approximate systolic array in the set of approximate systolic arrays. A subset of the set of approximate systolic arrays may be selected for inclusion in the inference chip design based on the mapping and the performance of each approximate systolic array in the set of approximate systolic arrays.