- Patent Title: Automated methods for conversions to a lower precision data format
-
Application No.: US15838273Application Date: 2017-12-11
-
Publication No.: US10997492B2Publication Date: 2021-05-04
- Inventor: Szymon Migacz , Hao Wu , Dilip Sequeira , Ujval Kapasi , Maxim Milakov , Slawomir Kierat , Zacky Zhou , Yilin Zhang , Alex Fit-Florea
- Applicant: NVIDIA Corporation
- Applicant Address: US CA Santa Clara
- Assignee: NVIDIA Corporation
- Current Assignee: NVIDIA Corporation
- Current Assignee Address: US CA Santa Clara
- Agency: Hogan Lovells US LLP
- Main IPC: G06N3/04
- IPC: G06N3/04 ; G06N3/08 ; G06N3/063 ; G06N3/02 ; G06N3/10 ; G06N7/00 ; G06T9/00

Abstract:
Aspects of the present invention are directed to computer-implemented techniques for performing data compression and conversion between data formats of varying degrees of precision, and more particularly for improving the inferencing (application) of artificial neural networks using a reduced precision (e.g., INT8) data format. Embodiments of the present invention generate candidate conversions of data output, then employ a relative measure of quality to identify the candidate conversion with the greatest accuracy (i.e., least divergence from the original higher precision values). The representation can be then be used during inference to perform computations on the resulting output data.
Public/Granted literature
- US20180211152A1 AUTOMATED METHODS FOR CONVERSIONS TO A LOWER PRECISION DATA FORMAT Public/Granted day:2018-07-26
Information query