Patent search ap:("NVIDIA CORPORATION") AND inv:"Ahmad Kiswani" Page 1

1.

发明授权
Tensor processing using low precision format 有权

公开(公告)号：US12299577B2

公开(公告)日：2025-05-13

申请号：US15624577

申请日：2017-06-15

Applicant: NVIDIA CORPORATION

Inventor： Boris Ginsburg , Sergei Nikolaev , Ahmad Kiswani , Hao Wu , Amir Gholaminejad , Slawomir Kierat , Michael Houston , Alex Fit-Florea

IPC: G06N3/084 , G06F17/16 , G06N3/04 , G06N3/045 , G06N3/088

Abstract: Aspects of the present invention are directed to computer-implemented techniques for improving the training of artificial neural networks using a reduced precision (e.g., float16) data format. Embodiments of the present invention rescale tensor values prior to performing matrix operations (such as matrix multiplication or matrix addition) to prevent overflow and underflow. To preserve accuracy throughout the performance of the matrix operations, the scale factors are defined using a novel data format to represent tensors, wherein a matrix is represented by the tuple X, where X=(a, v[.]), wherein a is a float scale factor and v[.] are scaled values stored in the float16 format. The value of any element X[i] according to this data format would be equal to a*v[i].

Patent Agency Ranking