-
公开(公告)号:US20230351542A1
公开(公告)日:2023-11-02
申请号:US18306033
申请日:2023-04-24
Applicant: Intel Corporation
Inventor: Naveen K. MELLEMPUDI , DHEEVATSA MUDIGERE , DIPANKAR DAS , SRINIVAS SRIDHARAN
IPC: G06T1/20 , G06F5/01 , G06F7/501 , G06F7/523 , G06F7/544 , G06F17/15 , G06F17/16 , G06N3/063 , G06N3/084 , G06N3/044 , G06N3/045
CPC classification number: G06T1/20 , G06F5/01 , G06F7/501 , G06F7/523 , G06F7/5443 , G06F17/153 , G06F17/16 , G06N3/063 , G06N3/084 , G06N3/044 , G06N3/045 , G06F2207/382 , G06F2207/4824
Abstract: One embodiment provides for a graphics processing unit to perform computations associated with a neural network, the graphics processing unit comprising a hardware processing unit having a dynamic precision fixed-point unit that is configurable to convert elements of a floating-point tensor to convert the floating-point tensor into a fixed-point tensor.
-
公开(公告)号:US20200265545A1
公开(公告)日:2020-08-20
申请号:US16853405
申请日:2020-04-20
Applicant: Intel Corporation
Inventor: Naveen MELLEMPUDI , DHEEVATSA MUDIGERE , DIPANKAR DAS , SRINIVAS SRIDHARAN
IPC: G06T1/20 , G06N3/08 , G06N3/04 , G06F7/544 , G06F17/15 , G06F7/501 , G06F5/01 , G06F7/523 , G06F17/16 , G06N3/063
Abstract: One embodiment provides for a graphics processing unit to perform computations associated with a neural network, the graphics processing unit comprising compute unit including a hardware logic unit having dynamic precision fixed-point logic, the compute unit to receive a set of dynamic fixed-point tensors, compute, via the dynamic precision fixed-point logic, a right-shift value using an absolute maximum value within the set of dynamic fixed-point tensors and a dynamic range of the set of dynamic fixed-point tensors, right-shift data values within the set of dynamic fixed-point tensors based on the right-shift value, increment a shared exponent associated with the set of dynamic fixed-point tensors based on the right-shift value, perform a compute operation on the set of dynamic fixed-point tensors, and generate an output tensor via the compute operation on the set of dynamic fixed-point tensors.
-
公开(公告)号:US20240160931A1
公开(公告)日:2024-05-16
申请号:US18532795
申请日:2023-12-07
Applicant: Intel Corporation
Inventor: Abhisek KUNDU , NAVEEN MELLEMPUDI , DHEEVATSA MUDIGERE , Dipankar DAS
CPC classification number: G06N3/08 , G06F9/46 , G06N3/044 , G06N3/045 , G06N3/063 , G06N3/084 , G06N5/04 , G06T15/005 , G06T17/20
Abstract: One embodiment provides for a computer-readable medium storing instructions that cause one or more processors to perform operations comprising determining a per-layer scale factor to apply to tensor data associated with layers of a neural network model and converting the tensor data to converted tensor data. The tensor data may be converted from a floating point datatype to a second datatype that is an 8-bit datatype. The instructions further cause the one or more processors to generate an output tensor based on the converted tensor data and the per-layer scale factor.
-
公开(公告)号:US20230087364A1
公开(公告)日:2023-03-23
申请号:US18060414
申请日:2022-11-30
Applicant: Intel Corporation
Inventor: Abhisek KUNDU , NAVEEN MELLEMPUDI , DHEEVATSA MUDIGERE , Dipankar DAS
Abstract: One embodiment provides for a computer-readable medium storing instructions that cause one or more processors to perform operations comprising determining a per-layer scale factor to apply to tensor data associated with layers of a neural network model and converting the tensor data to converted tensor data. The tensor data may be converted from a floating point datatype to a second datatype that is an 8-bit datatype. The instructions further cause the one or more processors to generate an output tensor based on the converted tensor data and the per-layer scale factor.
-
公开(公告)号:US20240412318A1
公开(公告)日:2024-12-12
申请号:US18751799
申请日:2024-06-24
Applicant: Intel Corporation
Inventor: Naveen K. MELLEMPUDI , DHEEVATSA MUDIGERE , DIPANKAR DAS , SRINIVAS SRIDHARAN
IPC: G06T1/20 , G06F5/01 , G06F7/501 , G06F7/523 , G06F7/544 , G06F17/15 , G06F17/16 , G06N3/044 , G06N3/045 , G06N3/063 , G06N3/084
Abstract: One embodiment provides for a graphics processing unit to perform computations associated with a neural network, the graphics processing unit comprising a hardware processing unit having a dynamic precision fixed-point unit that is configurable to convert elements of a floating-point tensor to convert the floating-point tensor into a fixed-point tensor.
-
公开(公告)号:US20180322386A1
公开(公告)日:2018-11-08
申请号:US15869502
申请日:2018-01-12
Applicant: Intel Corporation
Inventor: SRINIVAS SRIDHARAN , DHEEVATSA MUDIGERE
CPC classification number: G06N3/08 , G06F9/54 , G06N3/04 , G06N3/063 , G06N3/084 , G06T15/005 , G06T15/04 , G06T15/80 , G06T17/10 , G06T17/20
Abstract: One embodiment provides for a system to configure distributed training of a neural network. The system includes memory to store a library to facilitate transmission of data during distributed training of the neural network; a network interface to transmit and receive gradient data associated with the trainable parameters; a general-purpose processor to execute instructions provided by the library, the instructions to cause the general-purpose processor to configure the network interface to transmit and receive the gradient data associated with the trainable parameters during a workflow of a machine learning framework; and a graphics processor to perform compute operations associated with machine learning framework workflow to generate the gradient data associated with the trainable parameters, wherein, based on the machine learning framework workflow, the library is to interleave the compute operations on the graphics processor with transmission and receipt of gradient data via the network interface.
-
公开(公告)号:US20220327656A1
公开(公告)日:2022-10-13
申请号:US17730364
申请日:2022-04-27
Applicant: Intel Corporation
Inventor: Naveen K. MELLEMPUDI , DHEEVATSA MUDIGERE , DIPANKAR DAS , SRINIVAS SRIDHARAN
IPC: G06T1/20 , G06F5/01 , G06F7/501 , G06F7/523 , G06F7/544 , G06F17/15 , G06F17/16 , G06N3/04 , G06N3/063 , G06N3/08
Abstract: One embodiment provides for a graphics processing unit to perform computations associated with a neural network, the graphics processing unit comprising a hardware processing unit having a dynamic precision fixed-point unit that is configurable to quantize elements of a floating-point tensor to convert the floating-point tensor into a dynamic fixed-point tensor.
-
公开(公告)号:US20210110508A1
公开(公告)日:2021-04-15
申请号:US17083588
申请日:2020-10-29
Applicant: Intel Corporation
Inventor: Naveen MELLEMPUDI , DHEEVATSA MUDIGERE , DIPANKAR DAS , SRINIVAS SRIDHARAN
IPC: G06T1/20 , G06N3/063 , G06F17/16 , G06F7/523 , G06F5/01 , G06F7/501 , G06F17/15 , G06N3/04 , G06F7/544 , G06N3/08
Abstract: One embodiment provides for a graphics processing unit to perform computations associated with a neural network, the graphics processing unit comprising compute unit including a hardware logic unit having dynamic precision fixed-point logic, the compute unit to receive a set of dynamic fixed-point tensors, compute, via the dynamic precision fixed-point logic, a right-shift value using an absolute maximum value within the set of dynamic fixed-point tensors and a dynamic range of the set of dynamic fixed-point tensors, right-shift data values within the set of dynamic fixed-point tensors based on the right-shift value, increment a shared exponent associated with the set of dynamic fixed-point tensors based on the right-shift value, perform a compute operation on the set of dynamic fixed-point tensors, and generate an output tensor via the compute operation on the set of dynamic fixed-point tensors.
-
公开(公告)号:US20180322607A1
公开(公告)日:2018-11-08
申请号:US15881991
申请日:2018-01-29
Applicant: Intel Corporation
Inventor: Naveen MELLEMPUDI , DHEEVATSA MUDIGERE , DIPANKAR DAS , SRINIVAS SRIDHARAN
CPC classification number: G06T1/20 , G06F5/01 , G06F7/501 , G06F7/523 , G06F17/153 , G06F17/16 , G06N3/063
Abstract: One embodiment provides for a graphics processing unit to perform computations associated with a neural network, the graphics processing unit comprising compute unit including a hardware logic unit having dynamic precision fixed-point logic; a decode unit to decode an instruction for execution by the compute unit, the instruction to cause the compute unit to perform a matrix arithmetic operation on a set of dynamic fixed-point tensors; and a dynamic precision manager to dynamically adjust the precision of a compute operation performed by the compute unit during the matrix arithmetic operation, the dynamic precision manager to adjust the precision of the compute operation to prevent an arithmetic overflow.
-
公开(公告)号:US20180314940A1
公开(公告)日:2018-11-01
申请号:US15869515
申请日:2018-01-12
Applicant: Intel Corporation
Inventor: Abhisek KUNDU , NAVEEN MELLEMPUDI , DHEEVATSA MUDIGERE , Dipankar DAS
Abstract: One embodiment provides for a computing device comprising a parallel processor compute unit to perform a set of parallel integer compute operations; a ternarization unit including a weight ternarization circuit and an activation quantization circuit; wherein the weight ternarization circuit is to convert a weight tensor from a floating-point representation to a ternary representation including a ternary weight and a scale factor; wherein the activation quantization circuit is to convert an activation tensor from a floating-point representation to an integer representation; and wherein the parallel processor compute unit includes one or more circuits to perform the set of parallel integer compute operations on the ternary representation of the weight tensor and the integer representation of the activation tensor.
-
-
-
-
-
-
-
-
-