Invention Grant
- Patent Title: Optimized compute hardware for machine learning operations
-
Application No.: US16983107Application Date: 2020-08-03
-
Publication No.: US11334796B2Publication Date: 2022-05-17
- Inventor: Dipankar Das , Roger Gramunt , Mikhail Smelyanskiy , Jesus Corbal , Dheevatsa Mudigere , Naveen K. Mellempudi , Alexander F. Heinecke
- Applicant: Intel Corporation
- Applicant Address: US CA Santa Clara
- Assignee: Intel Corporation
- Current Assignee: Intel Corporation
- Current Assignee Address: US CA Santa Clara
- Agency: Jaffery Watson Mendonsa & Hamilton LLP
- Main IPC: G06F17/16
- IPC: G06F17/16 ; G06F9/30 ; G06F9/38 ; G06F7/544 ; G06N3/08 ; G06N3/063 ; G06N3/04

Abstract:
A processing cluster of a processing cluster array comprises a plurality of registers to store input values of vector input operands, the input values of at least some of the vector input operands having different bit lengths than those of other input values of other vector input operands, and a compute unit to execute a dot-product instruction with the vector input operands to perform a number of parallel multiply operations and an accumulate operation per 32-bit lane based on a bit length of the smallest-sized input value of a first vector input operand relative to the 32-bit lane.
Public/Granted literature
- US20210019631A1 OPTIMIZED COMPUTE HARDWARE FOR MACHINE LEARNING OPERATIONS Public/Granted day:2021-01-21
Information query