Invention Publication
- Patent Title: APPARATUSES, METHODS, AND SYSTEMS FOR 8-BIT FLOATING-POINT MATRIX DOT PRODUCT INSTRUCTIONS
-
Application No.: EP24177816.6Application Date: 2021-09-14
-
Publication No.: EP4398097A2Publication Date: 2024-07-10
- Inventor: Mellempudi, Naveen , Heinecke, Alexander F. , Valentine, Robert , Charney, Mark J. , Hughes, Christopher J. , Georganas, Evangelos , Sperber, Zeev , Gradstein, Amit , Rubanovich, Simon
- Applicant: INTEL Corporation
- Applicant Address: US Santa Clara, CA 95054 2200 Mission College Blvd.
- Assignee: INTEL Corporation
- Current Assignee: INTEL Corporation
- Current Assignee Address: US Santa Clara, CA 95054 2200 Mission College Blvd.
- Agency: Samson & Partner Patentanwälte mbB
- Priority: US 2017134373 2020.12.26
- The original application number of the division: 21196474.7 2021.09.14
- Main IPC: G06F9/30
- IPC: G06F9/30
Abstract:
Systems, methods, and apparatuses relating to 8-bit floating-point matrix dot product instructions are described. For example, a processor comprises: a plurality of vector registers to store a plurality of packed data elements including 8-bit floating point data elements and 32-bit floating point data elements; decode circuitry to decode a single matrix multiplication instruction having fields to indicate an opcode and locations of an M by K first source matrix including a first plurality of the 8-bit floating point data elements, a K by N second source matrix including a second plurality of the 8-bit floating point data elements, and an M by N third source matrix having a plurality of 32-bit floating point data elements, each of the first and second plurality of 8-bit floating point data elements comprising a sign bit, a 5-bit exponent value, and a 2-bit mantissa value; and execution circuitry comprising matrix acceleration circuitry to accelerate matrix operations, wherein responsive to the single matrix multiplication instruction, the execution circuitry is to generate each 32-bit floating point result data element of a result matrix based on a corresponding row of the first plurality of 8-bit floating point data elements and a corresponding column of the second plurality of 8-bit floating point data elements, the execution circuitry to generate a respective plurality of products corresponding to the corresponding row of the first plurality of 8-bit floating point data elements and the corresponding column of the second plurality of 8-bit floating point data elements and to accumulate the plurality of products with a corresponding 32-bit floating point data element of the third source matrix to generate the 32-bit floating point result data element of the result matrix.
Public/Granted literature
- EP4398097A3 APPARATUSES, METHODS, AND SYSTEMS FOR 8-BIT FLOATING-POINT MATRIX DOT PRODUCT INSTRUCTIONS Public/Granted day:2024-09-04
Information query