Invention Publication
- Patent Title: APPARATUSES, METHODS, AND SYSTEMS FOR 8-BIT FLOATING-POINT MATRIX DOT PRODUCT INSTRUCTIONS
- Application No.: EP23195872.9
- Application Date: 2021-09-14
- Publication No.: EP4276608A3
- Publication Date: 2024-01-10
- Inventor: Mellempudi, Naveen; Heinecke, Alexander F.; Valentine, Robert; Charney, Mark J.; Hughes, Christopher J.; Georganas, Evangelos; Sperber, Zeev; Gradstein, Amit; Rubanovich, Simon
- Applicant: Intel Corporation
- Applicant Address: 2200 Mission College Boulevard, Santa Clara, CA 95054, US
- Agency: Samson & Partner Patentanwälte mbB
- Priority: US202017134373 (2020-12-26)
- Main IPC: G06F9/30
- IPC: G06F9/30
Abstract:
Systems, methods, and apparatuses relating to 8-bit floating-point matrix dot product instructions are described. For example, a processing unit comprises circuitry to perform operations corresponding to an instruction, the instruction to specify a first matrix having M rows by 4*K columns of 8-bit floating-point data elements, a second matrix having 4*K rows by N columns of 8-bit floating-point data elements, and a third matrix having M rows by N columns of 32-bit single-precision floating-point data elements. The operations include, for each row m of the M rows of the first matrix and for each column n of the N columns of the second matrix: converting the 4*K 8-bit floating-point data elements of row m of the first matrix to 4*K corresponding floating-point data elements having a higher precision than an 8-bit floating-point data element, and converting the 4*K 8-bit floating-point data elements of column n of the second matrix to 4*K corresponding floating-point data elements having a higher precision than an 8-bit floating-point data element; multiplying the 4*K higher-precision data elements corresponding to row m of the first matrix with the corresponding ones of the 4*K higher-precision data elements corresponding to column n of the second matrix to generate 4*K products; accumulating the 4*K products with the 32-bit single-precision floating-point data element at row m and column n of the third matrix to generate a result 32-bit single-precision floating-point data element; and storing the result 32-bit single-precision floating-point data element at row m and column n of the third matrix.
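The operations in the abstract can be sketched in software as follows. This is a minimal illustration, not the claimed circuitry, and it rests on several assumptions that the abstract does not state: the 8-bit elements are taken to use an E5M2 layout (1 sign, 5 exponent, 2 fraction bits, i.e. the upper byte of an IEEE 754 half-precision value), the products are added to the accumulator one at a time in index order, and the accumulator is rounded to single precision after every addition. The function names `fp8_e5m2_to_float`, `to_f32`, and `fp8_matrix_dot_product` are hypothetical.

```python
import struct


def fp8_e5m2_to_float(byte):
    """Decode one 8-bit element to a wider float.

    ASSUMPTION: E5M2 layout, decoded by treating the byte as the upper
    byte of an IEEE 754 half-precision value (lower byte zero).
    """
    return struct.unpack("<e", bytes([0, byte & 0xFF]))[0]


def to_f32(x):
    """Round a Python float (double) to the nearest single-precision value."""
    return struct.unpack("<f", struct.pack("<f", x))[0]


def fp8_matrix_dot_product(a, b, c):
    """Update c[m][n] += dot(row m of a, column n of b), in place.

    a: M rows by 4*K columns of 8-bit encoded elements (ints 0..255)
    b: 4*K rows by N columns of 8-bit encoded elements (ints 0..255)
    c: M rows by N columns of single-precision accumulators (floats)
    """
    depth = len(a[0])  # the shared dimension, 4*K
    for m in range(len(a)):
        for n in range(len(b[0])):
            # Convert row m of a and column n of b to higher precision.
            row = [fp8_e5m2_to_float(a[m][k]) for k in range(depth)]
            col = [fp8_e5m2_to_float(b[k][n]) for k in range(depth)]
            # Multiply corresponding elements to form the 4*K products.
            products = [x * y for x, y in zip(row, col)]
            # Accumulate into the existing element of c.
            # ASSUMPTION: sequential order, rounding to float32 each step.
            acc = to_f32(c[m][n])
            for p in products:
                acc = to_f32(acc + p)
            # Store the result back at row m, column n.
            c[m][n] = acc
    return c
```

Under the assumed E5M2 encoding, the byte `0x3C` decodes to 1.0 and `0x40` to 2.0, so small matrices can be written out by hand to check the sketch. A hardware implementation may order or round the accumulation differently, which can change the low-order bits of the result.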
Public/Granted literature
- EP4276608A2 APPARATUSES, METHODS, AND SYSTEMS FOR 8-BIT FLOATING-POINT MATRIX DOT PRODUCT INSTRUCTIONS, Public/Granted day: 2023-11-15