-
公开(公告)号:US20210089301A1
公开(公告)日:2021-03-25
申请号:US16582406
申请日:2019-09-25
Applicant: Intel Corporation
Inventor: SUBRAMANIAM MAIYURAN , VARGHESE GEORGE , JOYDEEP RAY , ASHUTOSH GARG , JORGE PARRA , SHUBH SHAH , SHUBRA MARWAHA
Abstract: Embodiments described herein provide an apparatus comprising a plurality of processing resources including a first processing resource and a second processing resource, a shared local memory communicatively coupled to the first processing resource and the second processing resource, and a processor to receive an instruction to initiate a matrix multiplication operation, write a first set of matrix data into a first set of registers, and share the first set of matrix data between the first processing resource and the second processing resource for use in the matrix multiplication operation. Other embodiments may be described and claimed.
-
公开(公告)号:US20210073318A1
公开(公告)日:2021-03-11
申请号:US16561715
申请日:2019-09-05
Applicant: Intel Corporation
Inventor: SUBRAMANIAM MAIYURAN , MATHEW NEVIN , JORGE PARRA , ASHUTOSH GARG , SHUBRA MARWAHA , SHUBH SHAH
Abstract: An apparatus to facilitate acceleration of matrix multiplication operations. The apparatus comprises a systolic array including matrix multiplication hardware to perform multiply-add operations on received matrix data comprising data from a plurality of input matrices and sparse matrix acceleration hardware to detect zero values in the matrix data and perform one or more optimizations on the matrix data to reduce multiply-add operations to be performed by the matrix multiplication hardware.
-
13.
公开(公告)号:US20160189327A1
公开(公告)日:2016-06-30
申请号:US14583300
申请日:2014-12-26
Applicant: Intel Corporation
Inventor: SUBRAMANIAM MAIYURAN , SHUBH B. SHAH , ASHUTOSH GARG , JIN XU , THOMAS A. PIAZZA , JORGE F. GARCIA PABON , MICHAEL K. DWYER
CPC classification number: G06T1/20 , G06F7/00 , G06F9/3001 , G06F9/3016 , G06F9/30181 , G06T15/80 , G09G5/00 , G09G5/001 , G09G5/363 , G09G2330/021 , G09G2360/08
Abstract: Systems and methods may provide a graphics processor that may identify operating conditions under which certain floating point instructions may utilize power to fewer hardware resources compared to when the instructions are executing under other operating conditions. The operating conditions may be determined by examining operands used in a given instruction, including the relative magnitudes of the operands and whether the operands may be taken as equal to certain defined values. The floating point instructions may include instructions for an addition operation, a multiplication operation, a compare operation, and/or a fused multiply-add operation.
Abstract translation: 系统和方法可以提供图形处理器,其可以识别与在其他操作条件下执行指令时相比,某些浮点指令可以利用较少的硬件资源的功率的操作条件。 操作条件可以通过检查给定指令中使用的操作数来确定,包括操作数的相对大小以及操作数是否可以被视为等于某些定义的值。 浮点指令可以包括用于加法运算,乘法运算,比较运算和/或融合乘法运算的指令。
-
-