-
公开(公告)号:US20230297373A1
公开(公告)日:2023-09-21
申请号:US18307088
申请日:2023-04-26
Applicant: Intel Corporation
Inventor: SUBRAMANIAM MAIYURAN , GUEI-YUAN LUEH , SUPRATIM PAL , ASHUTOSH GARG , CHANDRA S. GURRAM , JORGE E. PARRA , JUNJIE GU , KONRAD TRIFUNOVIC , HONG BIN LIAO , MIKE B. MACPHERSON , SHUBH B. SHAH , SHUBRA MARWAHA , STEPHEN JUNKINS , TIMOTHY R. BAUER , VARGHESE GEORGE , WEIYU CHEN
CPC classification number: G06F9/3001 , G06F9/30145 , G06T1/20 , G06F9/3887 , G06F9/3802
Abstract: Embodiments described herein provided for an instruction and associated logic to enable GPGPU program code to access special purpose hardware logic to accelerate dot product operations. One embodiment provides for a graphics processing unit comprising a fetch unit to fetch a single instruction for execution, a decode unit to decode the single instruction into a decoded instruction, wherein the decoded instruction is to cause the graphics processing unit to perform a set of parallel dot product operations on elements of input matrices, and a systolic dot product unit to execute the decoded instruction across one or more parallel processor lanes using multiple systolic layers associated with multiple pipeline stages. The multiple pipeline stages include one or more sets of interconnected multipliers and adders to compute multiple concurrent dot products.
-
公开(公告)号:US20210303299A1
公开(公告)日:2021-09-30
申请号:US17304153
申请日:2021-06-15
Applicant: Intel Corporation
Inventor: SUBRAMANIAM MAIYURAN , GUEI-YUAN LUEH , SUPRATIM PAL , ASHUTOSH GARG , CHANDRA S. GURRAM , JORGE E. PARRA , JUNJIE GU , KONRAD TRIFUNOVIC , HONG BIN LIAO , MIKE B. MACPHERSON , SHUBH B. SHAH , SHUBRA MARWAHA , STEPHEN JUNKINS , TIMOTHY R. BAUER , VARGHESE GEORGE , WEIYU CHEN
Abstract: Embodiments described herein provided for an instruction and associated logic to enable GPGPU program code to access special purpose hardware logic to accelerate dot product operations. One embodiment provides for a graphics processing unit comprising a fetch unit to fetch an instruction for execution and a decode unit to decode the instruction into a decoded instruction. The decoded instruction is a matrix instruction to cause the graphics processing unit to perform a parallel dot product operation. The GPGPU also includes systolic dot product circuitry to execute the decoded instruction across one or more SIMD lanes using multiple systolic layers, wherein to execute the decoded instruction, a dot product computed at a first systolic layer is to be output to a second systolic layer, wherein each systolic layer includes one or more sets of interconnected multipliers and adders, each set of multipliers and adders to generate a dot product.
-
公开(公告)号:US20190324746A1
公开(公告)日:2019-10-24
申请号:US15957728
申请日:2018-04-19
Applicant: Intel Corporation
Inventor: SUBRAMANIAM MAIYURAN , GUEI-YUAN LUEH , SUPRATIM PAL , ASHUTOSH GARG , CHANDRA S. GURRAM , JORGE E. PARRA , JUNJIE GU , KONRAD TRIFUNOVIC , HONG BIN LIAO , MIKE B. MACPHERSON , SHUBH B. SHAH , SHUBRA MARWAHA , STEPHEN JUNKINS , TIMOTHY R. BAUER , VARGHESE GEORGE , WEIYU CHEN
Abstract: Embodiments described herein provided for an instruction and associated logic to enable GPGPU program code to access special purpose hardware logic to accelerate dot product operations. One embodiment provides for a graphics processing unit comprising a fetch unit to fetch an instruction for execution and a decode unit to decode the instruction into a decoded instruction. The decoded instruction is a matrix instruction to cause the graphics processing unit to perform a parallel dot product operation. The GPGPU also includes a systolic dot product unit to execute the decoded instruction across one or more SIMD lanes using multiple systolic layers, wherein to execute the decoded instruction, a dot product computed at a first systolic layer is to be output to a second systolic layer, wherein each systolic layer includes one or more sets of interconnected multipliers and adders, each set of multipliers and adders to generate a dot product.
-
公开(公告)号:US20180285262A1
公开(公告)日:2018-10-04
申请号:US15476918
申请日:2017-03-31
Applicant: INTEL CORPORATION
Inventor: ANNA TRIKALINOU , RAJESH M. SANKARAN , STEPHEN JUNKINS
IPC: G06F12/084 , G06F12/14 , G06F12/1045 , G06F12/1036 , G06F12/109 , G06F12/1072 , G06F12/02
Abstract: Various embodiments described herein include an input/output memory management unit (IOMMU) that can restrict write accesses originating from a shared virtual memory (SVM) device towards a CPU's code page, so that the SVM device cannot be used to attack and manipulate the CPU's behavior. In some embodiments, the IOMMU may perform a security check so that whenever a write request arrives from an SVM device, if the page requested is present and is executable, an access violation fault is generated and the request is terminated. In some such embodiments, this may prevent a malicious or vulnerable device from corrupting the CPU's process memory and causing an arbitrary code execution with the CPU process's privileges.
-
-
-