-
公开(公告)号:US20250123881A1
公开(公告)日:2025-04-17
申请号:US18927065
申请日:2024-10-25
Applicant: Intel Corporation
Inventor: Rajesh M. SANKARAN , Gilbert NEIGER , Narayan RANGANATHAN , Stephen R. VAN DOREN , Joseph NUZMAN , Niall D. MCDONNELL , Michael A. O'HANLON , Lokpraveen B. MOSUR , Tracy Garrett DRYSDALE , Eriko NURVITADHI , Asit K. MISHRA , Ganesh VENKATESH , Deborah T. MARR , Nicholas P. CARTER , Jonathan D. PEARCE , Edward T. GROCHOWSKI , Richard J. GRECO , Robert VALENTINE , Jesus CORBAL , Thomas D. FLETCHER , Dennis R. BRADFORD , Dwight P. MANLEY , Mark J. CHARNEY , Jeffry J. COOK , Paul CAPRIOLI , Koichi YAMADA , Kent D. GLOSSOP , David B. SHEFFIELD
Abstract: Embodiments of systems, methods, and apparatuses for heterogeneous computing are described. In some embodiments, a hardware heterogeneous scheduler dispatches instructions for execution on one or more plurality of heterogeneous processing elements, the instructions corresponding to a code fragment to be processed by the one or more of the plurality of heterogeneous processing elements, wherein the instructions are native instructions to at least one of the one or more of the plurality of heterogeneous processing elements.
-
2.
公开(公告)号:US20230042934A1
公开(公告)日:2023-02-09
申请号:US17560170
申请日:2021-12-22
Applicant: Intel Corporation
Inventor: Utkarsh Y. KAKAIYA , Philip LANTZ , Sanjay KUMAR , Rajesh SANKARAN , Narayan RANGANATHAN , Saurabh GAYEN , Dhananjay JOSHI , Nikhil P. RAO
IPC: G06F11/07
Abstract: Apparatus and method for high-performance page fault handling. For example, one embodiment of an apparatus comprises: one or more accelerator engines to process work descriptors submitted by clients to a plurality of work queues; fault processing hardware logic associated with the one or more accelerator engines, the fault processing hardware logic to implement a specified page fault handling mode for each work queue of the plurality of work queues, the page fault handling modes including a first page fault handling mode and a second page fault handling mode.
-
公开(公告)号:US20180089099A1
公开(公告)日:2018-03-29
申请号:US15280965
申请日:2016-09-29
Applicant: INTEL CORPORATION
Inventor: Ashok RAJ , Sivakumar RADHAKRISHNAN , Dan J. WILLIAMS , Vishal VERMA , Narayan RANGANATHAN , Chet R. DOUGLAS
IPC: G06F12/10
CPC classification number: G06F12/10 , G06F3/0608 , G06F3/0631 , G06F3/0644 , G06F3/0646 , G06F3/0647 , G06F3/065 , G06F3/0659 , G06F3/0683 , G06F12/0246 , G06F2212/1032 , G06F2212/152 , G06F2212/65
Abstract: In one embodiment, a block data transfer interface employing offload data transfer engine in accordance with the present description includes an offload data transfer engine executing a data transfer command set to transfer a block of data in a transfer data path from a source memory to a new region of a destination memory, wherein the transfer data path bypasses a central processing unit to minimize or reduce involvement of the central processing unit in the block transfer. In response to a successful transfer indication, a logical address is re-mapped to a physical address of the new region of the destination memory, instead of a physical address of the original region of the destination memory. In one embodiment, the re-mapping is performed by a central processing unit. In another embodiment, the re-mapping is performed by the offload data transfer engine. Other aspects are described herein.
-
公开(公告)号:US20180004687A1
公开(公告)日:2018-01-04
申请号:US15201373
申请日:2016-07-01
Applicant: Intel Corporation
Inventor: Francesc GUIM BERNAT , Karthik KUMAR , Thomas WILLHALM , Narayan RANGANATHAN , Pete D. VOGT
IPC: G06F13/16 , G06F13/42 , G06F13/40 , H04L29/08 , H04L12/803
Abstract: An extension of node architecture and proxy requests enables a node to expose memory computation capability to remote nodes. A remote node can request execution of an operation by a remote memory computation resource, and the remote memory computation resource can execute the request locally and return the results of the computation. The node includes processing resources, a fabric interface, and a memory subsystem including a memory computation resource. The local execution of the request by the memory computation resource can reduce latency and bandwidth concerns typical with remote requests.
-
公开(公告)号:US20230418655A1
公开(公告)日:2023-12-28
申请号:US18207870
申请日:2023-06-09
Applicant: Intel Corporation
Inventor: Rajesh M. SANKARAN , Gilbert NEIGER , Narayan RANGANATHAN , Stephen R. VAN DOREN , Joseph NUZMAN , Niall D. MCDONNELL , Michael A. O'HANLON , Lokpraveen B. MOSUR , Tracy Garrett DRYSDALE , Eriko NURVITADHI , Asit K. MISHRA , Ganesh VENKATESH , Deborah T. MARR , Nicholas P. CARTER , Jonathan D. PEARCE , Edward T. GROCHOWSKI , Richard J. GRECO , Robert VALENTINE , Jesus CORBAL , Thomas D. FLETCHER , Dennis R. BRADFORD , Dwight P. MANLEY , Mark J. CHARNEY , Jeffrey J. COOK , Paul CAPRIOLI , Koichi YAMADA , Kent D. GLOSSOP , David B. SHEFFIELD
CPC classification number: G06F9/48 , G06F9/3001 , G06F9/383 , G06F9/3004 , G06F9/30036
Abstract: Embodiments of systems, methods, and apparatuses for heterogeneous computing are described. In some embodiments, a hardware heterogeneous scheduler dispatches instructions for execution on one or more plurality of heterogeneous processing elements, the instructions corresponding to a code fragment to be processed by the one or more of the plurality of heterogeneous processing elements, wherein the instructions are native instructions to at least one of the one or more of the plurality of heterogeneous processing elements.
-
6.
公开(公告)号:US20230040226A1
公开(公告)日:2023-02-09
申请号:US17559612
申请日:2021-12-22
Applicant: INTEL CORPORATION
Inventor: Saurabh GAYEN , Dhananjay JOSHI , Philip LANTZ , Rajesh SANKARAN , Narayan RANGANATHAN
Abstract: Apparatus and method for managing pipeline depth of a data processing device. For example, one embodiment of an apparatus comprises: an interface to receive a plurality of work requests from a plurality of clients; and a plurality of engines to perform the plurality of work requests; wherein the work requests are to be dispatched to the plurality of engines from a plurality of work queues, the work queues to store a work descriptor per work request, each work descriptor to include information needed to perform a corresponding work request, wherein the plurality of work queues include a first work queue to store work descriptors associated with first latency characteristics and a second work queue to store work descriptors associated with second latency characteristics; engine configuration circuitry to configure a first engine to have a first pipeline depth based on the first latency characteristics and to configure a second engine to have a second pipeline depth based on the second latency characteristics.
-
公开(公告)号:US20220164218A1
公开(公告)日:2022-05-26
申请号:US17381521
申请日:2021-07-21
Applicant: Intel Corporation
Inventor: Rajesh M. SANKARAN , Gilbert NEIGER , Narayan RANGANATHAN , Stephen R. VAN DOREN , Joseph NUZMAN , Niall D. MCDONNELL , Michael A. O'HANLON , Lokpraveen B. MOSUR , Tracy Garrett DRYSDALE , Eriko NURVITADHI , Asit K. MISHRA , Ganesh VENKATESH , Deborah T. MARR , Nicholas P. CARTER , Jonathan D. PEARCE , Edward T. GROCHOWSKI , Richard J. GRECO , Robert VALENTINE , Jesus CORBAL , Thomas D. FLETCHER , Dennis R. BRADFORD , Dwight P. MANLEY , Mark J. CHARNEY , Jeffrey J. COOK , Paul CAPRIOLI , Koichi YAMADA , Kent D. GLOSSOP , David B. SHEFFIELD
Abstract: Embodiments of systems, methods, and apparatuses for heterogeneous computing are described. In some embodiments, a hardware heterogeneous scheduler dispatches instructions for execution on one or more plurality of heterogeneous processing elements, the instructions corresponding to a code fragment to be processed by the one or more of the plurality of heterogeneous processing elements, wherein the instructions are native instructions to at least one of the one or more of the plurality of heterogeneous processing elements.
-
8.
公开(公告)号:US20240281379A2
公开(公告)日:2024-08-22
申请号:US18058882
申请日:2022-11-28
Applicant: Intel Corporation
Inventor: Narayan RANGANATHAN , Vamsi SRIPATHI
IPC: G06F12/0862 , G06F12/0811
CPC classification number: G06F12/0862 , G06F12/0811
Abstract: Examples relate to a non-transitory, computer-readable medium, method, and computer system for prefetching data during executing of a computer program. The non-transitory, computer-readable medium comprises machine-readable instructions that, when the program code is executed on a processor, causes the processor to pre-fetch to a processor cache, using data transfer offloading circuitry of the processor, data being accessed by an application program from a main memory of the computer system, and to execute the computer program using the pre-fetched data that is stored in the processor cache.
-
9.
公开(公告)号:US20240143507A1
公开(公告)日:2024-05-02
申请号:US18058882
申请日:2022-11-28
Applicant: Intel Corporation
Inventor: Narayan RANGANATHAN , Vamsi SRIPATHI
IPC: G06F12/0862 , G06F12/0811
CPC classification number: G06F12/0862 , G06F12/0811
Abstract: Examples relate to a non-transitory, computer-readable medium, method, and computer system for prefetching data during executing of a computer program. The non-transitory, computer-readable medium comprises machine-readable instructions that, when the program code is executed on a processor, causes the processor to pre-fetch to a processor cache, using data transfer offloading circuitry of the processor, data being accessed by an application program from a main memory of the computer system, and to execute the computer program using the pre-fetched data that is stored in the processor cache.
-
10.
公开(公告)号:US20180165144A1
公开(公告)日:2018-06-14
申请号:US15372734
申请日:2016-12-08
Applicant: INTEL CORPORATION
Inventor: Ashok RAJ , Narayan RANGANATHAN , Mohan J. KUMAR , Vincent J. ZIMMER
CPC classification number: G06F11/0787 , G06F9/3016 , G06F11/0766
Abstract: A processor includes an instruction decoder to receive an instruction to perform a machine check operation, the instruction having a first operand and a second operand. The processor further includes a machine check logic coupled to the instruction decoder to determine that the instruction is to determine a type of a machine check bank based on a command value stored in a first storage location indicated by the first operand, to determine a type of a machine check bank identified by a machine check bank identifier (ID) stored in a second storage location indicated by the second operand, and to store the determined type of the machine check bank in the first storage location indicated by the first operand.
-
-
-
-
-
-
-
-
-