-
公开(公告)号:US20250053530A1
公开(公告)日:2025-02-13
申请号:US18749130
申请日:2024-06-20
Applicant: Intel Corporation
Inventor: Philip R. Lantz , Sanjay Kumar , Rajesh M. Sankaran , Saurabh Gayen
IPC: G06F13/364 , G06F9/50 , G06F13/24
Abstract: Embodiments of apparatuses, methods, and systems for highly scalable accelerators are described. In an embodiment, an apparatus includes an interface to receive a plurality of work requests from a plurality of clients and a plurality of engines to perform the plurality of work requests. The work requests are to be dispatched to the plurality of engines from a plurality of work queues. The work queues are to store a work descriptor per work request. Each work descriptor is to include all information needed to perform a corresponding work request.
-
公开(公告)号:US11734209B2
公开(公告)日:2023-08-22
申请号:US17550977
申请日:2021-12-14
Applicant: Intel Corporation
Inventor: Sanjay Kumar , Rajesh M. Sankaran , Philip R. Lantz , Utkarsh Y. Kakaiya , Kun Tian
CPC classification number: G06F13/24 , G06F9/45558 , G06F9/4812 , G06F2009/45579
Abstract: Implementations of the disclosure provide processing device comprising: an interrupt managing circuit to receive an interrupt message directed to an application container from an assignable interface (AI) of an input/output (I/O) device. The interrupt message comprises an address space identifier (ASID), an interrupt handle and a flag to distinguish the interrupt message from a direct memory access (DMA) message. Responsive to receiving the interrupt message, a data structure associated with the interrupt managing circuit is identified. An interrupt entry from the data structure is selected based on the interrupt handle. It is determined that the ASID associated with the interrupt message matches an ASID in the interrupt entry. Thereupon, an interrupt in the interrupt entry is forwarded to the application container.
-
公开(公告)号:US11650947B2
公开(公告)日:2023-05-16
申请号:US17410063
申请日:2021-08-24
Applicant: Intel Corporation
Inventor: Philip R. Lantz , Sanjay Kumar , Rajesh M. Sankaran , Saurabh Gayen
IPC: G06F13/364 , G06F13/24 , G06F9/50
CPC classification number: G06F13/364 , G06F9/5027 , G06F13/24
Abstract: Embodiments of apparatuses, methods, and systems for highly scalable accelerators are described. In an embodiment, an apparatus includes an interface to receive a plurality of work requests from a plurality of clients and a plurality of engines to perform the plurality of work requests. The work requests are to be dispatched to the plurality of engines from a plurality of work queues. The work queues are to store a work descriptor per work request. Each work descriptor is to include all information needed to perform a corresponding work request.
-
公开(公告)号:US20230032236A1
公开(公告)日:2023-02-02
申请号:US17875198
申请日:2022-07-27
Applicant: Intel Corporation
Inventor: Rajesh M. Sankaran , Philip R. Lantz , Narayan Ranganathan , Saurabh Gayen , Sanjay Kumar , Nikhil Rao , Dhananjay A. Joshi , Hai Ming Khor , Utkarsh Y. Kakaiya
IPC: G06F3/06
Abstract: Methods and apparatus relating to data streaming accelerators are described. In an embodiment, a hardware accelerator such as a Data Streaming Accelerator (DSA) logic circuitry provides high-performance data movement and/or data transformation for data to be transferred between a processor (having one or more processor cores) and a storage device. Other embodiments are also disclosed and claimed.
-
公开(公告)号:US11556363B2
公开(公告)日:2023-01-17
申请号:US16479395
申请日:2017-03-31
Applicant: Intel Corporation
Inventor: Sanjay Kumar , Philip R. Lantz , Kun Tian , Utkarsh Y. Kakaiya , Rajesh M. Sankaran
IPC: G06F9/455 , G06F9/30 , G06F12/1009
Abstract: Techniques for transferring virtual machines and resource management in a virtualized computing environment are described. In one embodiment, for example, an apparatus may include at least one memory, at least one processor, and logic for transferring a virtual machine (VM), at least a portion of the logic comprised in hardware coupled to the at least one memory and the at least one processor, the logic to generate a plurality of virtualized capability registers for a virtual device (VDEV) by virtualizing a plurality of device-specific capability registers of a physical device to be virtualized by the VM, the plurality of virtualized capability registers comprising a plurality of device-specific capabilities of the physical device, determine a version of the physical device to support via a virtual machine monitor (VMM), and expose a subset of the virtualized capability registers associated with the version to the VM. Other embodiments are described and claimed.
-
公开(公告)号:US11200183B2
公开(公告)日:2021-12-14
申请号:US16493148
申请日:2017-03-31
Applicant: INTEL CORPORATION
Inventor: Sanjay Kumar , Rajesh M. Sankaran , Philip R. Lantz , Utkarsh Y. Kakaiya , Kun Tian
Abstract: Implementations of the disclosure provide processing device comprising: an interrupt managing circuit to receive an interrupt message directed to an application container from an assignable interface (AI) of an input/output (I/O) device. The interrupt message comprises an address space identifier (ASID), an interrupt handle and a flag to distinguish the interrupt message from a direct memory access (DMA) message. Responsive to receiving the interrupt message, a data structure associated with the interrupt managing circuit is identified. An interrupt entry from the data structure is selected based on the interrupt handle. It is determined that the ASID associated with the interrupt message matches an ASID in the interrupt entry. Thereupon, an interrupt in the interrupt entry is forwarded to the application container.
-
27.
公开(公告)号:US11099880B2
公开(公告)日:2021-08-24
申请号:US16481441
申请日:2017-02-22
Applicant: INTEL CORPORATION
Inventor: Sanjay Kumar , Rajesh M. Sankaran , Gilbert Neiger , Philip R. Lantz , Jason W. Brandt , Vedvyas Shanbhogue , Utkarsh Y. Kakaiya , Kun Tian
IPC: G06F9/455 , G06F12/1045 , G06F12/109 , G06F9/30
Abstract: A processing device comprises an address translation circuit to intercept a work request from an I/O device. The work request comprises a first ASID to map to a work queue. A second ASID of a host is allocated for the first ASID based on the work queue. The second ASID is allocated to at least one of: an ASID register for a dedicated work queue (DWQ) or an ASID translation table for a shared work queue (SWQ). Responsive to receiving a work submission from the SVM client to the I/O device, the first ASID of the application container is translated to the second ASID of the host machine for submission to the I/O device using at least one of: the ASID register for the DWQ or the ASID translation table for the SWQ based on the work queue associated with the I/O device.
-
公开(公告)号:US11055147B2
公开(公告)日:2021-07-06
申请号:US16351396
申请日:2019-03-12
Applicant: Intel Corporation
Inventor: Utkarsh Y. Kakaiya , Rajesh Sankaran , Sanjay Kumar , Kun Tian , Philip Lantz
Abstract: Techniques for scalable virtualization of an Input/Output (I/O) device are described. An electronic device composes a virtual device comprising one or more assignable interface (AI) instances of a plurality of AI instances of a hosting function exposed by the I/O device. The electronic device emulates device resources of the I/O device via the virtual device. The electronic device intercepts a request from the guest pertaining to the virtual device, and determines whether the request from the guest is a fast-path operation to be passed directly to one of the one or more AI instances of the I/O device or a slow-path operation that is to be at least partially serviced via software executed by the electronic device. For a slow-path operation, the electronic device services the request at least partially via the software executed by the electronic device.
-
公开(公告)号:US10228981B2
公开(公告)日:2019-03-12
申请号:US15584979
申请日:2017-05-02
Applicant: Intel Corporation
Inventor: Utkarsh Y. Kakaiya , Rajesh Sankaran , Sanjay Kumar , Kun Tian , Philip Lantz
Abstract: Techniques for scalable virtualization of an Input/Output (I/O) device are described. An electronic device composes a virtual device comprising one or more assignable interface (AI) instances of a plurality of AI instances of a hosting function exposed by the I/O device. The electronic device emulates device resources of the I/O device via the virtual device. The electronic device intercepts a request from the guest pertaining to the virtual device, and determines whether the request from the guest is a fast-path operation to be passed directly to one of the one or more AI instances of the I/O device or a slow-path operation that is to be at least partially serviced via software executed by the electronic device. For a slow-path operation, the electronic device services the request at least partially via the software executed by the electronic device.
-
公开(公告)号:US09747208B2
公开(公告)日:2017-08-29
申请号:US15411658
申请日:2017-01-20
Applicant: Intel Corporation
Inventor: Sanjay Kumar , Rajesh M. Sankaran , Subramanya R. Dulloor , Andrew V. Anderson
IPC: G06F13/00 , G06F12/0804 , G06F11/14
CPC classification number: G06F12/0804 , G06F9/467 , G06F11/07 , G06F11/073 , G06F11/0778 , G06F11/0793 , G06F11/14 , G06F11/1482 , G06F12/0868 , G06F2201/805 , G06F2201/82 , G06F2212/1032 , G06F2212/608
Abstract: A processor includes a memory management unit and a front end including a decoder. The decoder includes logic to receive a flush-on-commit (FoC) instruction to flush dirty data from a volatile cache to a persistent memory upon commitment of a store associated with the FoC instruction. The memory management unit includes logic to, based upon a flush-on-fail (FoF) mode, skip execution of the flush-on-commit instruction and to flush the dirty data from the volatile cache upon a subsequent FoF operation.
-
-
-
-
-
-
-
-
-